Spatial logics are used for describing and reasoning about any kind of geometrical entities and 
relations, broadly construed. In the past decade, such logics have attracted attention in such diverse 
fields as Artificial Intelligence, Computer Science, Mathematics, Philosophy and Physics. This 
handbook creates, for the first time, a systematic account of the resulting field. Following a general 
introduction, there are fourteen chapters by invited authors, with self-contained overviews, the 
principal insights and methods to date, and the central open problems. 

The result is a comprehensive survey of this rapidly expanding subject. 
Recently there has been a vigorous development in logics of space, stimulated by new applications in 
Computer Science and Artificial Intelligence, and by new attitudes to Logic which these disciplines 
have helped to foster. The Handbook will be a landmark publication, defining the field, and providing a 
superb resource for students and researchers, specialists and visitors alike. 

SAMSON ABRAMSKY, Strachey Professor of Computing, Oxford University 


Logic has been the language of choice for the study of Space ever since Euclid's Elements. This alliance 
has deeply influenced the way we do science. Contemporary Logic - with its new tools - continues this 
tradition, while adding dynamic and information-oriented themes to the mix. This Handbook covers 
the extensive new developments in spatial reasoning, and it may revolutionize the field. 

SERGEI N. ARTEMOV, Professor of Computer Science and Mathematics, 
The geometry and dynamics of space have motivated much mathematics, but logic has lagged behind. 
This prodigious volume changes that picture significantly. It provides a wealth of material of wide 
interest, from physics to philosophy. Many chapters are substantial monographs in their own right. The 
book will stimulate further research deepening our understanding of the world around us. 

ROBERT GOLDBLATT, Professor of Mathematics, Victoria University of Wellington 


This book redraws boundaries in an imaginative way. Its common theme is how to represent spatial 
arguments within limited formalisms. Motivations for this range from image processing to foundations 
of physics, and from computational complexity to whole-part structures. Besides presenting many things 
that deserve to be better known, the book invites us to find the logical viewpoint that unifies them. 
WILFRID HODGES, School of Mathematical Sciences, Queen Mary College, London 
Logics of time have been developed over many decades. Logics of space appear to be much harder, and 
this book contains several new attempts which promise applications to Artificial Intelligence, especially 
to robotic manipulation of solid objects. 

JOHN McCARTHY, Department of Computer Science, Stanford University 


The precise relationship between spatial and logical structure has been an outstanding issue in Logic, 
Philosophy and Mathematics since the time of Immanuel Kant. This Handbook provides an up-to-date 
and accessible introduction to the current state of the art, in the 
light of recent work in Logic and Computer Science. 
Preface 


Space, with its manifold layers of structure, has been an inexhaustible source 
of intellectual fascination since Antiquity. The science that began with the 
empirical discoveries of the Egyptian ‘rope-stretchers’, and that has inspired 
many of the greatest developments in mathematics over the centuries, now 
comprises such topics as spatial databases, automated geometrical reasoning 
and digital image processing. In this long intellectual history, however, one 
relatively recent, yet crucial, event stands out: the rise of the logical stance 
in geometry. Fundamental to this development is the analysis of geometrical 
structures in relation to the formal languages used to describe them, and the 
recognition of the special mathematical challenges—and opportunities—which 
such an analysis presents. The interplay between logic and geometry is the 
subject of this book. 

By a spatial logic, we mean any formal language for describing geomet- 
rical entities and configurations, where ‘geometrical’ is understood in a broad 
sense. Unlike their well-studied temporal counterparts, spatial logics have been 
curiously neglected in the literature on mathematical logic, despite some early 
pioneering work by Tarski and others on the foundations of geometry and topol- 
ogy in the middle years of the previous century. Only in the last decade have 
spatial logics attracted renewed attention from logicians, partly as a response 
to work in such diverse fields as artificial intelligence, database theory, physics 
and philosophy. 

Today, there is a fast-growing body of literature on spatial logics, with mo- 
tivations ranging from computational issues to the foundations of knowledge 
and information. But, for the newcomer to the field, this diversity of influences 
and approaches constitutes something of a mixed blessing: the field may be in a 
state of rapid development; but there is as yet no common research agenda, and 
no common vocabulary to allow ideas to be shared across disciplines. The aim 
of this book is to provide a resource which presents a view of the best current 
work in different communities worldwide, and which makes a first attempt at 
systematic linkage. We hope this will stimulate the development of spatial logic 
itself, but beyond this narrower purpose, we also hope to have provided a text 
that should be of value to non-logicians with an interest in formal theories of 
space. 

The book consists of a general introduction followed by fourteen invited 
contributions on various topics in spatial logic, with authors representing the 
major active centres in the field. Each of these chapters provides a self-contained 
overview of its topic, describing the principal results obtained to date, explaining 
the methods used to obtain them, and listing the most important open problems. 
Every contributed chapter has one or more “second readers”—experts in the 
field, who worked with the authors and editors to help ensure a comprehensive 
(and comprehensible) account of the topic in question. 

The book is intended as a technical resource for academic researchers and 
graduate students. Familiarity with basic undergraduate-level logic, topology 
and geometry is generally assumed. Very roughly, the criterion of accessibil- 
ity we have worked to is that a good graduate student interested in the area 
should, by working through any of these chapters, be able to acquire a firm 
understanding of the current state-of-the-art in that topic within the space of 
a few weeks. Jointly, these chapters provide—to the extent that this is ever 
possible in a rapidly evolving discipline—a comprehensive survey of the field 
of spatial logic as it stands today. 


MARCO AIELLO 
IAN PRATT-HARTMANN 
JOHAN VAN BENTHEM 
Chapter 1 


WHAT IS SPATIAL LOGIC? 


Marco Aiello 


University of Groningen 


Ian Pratt-Hartmann 
University of Manchester 


Johan van Benthem 
University of Amsterdam & Stanford University 


By a spatial logic, we understand any formal language interpreted over a class 
of structures featuring geometrical entities and relations, broadly construed. 
The formal language in question may employ any logical syntax: that of first- 
order logic, or some fragment of first-order logic, or perhaps higher-order logic. 
The structures over which it is interpreted may inhabit any class of geometrical 
‘spaces’: topological spaces, affine spaces, metric spaces, or perhaps a single 
space such as the projective plane or Euclidean 3-space. And the non-logical 
primitives of the language may be interpreted as any geometrical properties 
or relations defined over the relevant domains: topological connectedness of 
regions, parallelism of lines, or perhaps equidistance of two points from a 
third. What all these logics have in common is that the operative notion of 
validity depends on the underlying geometry of the structures over which their 
distinctively spatial primitives are interpreted. Spatial logic, then, is simply the 
study of the family of spatial logics, so conceived. 

An analogy will help elucidate this rather austere-looking definition. From 
our stance, spatial logic parallels the more established area of temporal logic. 
A temporal logic is a formal language interpreted over some class of structures 
based on frameworks of temporal relations, broadly construed. The language 
in question, though usually some modal fragment of first- or higher-order logic, 
may in principle employ any logical syntax; the objects over which that syntax 
is interpreted may include points, paths, or extended intervals over any variety 
of partial orders; and the assumed partial order ultimately provides the 
interpretation for the distinctively temporal primitives of the formal language. What 
all temporal logics have in common, whether point- or interval-based, is that 
their operative notion of validity depends on the assumed properties of the un- 
derlying temporal flow. And what gives them their enduring appeal is the way 
in which the formal languages they employ balance expressive power against 
computational complexity. In this respect, temporal logic is the computation- 
ally motivated study of time. 

Let us set the scene for the treatment of spatial logic in this book by examining 
some of the historical trends that have given rise to it. Classical geometry, the 
cultural model of deductive proof par excellence since Euclid’s Elements, was 
finally analyzed in full mathematical precision in Hilbert’s Grundlagen der 
Geometrie (Hilbert, 1909; see also Hilbert, 1950), when all its axioms, and 
possible variations on them, had become clear. Yet, despite its starkly abstract 
view of points, lines and planes, the Grundlagen is still couched not in a formal 
language, but rather in (lightly mathematicized) idiomatic German. Hilbert’s 
Axiom of Parallels provides a good example: 

Let a be a line, and A a point not on a. Then, in the plane determined by a and A, 
there is at most one line which passes through A and does not meet a. (tr. from 
Hilbert, 1909, p. 20) 


No attempt is made to tease out the implicit logical syntax of this language, 
or to analyze the underlying inference engine much beyond what Euclid had 
already done in his Common Notions. This is perhaps clearest in the case of 
Hilbert’s final Axiom of Completeness: 

The elements (points, lines, planes) of the geometry form a system of objects 
which is not capable of any extension, subject to maintenance of all the preceding 
axioms. That is to say: it is not possible to add to the system of points, lines and 
planes another system of objects in such a way that, in the combined system, all 
[previous] axioms are satisfied. (Ibid., p. 22.) 


It was not until after the development of the apparatus of formal logic and model- 
theoretic semantics in the first half of the Twentieth Century that logicians were 
able to probe the precise inferential and expressive resources of geometry, in 
a second round of formalization culminating in Tarski’s Elementary Geometry 
(Tarski, 1959). 

Tarski’s decisive contribution in his 1959 paper was not simply to force 
Hilbert’s axioms into the regimented syntax of some formal language, but 
rather, to investigate what happens when that syntax is restricted. Specifi- 
cally, Tarski employs a first-order logic, with variables ranging over points in 
the Euclidean plane, and with non-logical predicates standing for two primitive 
spatial relations: a ternary relation of ‘betweenness’ and a quaternary relation 
of ‘equidistance’. The resulting language is sufficiently expressive to formu- 
late much of Euclidean geometry—for example, Pythagoras’ theorem, or the 
existence of the nine-point Feuerbach circle. The computational reward for 
this loss of expressive power is considerable. Tarski showed that the theory of 
elementary geometry is decidable: there is a mechanical procedure to deter- 
mine, of any given sentence in the relevant language, whether that sentence is 
true under the advertised interpretation. By contrast, the second-order theory 
needed to express all of Hilbert’s axioms is undecidable. 
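
To make the two primitives concrete, the following is a minimal sketch, not Tarski's decision procedure: it merely evaluates betweenness and equidistance at given points with rational co-ordinates, and uses them to define one derived notion. The function names (`between`, `equidistant`, `is_midpoint`) and the co-ordinate representation are assumptions made for illustration only.

```python
from fractions import Fraction


def sq_dist(p, q):
    """Squared Euclidean distance between two points of the plane."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2


def between(a, b, c):
    """Ternary betweenness: b lies on the closed segment from a to c."""
    # b - a must be parallel to c - a (zero cross product) ...
    cross = (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0])
    # ... and b - a must not point away from c - b (non-negative dot product).
    dot = (b[0] - a[0]) * (c[0] - b[0]) + (b[1] - a[1]) * (c[1] - b[1])
    return cross == 0 and dot >= 0


def equidistant(a, b, c, d):
    """Quaternary equidistance: a is as far from b as c is from d."""
    return sq_dist(a, b) == sq_dist(c, d)


def is_midpoint(a, m, c):
    """A derived notion expressed purely with the two primitives."""
    return between(a, m, c) and equidistant(a, m, m, c)


a, c = (Fraction(0), Fraction(0)), (Fraction(1), Fraction(3))
m = (Fraction(1, 2), Fraction(3, 2))
print(is_midpoint(a, m, c))
```

Exact rational arithmetic is used so that the equality tests are not disturbed by rounding; deciding quantified sentences of the language is a much harder task that this sketch does not attempt.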

Tarski’s discovery illustrates the most distinctive feature of logic in the wake 
of the model-theoretic revolution of the previous century: its fundamentally lin- 
guistic orientation. The model-theoretic approach to logic takes as its central 
concern the often intricate relationship between mathematical structures and 
languages which describe them. On this view, spatial logic, as defined above, 
becomes the study of the relationship between geometrical structures and the 
spatial languages which describe them. It is this preoccupation with language 
which divides spatial logic from geometry as traditionally conceived. More 
recently, of course, the enterprise of automating logical deduction using elec- 
tronic computers has necessitated new levels of precision and sophistication 
in reasoning about the properties of formal languages and their relationship to 
their subject matter. In this setting, the issue of balancing the expressive power 
of a language against the computational complexity of performing deductions 
within it occupies centre-stage. 

We can broaden our perspective by considering two further examples of spa- 
tial logics in addition to Tarski’s Elementary Geometry. To motivate our second 
example, recall that, in Elementary Geometry, all variables are taken to range 
over points in the Euclidean plane. This allows for quantification over geometri- 
cal figures defined by a fixed number of points, such as line segments, triangles, 
circles, and so on, but not over spatial constellations defined by point-sets of 
arbitrary finite size, such as polygons, let alone those defined by infinite sets of 
points, such as, for example, arbitrary connected regions. The question there- 
fore arises as to what happens when these restrictions are lifted. In fact, Tarski 
himself had already investigated such a language in his Geometry of Solids 
(Tarski, 1956). This system employs the syntax of second-order logic, with the 
object variables ranging not over points, but instead over certain ‘regions’ in 
three-dimensional Euclidean space (hence, the set-variables range over sets of 
regions). The regions in question—Tarski called them solids—are the regular 
closed subsets of ℝ³, namely, those subsets of ℝ³ equal to the closure of their 
interior. Tarski’s language features two non-logical predicates: one standing 
for the binary relation of parthood, the other for the unary property of being 
spherical. Again, Tarski establishes a remarkable fact about the relationship 
between the formal language and the structure it is interpreted over: the resulting 
theory can be axiomatized completely (in a second-order sense), and moreover 
is categorical: all models of this theory are isomorphic to the standard inter- 
pretation on the reals. This sort of axiomatization in very powerful logical 
languages has found many successors, e.g., in qualitative axiomatizations of 
physics. 

For our third example of a spatial logic, we turn to topology. While Euclidean 
geometry is associated with rigid transformations like translations, rotations, 
and inversions, the mathematicians creating topology in the early decades of 
the 20th Century focused on much coarser transformations deforming 
shapes up to tearing and knotting. Subsequent to its invention, topology, too, 
became an object of logical study, and yet again, Tarski’s work proved seminal. 
Tarski observed that topology has small decidable fragments which could be 
brought to light by treating the topological interior operation as a modal oper- 
ator (McKinsey and Tarski, 1944). The connection to the other spatial logics 
discussed above becomes apparent if we subject McKinsey and Tarski’s original 
modal language to some essentially cosmetic reformulation. The variables of 
this language are taken to range over arbitrary subsets of any fixed topological 
space. These variables may be combined to form complex terms by means of 
function-symbols denoting various set-theoretic operations (union, intersection 
and complement), and topological operations (interior and closure); such terms 
denote subsets of the topological space over which they are interpreted. With 
terms constructed in this way, the language then features equality as its only 
predicate. Here we have extreme poverty of expressive resources: primitive 
function-symbols expressing only set-theoretic and topological operations, no 
non-logical predicates, and no quantifiers. But there is again a computational 
reward: the satisfiability problem for this logic is decidable in polynomial space. 
While too inexpressive to represent much of topology, this language has had 
profound repercussions in other areas, in particular in the universal algebra of 
Boolean algebras with added operators, and much contemporary modal logic. 
Furthermore, it has also been the inspiration for much recent work on topo- 
logical logics, many of them equipped with more elaborate syntax and richer 
topological primitives, as the reader of this book will soon discover. 
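
The reformulated language can be illustrated on a small finite topological space. The sketch below is an illustration under stated assumptions: the class `FiniteSpace`, the three-point example space and the brute-force check are all hypothetical, and enumerating subsets of one space is not the polynomial-space satisfiability procedure mentioned above. It evaluates terms built from intersection, complement, interior and closure, and checks a few familiar equations between such terms.

```python
from itertools import combinations


def powerset(points):
    """All subsets of a finite set, as frozensets."""
    pts = sorted(points)
    return [frozenset(c) for r in range(len(pts) + 1)
            for c in combinations(pts, r)]


class FiniteSpace:
    """A finite topological space given by an explicit list of open sets."""

    def __init__(self, points, opens):
        self.points = frozenset(points)
        self.opens = [frozenset(o) for o in opens]

    def complement(self, a):
        return self.points - a

    def interior(self, a):
        # The interior is the union of all open sets contained in a.
        result = frozenset()
        for o in self.opens:
            if o <= a:
                result |= o
        return result

    def closure(self, a):
        # Closure is the dual of interior.
        return self.complement(self.interior(self.complement(a)))


def interior_laws_hold(space):
    """Check some equations between terms for every choice of subsets."""
    subsets = powerset(space.points)
    return space.interior(space.points) == space.points and all(
        space.interior(a & b) == space.interior(a) & space.interior(b)
        and space.interior(space.interior(a)) == space.interior(a)
        and space.interior(a) <= a
        for a in subsets for b in subsets
    )


# Hypothetical example: three points, with {1} and {3} open.
space = FiniteSpace({1, 2, 3}, [set(), {1}, {3}, {1, 3}, {1, 2, 3}])
print(interior_laws_hold(space))

# Regular closed sets: those equal to the closure of their interior.
regular_closed = [a for a in powerset(space.points)
                  if space.closure(space.interior(a)) == a]
```

The same class also makes it easy to list the regular closed sets of a finite space, which is the notion of region used in the Geometry of Solids example, here transplanted to a toy setting.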

With these examples to guide us, let us return to the abstract characterization 
of spatial logic with which we began. Spatial logics arise by making a number 
of design choices, along three principal dimensions. The first concerns the col- 
lection of geometrical entities which make up our interpretations: points, lines, 
regions (of various kinds), and so on. In Tarski’s (plane) Elementary Geometry, 
variables range over the collection of points in the Euclidean plane; likewise, 
in his Geometry of Solids, variables range over the collection of regular closed 
subsets of ℝ³; and in his modal topological language, variables range over 
the collection of all subsets of some topological space. The second principal 
dimension concerns the choice of primitive relations and operations over these 
entities to interpret the non-logical primitives of our language. This choice of 
primitives of course reflects the level of spatial structure the particular logic 
is concerned with—metric, affine, projective, or topological; but even within 
these broad divisions, there is room for almost endless variation. The third 
principal dimension concerns the purely logical resources at our disposal. We have 
already seen that these can be set at many levels: from weak ‘constraint’ lan- 
guages through to richer first-order languages or even higher-order formalisms 
which include the resources of set theory. Needless to say, none of the choices 
along these principal dimensions is intrinsically right or wrong: they simply 
parametrize the family of available spatial logics. 

Classification of geometrical languages in terms of the range of the spa- 
tial primitives they feature of course recalls the long-standing classification of 
‘geometries’, broadly conceived, given by Klein’s Erlanger Programm (Klein, 
1893b; see also Klein, 1893a). And indeed the most sophisticated accounts of 
expressive power of such languages today are couched in terms of invariance 
relations between models (isomorphism, bisimulation, and the like), much in 
the same spirit. However, the logical approach opens up many new possibilities 
in this regard, such as, for instance, a new sort of invariance between topologi- 
cal patterns, much coarser even than topological homeomorphism, viz. modal 
bisimulation. This is topology taken to the extreme, but there are interesting 
interpretations in terms of model comparison games—a style of thinking which 
might have appealed to the founders of geometry, given Brouwer’s early use of 
games in defining the notion of topological dimension (Brouwer, 1913, p. 148). 

Once we have fixed a spatial logic, four salient issues present themselves. 
First, how can we characterize its valid formulas? Second, what is its expres- 
sive power? Third, what is its computational complexity? And fourth, what 
alternative interpretations does it have? We briefly consider each of these in 
turn. The first issue is so familiar as to require little explanation. Given a formal 
language interpreted over a certain class of geometrical structures, it typically 
makes sense to ask (depending on details of syntax) which sentences of that 
language are true in all structures of that class. Mostly, these characterizations 
are couched in the form of a list of axioms and (finitary) rules-of-inference. 
However, there are cases where additional machinery is required, for example, 
where the set of validities is not recursively enumerable, or where explicit proof 
systems are required to provide geometrical ‘constructions’ in Euclid’s sense. 

Second, we have already noted that current treatments of expressive power 
in logic are derived from the geometrical notions of invariance relations across 
models, setting the level of semantic resolution beyond which the given lan- 
guage cannot probe. Examples of such invariance relations are potential iso- 
morphism for first-order logic, or bisimulation for modal languages; but there 
are many more. Within given models, such relations specialize to notions of 
automorphism or internal bisimulation—a viewpoint which is actually some- 
what closer to the mathematician’s usual way of thinking about ‘symmetries’ 
of a spatial structure. Weyl at one point observed that point tuples in Euclidean 
space which are related by an automorphism must satisfy the same geometrical 
formulas, and raised the converse question of whether sharing the same 
properties in some given logical language implies automorphism invariance (Weyl, 
1949, p. 73). Indeed, invariance is not just descriptive weakness, but also the 
source of information flow across situations! Logical model theory has a host 
of sophisticated results concerning invariance. In particular, invariance 
relations can be fine-tuned in terms of games, such as Ehrenfeucht-Fraïssé games 
matching first-order logic. Given a notion of invariance, the model theory of 
definability can start, and indeed, many results about expressive power of spatial 
languages can be found in the chapters to follow. 

Third, complexity-theoretic analyses of logical systems typically focus on 
two problems: model-checking (determining whether a given formula is true in 
a given interpretation) and satisfiability checking (determining whether a given 
formula is true in some interpretation or other). Model-checking has been little- 
explored in the context of spatial logics; satisfiability checking, by contrast, 
has received much more attention. Most first- (or higher-) order spatial logics 
interpreted over familiar spatial domains are undecidable; therefore, this issue is 
obviously of greatest interest when dealing with spatial logics with more limited 
expressive power. A striking example is provided by spatial logics interpreted 
over the regular closed sets of arbitrary topological spaces whose language 
involves just Boolean connectives (no quantifiers) and whose spatial primitives 
represent various topological relations and functions. The satisfiability problem 
for such logics is generally decidable, and its complexity has been determined 
for a range of cases. In this light, spatial logics actually do pose an interesting 
challenge which is not yet well-understood. The general methodology in logic 
design has been to find expressive yet decidable formalisms, cleverly steering 
a middle course between the opposing evils of expressive poverty and unde- 
cidability. However, methods of analysis which work with general models 
are often powerless when confronted with languages interpreted over specific 
structures, as is generally the case with spatial logics. Sometimes, the spatial 
models over which one is working themselves support decidability for rich 
languages—witness again Elementary Geometry, where it is the structure of 
Euclidean space that drives the quantifier elimination procedure establishing 
decidability. We are still far from understanding the precise balance between 
all these triggers of higher or lower complexity in spatial logics. 

Fourth, and most speculatively, we have the issue of alternative interpreta- 
tions. Tarski’s Geometry of Solids possesses, as we have seen, just one model 
up to isomorphism, but most spatial logics have many models. To some extent, 
this is just the expression of a familiar phenomenon in logic, and mathemat- 
ics generally. Some theories, such as group theory or the theory of affine 
spaces, are designed to have many models, and the more of these there are, the 
greater their range of applicability. Other theories were intended to describe 
one particular structure, such as the natural numbers, Euclidean space, or most 
imperialistically of all, the set-theoretic universe. Geometry provided early 
examples of how theories originally conceived as characterizations of specific 
structures could turn out to have alternative models. This issue is brought to the 
fore in the subject of spatial logic, where the formal systems under investigation 
expressly invite the search for alternative interpretations and thus alternative 
ways of conceptualizing space. Even bolder views were ventured by Beth in 
the 1950s, who claimed that it was geometry’s move from one unique Space to 
a plurality of ‘spaces’ that underlay the system-based methodology of modern 
science and the fall of Aristotelian a priori dogmatism (Beth, 1959, Sec. 21). Be 
that as it may, the present editors agree that spatial logic can have philosophical 
repercussions beyond its narrower technical confines. 

More prosaically, much of the renewed interest in spatial logic in recent years 
has come from computer science. We identify three examples of this trend. The 
first comes from artificial intelligence, where attempts have recently been made 
to develop logics of qualitative spatial reasoning. The motivation is as follows: 
numerical co-ordinate descriptions of the objects which surround us are hard to 
acquire, inherently error-prone, and probably unnecessary for most everyday 
tasks we want to perform (or want a machine to perform); therefore—so goes 
the argument—reasoning with purely qualitative descriptions of those objects’ 
spatial configurations is closer to human reasoning and thus will lead to more 
efficient and effective AI. But which qualitative spatial terms, exactly, should 
we reason with? Ready-made tools from geometry or topology will not do: we 
have to devise new logics for ourselves. Many of these logics are discussed in 
this book. 

The second example comes from the theory of spatial databases. In com- 
puter applications, spatial data is frequently stored in the form of polygons (or 
polyhedra)—in effect, sets of points definable by Boolean combinations of lin- 
ear inequalities. These sets can be finitely represented, and their well-behaved 
character makes them particularly amenable to computer processing. But in 
fact there is no need to set our expressive sights so low; for polygons and 
polyhedra are a special case of the more general class of semi-algebraic sets, 
that is, those sets of points definable by Boolean combinations of polynomial 
inequalities. Within mathematics, semi-algebraic sets form the basis for real 
algebraic geometry; within computer science, they have given rise to the dis- 
cipline of constraint databases. In a constraint database, spatial data is stored 
in the form of first-order formulas in the language of fields. The key fact here 
is the quantifier-elimination theorem for the theory of the reals. This result 
allows constraint databases to be accessed effectively using queries which are 
likewise written as first-order formulas over an appropriate vocabulary. The 
relevant chapter in this book explores some of the intricate logical issues that 
arise from this approach to spatial data. 
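
As a rough illustration of the idea that spatial data can be stored as a formula rather than as a list of points, the sketch below represents regions as Boolean combinations of polynomial inequalities and answers point-membership queries against them. The helper names (`atom`, `conj`, `disj`, `neg`) and the example regions are assumptions made here; the sketch performs no quantifier elimination and is not the query mechanism of any actual constraint database system.

```python
from fractions import Fraction


def atom(poly):
    """Atomic constraint: the polynomial poly(x, y) is non-negative."""
    return lambda x, y: poly(x, y) >= 0


def conj(*regions):
    """Intersection of regions (logical 'and' of their formulas)."""
    return lambda x, y: all(r(x, y) for r in regions)


def disj(*regions):
    """Union of regions (logical 'or' of their formulas)."""
    return lambda x, y: any(r(x, y) for r in regions)


def neg(region):
    """Complement of a region (logical 'not' of its formula)."""
    return lambda x, y: not region(x, y)


# A polygon: the triangle x >= 0, y >= 0, x + y <= 1 (linear inequalities).
triangle = conj(
    atom(lambda x, y: x),
    atom(lambda x, y: y),
    atom(lambda x, y: 1 - x - y),
)

# A semi-algebraic set that is not a polygon: the closed unit disc.
disc = atom(lambda x, y: 1 - x * x - y * y)

# A Boolean combination: the part of the disc lying outside the triangle.
disc_minus_triangle = conj(disc, neg(triangle))

print(disc_minus_triangle(Fraction(-1, 2), Fraction(0)))
```

Each region is a finite object (a formula), even though it denotes an infinite set of points; this is the sense in which such sets are finitely represented.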
Our third example comes from image processing, where it is convenient to 
describe objects as sets of vectors that can be ‘added’ (taking all linear sums) 
or ‘subtracted’ (taking all linear differences). By variously combining these 
‘Minkowski operations’, certain useful processing tasks can be performed, as, 
for example, when one set of vectors, representing an ‘eraser’, is used to ‘clean 
up’ the boundary of another, representing a perceived object. Mathematical 
morphology is a theory of subsets of vector spaces with the two operations 
of addition and subtraction at its core; the properties of these operations are 
generalized in abstract algebraic and category-theoretic ways. Looking at space 
in this way brings to light a surprising amount of new structure. This theory was 
not developed within mathematical logic; but the relevant chapter in this book 
will show how logical patterns do arise, involving both modal and first-order 
languages, while the calculus of valid principles shows surprising analogies with 
logical systems proposed in recent decades for very different purposes, such as 
linear logics of computational resource management. Again, we see how new 
choices of spatial objects and spatial structures lead to new mathematics—and 
there is no reason to think that this creative process has yet run dry. 
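
The 'eraser' example can be sketched on finite sets of integer vectors. The code below is a minimal illustration under assumptions of our own: images are finite sets of pixel co-ordinates, 'subtraction' is taken in the standard morphological sense of erosion (all translations that fit the eraser inside the image), and the names `minkowski_sum`, `minkowski_difference` and `opening` are chosen here for illustration.

```python
def minkowski_sum(a, b):
    """All sums p + q with p in a and q in b (dilation of a by b)."""
    return {(x + u, y + v) for (x, y) in a for (u, v) in b}


def minkowski_difference(a, b):
    """All vectors t such that the translate t + b lies inside a (erosion)."""
    if not b:
        raise ValueError("the structuring set b must be non-empty")
    # Any valid t sends a fixed element of b into a, so it suffices to
    # test the candidates p - q0 for p in a.
    u0, v0 = next(iter(b))
    candidates = {(x - u0, y - v0) for (x, y) in a}
    return {t for t in candidates
            if all((t[0] + u, t[1] + v) in a for (u, v) in b)}


def opening(a, b):
    """Subtract the eraser b, then add it back: removes details smaller than b."""
    return minkowski_sum(minkowski_difference(a, b), b)


# A 3x3 block of pixels plus one stray pixel of noise.
block = {(x, y) for x in range(3) for y in range(3)}
image = block | {(5, 0)}

# The 'eraser': a 2x2 block.
eraser = {(0, 0), (1, 0), (0, 1), (1, 1)}

cleaned = opening(image, eraser)
print(cleaned == block)
```

Combining the two operations in this order is one simple instance of the processing tasks described above: the stray pixel is too small to contain any translate of the eraser, so it does not survive the subtraction step.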

Finally, let us remove a possible misunderstanding, again taking a cue from 
the history of geometry. Our presentation may have made it look as if there is 
a vast collection of different spatial logics, each a world unto itself in terms of 
objects, primitive relations, and logical strength. But one of the most striking 
discoveries in the foundations of geometry in the 19th Century, prominently 
displayed in Hilbert’s Grundlagen, was the fact that very different-looking 
theories can turn out to be related at a deeper level of analysis. Inspiring 
examples are the embeddings of non-Euclidean spaces into Euclidean ones 
given by Klein and Poincaré. Likewise, spatial logics show inter-connections 
which may be brought out by various means: semantic model transformations, 
direct linguistic translations, and so on. Even though little is known about 
the precise links between most known systems, we emphasize this point as a 
reassuring thought about the coherence of the field. 

This concludes the editors’ thoughts about the general setting for this book, 
while providing a way of positioning specific chapters. But of course, the real 
content is in the chapters themselves, which do much more than fit editorial 
preconceptions. Each tells a story about a particular approach to spatial logic. 
The chapters have been arranged in the following thread, though they can be 
read in other orders as well. 

We start in Tarski’s geometrical spirit, with first-order languages. In Ch. 2, 
Pratt-Hartmann considers first-order topological languages interpreted over 
low-dimensional Euclidean spaces, applying techniques from logical model 
theory to analyze expressive power and axiomatizability. In Ch. 3, Bennett 
and Düntsch study both first-order and weaker modal topological languages
over a large class of topological spaces, emphasizing basic decidable
structures of wide use in AI and beyond. Renz and Nebel take this even further
in Ch. 4, with syntactically highly restricted constraint languages for spatial 
structures, allowing for great computational efficiency. 

From fragments of first-order languages, there is a natural transition to modal 
logics for topology, continuing the tradition started by Tarski and others in the 
1930s. Ch. 5 by van Benthem and Bezhanishvili tells the story of modern 
modal approaches to topology (and a few other spatial structures), emphasizing 
the main axiomatic and semantic techniques developed in modern modal logic. 
This theme is then continued in Ch. 6 by Parikh, Moss and Steinsvold, who 
explore the other logical tradition of thinking about topology, viz. as an account 
of information structure. Next, Ch. 7 by Balbiani, Goranko, Kellerman and 
Vakarelov takes the modal viewpoint to the study of affine and metric geometry, 
moving up to first-order languages where needed. In particular, completeness 
theorems turn out to be related to the basic geometrical issue of coordinatization. 
Finally, Ch. 8 by Vickers takes the epistemic view of topology to the higher 
mathematical level of topos theory, merging spatial logic and epistemic logic 
with category theory and type theory. 

Just as in science generally, so too in spatial logic, space enters into natural 
combinations with other fundamental notions. One obvious case is the combi- 
nation of space and time, which is unavoidable in many practical computational 
settings, and of course, also, in the foundations of physics. Ch. 9 by Kontchakov, 
Kurucz, Wolter and Zakharyaschev studies temporal logics with added affine 
and metric modalities, using sophisticated techniques from current research on 
the complexity of combined modal logics. A special case of this type of com- 
bination is found in Ch. 10 by Kremer and Mints, who add a dynamic temporal 
operator of one-step system evolution to modal logics of topology, and show 
that this simple move provides significant results like the Poincaré recurrence 
theorem. Finally, Ch. 11 by Andréka, Madarász and Németi goes far beyond 
simple modal languages of space-time, and develops both the special and the 
general theory of relativity on a first-order basis, continuing Tarski’s program 
for geometry to obtain striking new foundational results which are at the same 
time conceptually enlightening. 

The next group of chapters represent a counterpoint to the ‘logical’ inves- 
tigations so far, reporting further mathematical and computational advances. 
Ch. 12 by Smyth and Webster explores the extent to which topological ideas 
can be developed in discrete spaces, moving closer to the discrete topolo- 
gies used in modern mathematics, pattern recognition, and image processing. 
Ch. 13 by Geerts and Kuijpers describes the use of algebraic constraints for 
spatial databases to describe regions in Euclidean space, reminding us of the 
great tradition of analytic geometry which also underlies the coordinatizations 
employed by Tarski, and by several authors in our book. Ch. 14 by Bloch, 
Heijmans and Ronse develops the theory of mathematical morphology, both
on concrete vector spaces and in algebraic abstraction, and introducing, at the 
end, logical formalisms based on them. 

Beyond these technical subjects, our book still has a coda. We have indi- 
cated already that spatial logic also has a broader conceptual aspect. Ch. 15 
by Varzi is an extensive discussion of spatial structure in the philosophical tra- 
dition, both ancient and modern, using logical tools to develop philosophical 
conceptions. 

Despite the wealth of topics in our fifteen chapters, this book also set itself 
definite limits. First, we have not even exhausted the mathematical connec- 
tion, witness the long-standing historical interest in ‘diagrammatic reasoning’ 
spawned by Euclid’s Elements, and reinforced by modern research on graphi- 
cal representation of information and associated styles of inference. There are 
deep issues here about the connection between symbolic and visual paradigms, 
bypassed in our cheerfully technical account of ‘spatial logics’. We acknowl- 
edge them; but they are beyond the scope of this book. Likewise, many further 
varieties of spatial representation and spatial reasoning occur in disciplines like 
linguistics and psychology, and many more patterns await formal logical study. 
In addition, cognitive neuro-science tells us about the often surprising inter- 
play between visual, diagrammatic, and more symbolically oriented parts of 
the brain in any reasoning task. Again, we think this is a fascinating theme, and 
we trust that many interesting interactions with the spatial logics of this book 
will one day come to light. But we have chosen the current set of chapters for 
their coherence in topic and methodology, and frankly also, their mathematical 
quality. We see the broader area of spatial reasoning; we recognize its relevance 
to the contents of this book; and exclusion does not imply disrespect. Broader 
texts on spatial reasoning should, and no doubt will, appear. But, in putting 
together this tighter book, the editors have stuck to what they see as the basic 
axiom of ‘social geometry’: Always leave room for others. 

FIRST-ORDER MEREOTOPOLOGY


Ian Pratt-Hartmann 
University of Manchester 


Second Reader 


Ivo Düntsch
Brock University 


1. Introduction 


One of the many achievements of coordinate geometry has been to provide a 
conceptually elegant and unifying account of the nature of geometrical entities. 
According to this account, the one primitive spatial entity is the point, and 
the one primitive geometrical property of points is coordinate position. All 
other geometrical entities—lines, curves, surfaces and bodies—are nothing but 
collections of points; and all properties and relations involving these entities 
may be defined in terms of the relative positions of the points which make them 
up. The success and power of this reduction is so great that the identification of 
spatial regions with the sets of points they contain has come to seem virtually 
axiomatic. 

Over the years, however, various authors have expressed disquiet with this 
conceptual régime. The primary source of the disquiet is the conviction that 
our theory of space should use only those resources absolutely necessary to 
systematize the data of spatial experience. For points are such remote abstrac- 
tions from the objects with which we daily interact, and coordinate position 
such a distant relative of the spatial properties and relations which we directly 
perceive, that the question arises as to whether alternative mathematical models 
of space are not possible—in particular, models in which the primitive spatial 
entities are not points, but regions, and in which the primitive spatial properties 
and relations are qualitative rather than quantitative. 

An example will help to make these worries more concrete. Consider any
stable, medium-sized physical object, for example, a coffee cup. We all agree 
that this cup has a particular shape, which we may take to correspond to the 
region of space which it occupies at some instant. On the familiar point-based 
model of space, this region is a set of points. But suppose we now ask: is this 
set topologically open, semi-open or closed? That is: does it include none, 
some, or all of its boundary points? It is hard to see how we could answer 
this question. Not by microscopic analysis, since physical objects lose their 
definition on very small scales. And not by mathematical argument, since a 
world in which—say—cups are closed and saucers open is surely as logically 
possible as one where these topological characteristics are reversed. But if 
space really is made up of points as (modern) textbooks tell us, any assignment 
of a region of space to the coffee cup must answer the question. Perhaps then 
this model postulates too much. 

This chapter addresses the question: what region-based accounts of the topo- 
logical structure of space are possible? What can we say about them? How do 
they relate to each other and to the point-based models with which we are so 
familiar? 


2. Mereotopologies 


The purpose of this section is to outline the conceptual framework for region- 
based theories of space adopted in this chapter. Specifically, we introduce the 
concept of a mereotopology over a topological space, we discuss the role of 
mereotopologies as interpretations of signatures of topological primitives, and 
we list some key mathematical questions concerning them. 

We assume familiarity with fundamental concepts and standard facts of
point-set topology and Boolean algebra: for details, see, e.g. Kelley, 1955 and
Koppelberg, 1989, Ch. 1, respectively. In the context of point-set topology, if u
is any subset of a topological space X, we denote the interior of u by $u^{\circ}$ and
the closure of u by $u^{-}$. (The more usual notations of $\bar{u}$ and $[u]$ for the closure
of u are reserved for other purposes.) We write $\mathcal{F}(u)$ to denote the frontier of u,
namely $u^{-} \setminus u^{\circ}$.


2.1 Regular open sets 


How might we go about building a region-based model of the space we 
inhabit? The example of the coffee cup suggests that any such model should 
resolve the issue of frontier points. The following technical details are well- 
suited to this purpose. 


DEFINITION 2.1 Let u be a subset of some topological space X. We say that 
u is regular open (in X) if u is equal to the interior of its closure. We denote 
the set of regular open subsets of X by RO(X). 

Figure 2.1. Some regular and non-regular open sets of the Euclidean plane.

To fix our intuitions, consider the space $X = \mathbb{R}^2$. The elements of $\mathrm{RO}(\mathbb{R}^2)$
are the open subsets of $\mathbb{R}^2$ having no “cracks” or “pin-holes” (Fig. 2.1).
Corresponding remarks apply to the case $X = \mathbb{R}^3$. Taking regions of space to be
regular open subsets of $\mathbb{R}^3$ finesses the issues encountered above concerning
frontier points: regions are open by fiat. At the same time, however, it provides
us with satisfying formal reconstructions of the intuitive notions of intersecting,
merging and complementing regions, by means of the following standard
theorem (see, for example, Koppelberg, 1989, pp. 25-27).


PROPOSITION 2.2 Let X be a topological space. Then RO(X) is a Boolean
algebra under the order $\subseteq$. In this Boolean algebra, top and bottom are defined
by $1 = X$ and $0 = \emptyset$, and Boolean operations are defined by $x \cdot y = x \cap y$,
$x + y = (x \cup y)^{-\circ}$ and $-x = X \setminus x^{-}$.


Again, we can fix our intuitions regarding Proposition 2.2 by considering the
case $X = \mathbb{R}^2$. The product, $x \cdot y$, of two regular open sets x and y is simply
their intersection, which is guaranteed to be a regular open set. The sum, $x + y$,
of two regular open sets x and y is a little more complicated; very roughly, it
is the union of x and y with any internal boundaries removed (Fig. 2.2). Finally,
the complement, $-x$, of a regular open set x in $\mathrm{RO}(\mathbb{R}^2)$ is simply that part of
the plane not occupied by x or its frontier. Corresponding remarks apply to the
case $X = \mathbb{R}^3$.
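
The operations of Proposition 2.2 can be tried out by brute force on a finite topological space. The following Python sketch is an illustration only: the three-point space and the helper names (`interior`, `closure`, `product`, `plus`, `neg`) are assumptions made here and do not come from the chapter. The point c plays the role of a crack between the open points a and b, so that {a, b} is open without being regular open, and the sum a + b fills the crack.

```python
from itertools import combinations

# Hypothetical three-point space: a and b are open points, and c is a
# "crack" between them (the only open set containing c is X itself).
X = frozenset("abc")
OPENS = {frozenset(), frozenset("a"), frozenset("b"), frozenset("ab"), X}


def interior(u):
    """Union of all open sets contained in u."""
    return frozenset().union(*(o for o in OPENS if o <= u))


def closure(u):
    """Complement of the interior of the complement of u."""
    return X - interior(X - u)


def is_regular_open(u):
    """u is regular open iff it equals the interior of its closure."""
    return u == interior(closure(u))


def subsets(points):
    """All subsets of a finite set of points, as frozensets."""
    pts = sorted(points)
    return [frozenset(c) for k in range(len(pts) + 1)
            for c in combinations(pts, k)]


RO = {u for u in subsets(X) if is_regular_open(u)}


def product(x, y):
    """x . y: plain intersection."""
    return x & y


def plus(x, y):
    """x + y: interior of the closure of the union."""
    return interior(closure(x | y))


def neg(x):
    """-x: complement of the closure."""
    return X - closure(x)


a, b = frozenset("a"), frozenset("b")
print(sorted("".join(sorted(r)) for r in RO))  # the regular open sets
print(plus(a, b) == X, neg(a) == b)            # the sum fills the crack
```

In this toy space the union of a and b is a proper subset of their sum, which is the finite analogue of removing an internal boundary.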

It sometimes helps to reformulate the definition of regular open sets as
follows. If $u \subseteq X$, then $\bigcup \{o \subseteq X \mid o \text{ open},\ o \cap u = \emptyset\}$ is the largest open
subset of X disjoint from u. We call this set the pseudo-complement of u, denoted
$u^{*}$. From the above definitions, $u^{*} = X \setminus u^{-}$ and $u^{**} = (u^{-})^{\circ}$. Hence, u is
regular open if and only if $u = u^{**}$; and, if u is regular open, $u^{*}$ is simply $-u$.
The following lemma shows that every subset of X is ‘close’ to a regular open
subset.


LEMMA 2.3 Let X be a topological space. For every $u \subseteq X$, the set
$r = (u^{-})^{\circ}$ is an element of RO(X) such that $u^{\circ} \subseteq r \subseteq u^{-}$. If u is open, then r is
unique.

Proof Obviously $u^{\circ} \subseteq r \subseteq u^{-}$. To show that $(u^{-})^{\circ} \in \mathrm{RO}(X)$, it suffices
to show that $u^{****} = u^{**}$. If v is any set at all, then $v^{**} \cap v^{*} = \emptyset$, whence
$v^{*} \subseteq v^{***}$. Moreover, if o is any open set, then $o^{***}$ is an open set disjoint from
$o^{**}$ and hence disjoint from every open set disjoint from $o^{*}$ and hence disjoint
from o itself, whence $o^{***} \subseteq o^{*}$. Thus, for any open set o, $o^{***} = o^{*}$. Since
$u^{*}$ is open, we have $u^{****} = u^{**}$. For the final statement, if $s \in \mathrm{RO}(X)$ also
satisfies $u \subseteq s \subseteq u^{-}$, then the (regular) open sets $s \cdot -r$ and $r \cdot -s$ are both in
$u^{-} \setminus u$ and so are empty. QED
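
The identities used around Lemma 2.3 can be checked exhaustively on a small example. The sketch below is hedged accordingly: the three-point space and the function names (`pseudo_complement`, `check_lemma_2_3`) are hypothetical choices made for illustration, and a finite check is of course no substitute for the proof above.

```python
from itertools import combinations

# Hypothetical three-point space: a and b are open points, c separates them.
X = frozenset("abc")
OPENS = {frozenset(), frozenset("a"), frozenset("b"), frozenset("ab"), X}


def interior(u):
    """Union of all open sets contained in u."""
    return frozenset().union(*(o for o in OPENS if o <= u))


def closure(u):
    """Complement of the interior of the complement of u."""
    return X - interior(X - u)


def pseudo_complement(u):
    """u*: the largest open set disjoint from u."""
    return frozenset().union(*(o for o in OPENS if not (o & u)))


def subsets(points):
    """All subsets of a finite set of points, as frozensets."""
    pts = sorted(points)
    return [frozenset(c) for k in range(len(pts) + 1)
            for c in combinations(pts, k)]


def check_lemma_2_3():
    """Check u* = X \\ closure(u), u** = interior(closure(u)), and Lemma 2.3."""
    for u in subsets(X):
        star = pseudo_complement(u)
        assert star == X - closure(u)
        r = pseudo_complement(star)                          # r = u**
        assert r == interior(closure(u))
        assert pseudo_complement(pseudo_complement(r)) == r  # r is regular open
        assert interior(u) <= r <= closure(u)
    return True


print(check_lemma_2_3())
```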

Figure 2.2. Three pairs of regions and (below) their sums in $\mathrm{RO}(\mathbb{R}^2)$.


For the above reasons, it has become common practice in discussions of
region-based theories of space to model regions of space as regular open subsets
of $\mathbb{R}^3$; and that is the approach we shall take here. In the sequel, we shall always
use the letters r, s, t to range over regular open sets; when we are concerned only
with regular open sets, we write $r \le s$ in preference to $r \subseteq s$, $0$ in preference
to $\emptyset$ and $r \cdot s$ in preference to $r \cap s$. Resorting to regular open sets is of course
not the only way of dealing with boundary disputes. One obvious alternative
is to use regular closed sets (sets equal to the closures of their interiors), since
the regular closed sets of any topological space also form a Boolean algebra,
which is in fact isomorphic to the Boolean algebra of regular open sets. Thus,
in modelling regions as regular open sets of $\mathbb{R}^3$, it is understood that it is the
resulting structure that is important, not the precise constitution of its elements.
Understanding what this idea means in detail forms a central theme of this
chapter.

We conclude our discussion of regular open sets by proving some technical
results which will be useful below. Recall in this context that, if u, v are
connected subsets of a topological space, with $u \cap v \neq \emptyset$, then $u \cup v$ is connected.
Moreover, if u is connected and $u \subseteq v \subseteq u^{-}$, then v is connected.


LEMMA 2.4 Let X be a topological space, let $u, v \subseteq X$ and let $r, s \in \mathrm{RO}(X)$.
We have:

(i) $(u \cup v)^{-\circ} = u^{-\circ} + v^{-\circ}$;

(ii) $r \cup s \subseteq r + s \subseteq r \cup s \cup (r^{-} \cap s^{-}) \subseteq (r \cup s)^{-}$;

(iii) $(r + s)^{-} = r^{-} \cup s^{-} = (r \cup s)^{-}$;

(iv) if r and s are connected with $r \cdot s > 0$, then $r + s$ is connected.


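Proof (i) By Lemma 2.3, $(u \cup v)^{-\circ}$ is a regular open set which evidently
contains the regular open sets $u^{-\circ}$ and $v^{-\circ}$. Certainly, then,
$u^{-\circ} + v^{-\circ} \subseteq (u \cup v)^{-\circ}$. For the reverse inclusion,
$(X \setminus u)^{\circ} \cap (X \setminus v)^{\circ} \cap (u \cup v)^{-} = \emptyset$,
whence $(X \setminus u)^{\circ} \cap (X \setminus v)^{\circ} \cap (u \cup v)^{-\circ} = \emptyset$,
whence $(X \setminus u)^{\circ-} \cap (X \setminus v)^{\circ} \cap (u \cup v)^{-\circ} = \emptyset$,
whence $(X \setminus u)^{\circ-\circ} \cap (X \setminus v)^{\circ-} \cap (u \cup v)^{-\circ} = \emptyset$,
whence $((X \setminus u)^{\circ-} \cap (X \setminus v)^{\circ-})^{\circ} \cap (u \cup v)^{-\circ} = \emptyset$.
That is: $(u \cup v)^{-\circ} \subseteq (u^{-\circ} \cup v^{-\circ})^{-}$. Since the left-hand side is open,
it is contained in the interior of the right-hand side. But by Proposition 2.2,
$(u^{-\circ} \cup v^{-\circ})^{-\circ} = u^{-\circ} + v^{-\circ}$.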

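(ii) The only non-trivial inclusion is $r + s \subseteq r \cup s \cup (r^{-} \cap s^{-})$. So suppose
$p \notin s$ and $p \notin r^{-}$. That is, $p \in (-s)^{-}$ and $p \in -r$. But then, for all open o with
$p \in o$, $o \cap -r$ is also open with $p \in o \cap -r$, whence $(o \cap -r) \cap -s \neq \emptyset$; that
is, $o \cap (-r \cdot -s) \neq \emptyset$. Hence $p \in (-r \cdot -s)^{-}$, so $p \notin -(-r \cdot -s) = r + s$. A
similar argument applies if $p \notin r$ and $p \notin s^{-}$.

(iii) $(r + s)^{-} = X \setminus -(r + s) = X \setminus (-r \cdot -s) = (X \setminus -r) \cup (X \setminus -s) = r^{-} \cup s^{-}$.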





(iv) Certainly, $r \cup s$ is connected, and by (ii), $r \cup s \subseteq r + s \subseteq (r \cup s)^{-}$, whence
$r + s$ is connected. QED


We note in passing that determining the validity of statements such as those 
of Lemma 2.4 is actually a decidable problem. See, e.g. Cantone and Cutello, 
1994, Nutt, 1999, Pratt-Hartmann, 2002 and, for a fuller discussion, Ch. 9. 


2.2 Mereotopologies 


We have argued, provisionally, that, for a subset of $\mathbb{R}^3$ to count as a region,
it should be regular open. But it would be hasty to assume that all regular open
subsets of $\mathbb{R}^3$ should count as regions, at least if regions are supposed to be
parts of space occupied (or left unoccupied) by physical objects, since $\mathrm{RO}(\mathbb{R}^3)$
contains some pathological members. This presents us with the question: if
not all regular open subsets of $\mathbb{R}^3$ qualify as bona fide regions, which do? As
we shall see, the answers available and the issues which hinge on them require
detailed analysis.

In view of these uncertainties, we adopt a very general notion of a region-based
model of space, just sufficiently constrained that we can sensibly confine
attention to the structure of regions in question without worrying about the points
of which they are composed. In the context of point-set topology, a topological
space is commonly said to be semi-regular if it has a basis of regular open sets,
and locally connected if it has a basis of connected sets. It is easy to see that, in a
locally connected space, every component of an open set is open. Recall also,
in the context of Boolean algebras, that, if B is a Boolean algebra and B′ a
Boolean sub-algebra of B, then B′ is said to be dense (in B) if, for every $b \in B$
with $0 < b$, there exists $b' \in B'$ with $0 < b' \le b$.


DEFINITION 2.5 Let X be a topological space. A mereotopology over X is
a Boolean sub-algebra M of RO(X) such that, if o is an open subset of X and
$p \in o$, there exists $r \in M$ such that $p \in r \subseteq o$. We refer to the elements of M
as regions. If M is a mereotopology such that any component of a region in M
is also a region in M, then we say that M respects components.
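
Definition 2.5 can be tested mechanically when X is finite. The sketch below rests on assumptions introduced only for illustration: the three-point space and the function names (`is_boolean_subalgebra`, `is_mereotopology`) are hypothetical. It checks the two requirements in turn, closure under the Boolean operations of RO(X) and the basis-like condition on open sets.

```python
# Hypothetical three-point space: a and b are open points, c separates them.
X = frozenset("abc")
OPENS = {frozenset(), frozenset("a"), frozenset("b"), frozenset("ab"), X}


def interior(u):
    """Union of all open sets contained in u."""
    return frozenset().union(*(o for o in OPENS if o <= u))


def closure(u):
    """Complement of the interior of the complement of u."""
    return X - interior(X - u)


def is_regular_open(u):
    return u == interior(closure(u))


def plus(x, y):
    """Boolean sum in RO(X)."""
    return interior(closure(x | y))


def neg(x):
    """Boolean complement in RO(X)."""
    return X - closure(x)


def is_boolean_subalgebra(M):
    """M contains 0 and 1, consists of regular open sets, and is closed
    under product, sum and complement."""
    if frozenset() not in M or X not in M:
        return False
    if not all(is_regular_open(r) for r in M):
        return False
    return (all(neg(r) in M for r in M)
            and all(plus(r, s) in M and (r & s) in M for r in M for s in M))


def is_mereotopology(M):
    """Definition 2.5: for every open o and every p in o there is a
    region r in M with p in r and r contained in o."""
    if not is_boolean_subalgebra(M):
        return False
    return all(any(p in r and r <= o for r in M) for o in OPENS for p in o)


RO = {frozenset(), frozenset("a"), frozenset("b"), X}
TRIVIAL = {frozenset(), X}
print(is_mereotopology(RO), is_mereotopology(TRIVIAL))
```

The two-element algebra is a Boolean sub-algebra of RO(X) but fails the second condition, since no region fits inside the open set {a}.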


Note that a mereotopology over X is always a dense sub-algebra of RO(X). 
Our first task is to check that RO(X) is a mereotopology, for a suitable class 
of topological spaces. 


LEMMA 2.6 Let X be a semi-regular space. Then RO(X) is a mereotopology
over X; if X is also locally connected, then RO(X) respects components.


Proof The first part of the lemma is immediate from the relevant definitions. For
the second part, let $r \in \mathrm{RO}(X)$, and let s be a component of r. Since X is
locally connected, s is open, whence, by Lemma 2.3, $(s^{-})^{\circ}$ is regular open with
$s \subseteq (s^{-})^{\circ} \subseteq s^{-}$. Then, $s^{-\circ}$ is a connected subset of r including s, whence
$s = s^{-\circ}$ by the maximality of s. QED

Some etymological explanation is in order here. The term mereology was first 
introduced by Leśniewski, and denotes the logic of the part-whole relationship. 
(For a survey, see, e.g. Simons, 1987.) The term mereotopology is a much more 
recent coinage, and standardly denotes the study of topological relationships in 
which regions, rather than points, are the primitive objects. (It is unclear where 
the word first appeared in print.) The employment of the word as a count-noun 
in Definition 2.5, to denote a certain class of mathematical structures, is new 
here, and prompted by analogy with the parallel usage of the word topology. 

The foregoing discussion suggests that our search for a region-based model
of space should begin with an examination of mereotopologies over $\mathbb{R}^3$. This
approach may at first seem dissatisfying, because it depends for its formulation 
on the very point-based model of space we are trying to escape. As we shall see, 
however, it is the structure of the resulting collection of regions that will interest 
us—and the characterization of that structure in purely intrinsic terms forms 
one of the main themes of this chapter. But before we can seek such intrinsic 
characterizations, we must first clarify what it is we want to characterize. 


2.3 Geometric mereotopologies


The question before us is to identify the regular open subsets of $\mathbb{R}^3$ which we
are prepared to count as “sensible” regions of space. Here is a standard answer
from the mathematical literature. Let L' be the first-order language with the
arithmetic signature $(\le, +, \cdot, 0, 1)$, interpreted over $\mathbb{R}$ in the usual way. (This
interpretation is of course completely separate from our use of the same symbols
to denote Boolean operations on regular open sets!) For the purposes of this
chapter, we may say that a set $u \subseteq \mathbb{R}^n$ is semi-algebraic if there exists an
L'-formula $\varphi(\bar{x}, \bar{y})$ in $n + m$ variables $\bar{x}, \bar{y}$ and an m-tuple of real numbers $\bar{b}$
such that

$u = \{\bar{a} \in \mathbb{R}^n \mid \text{the } (n + m)\text{-tuple } \bar{a}, \bar{b} \text{ satisfies the formula } \varphi(\bar{x}, \bar{y})\}$.

For a detailed discussion of semi-algebraic sets, see, e.g. van den Dries, 1998,
Bochnak et al., 1998 and also Ch. 13. (The more standard definition of
semi-algebraic sets is equivalent to ours, and makes the name less puzzling.) For
mereotopological purposes, we are exclusively interested in those semi-algebraic
subsets of $\mathbb{R}^n$ which are regular open.


DEFINITION 2.7 For $n > 0$, we denote the set of regular open, semi-algebraic
sets in $\mathbb{R}^n$ by $\mathrm{ROS}(\mathbb{R}^n)$.


LEMMA 2.8 For $n > 0$, $\mathrm{ROS}(\mathbb{R}^n)$ is a mereotopology over $\mathbb{R}^n$.


Proof We first show that $\mathrm{ROS}(\mathbb{R}^n)$ is a Boolean sub-algebra of $\mathrm{RO}(\mathbb{R}^n)$.
Evidently, $0, 1 \in \mathrm{ROS}(\mathbb{R}^n)$. Moreover, if a set u is definable by a first-order
formula in the language of arithmetic, then so are its closure $u^{-}$ and its interior
$u^{\circ}$. Hence, if $r, s \in \mathrm{ROS}(\mathbb{R}^n)$, then so are $r \cdot s = r \cap s$, $r + s = (r \cup s)^{-\circ}$
and $-r = \mathbb{R}^n \setminus r^{-}$. We must establish that, for $p \in o$ with $o \subseteq \mathbb{R}^n$ open, there
exists $r \in \mathrm{ROS}(\mathbb{R}^n)$ such that $p \in r \subseteq o$. But this is obvious since any open
ball is an element of $\mathrm{ROS}(\mathbb{R}^n)$. QED


The structure of regular open semi-algebraic subsets of $\mathbb{R}^3$ might have a better
claim to count as a region-based model of space than the whole of $\mathrm{RO}(\mathbb{R}^3)$,
because it does a good job of ruling out various pathological regular open sets.

More generally, semi-algebraic sets count as well-behaved. One of their
fundamental properties is that they admit of “cell decompositions”. If $d > 0$,
a d-cell in $\mathbb{R}^n$ is any semi-algebraic subset of $\mathbb{R}^n$ homeomorphic to the open
d-dimensional ball; a 0-cell in $\mathbb{R}^n$ is a singleton; and a cell is a d-cell for some
d ($0 \le d \le n$). The following result is standard (van den Dries, 1998, Ch. 3,
Theorem 2.11).





PROPOSITION 2.9 (CELL DECOMPOSITION THEOREM) If u is a semi-algebraic
subset of $\mathbb{R}^n$, then u is the union of a finite collection of pairwise
disjoint, semi-algebraic cells.

For regular open semi-algebraic sets, this yields:


LEMMA 2.10 Every $r \in \mathrm{ROS}(\mathbb{R}^n)$ is the sum of finitely many pairwise disjoint
n-cells in $\mathrm{ROS}(\mathbb{R}^n)$.


Proof By Proposition 2.9, let $r = u_1 \cup \cdots \cup u_m$ where the $u_i$ are pairwise
disjoint, semi-algebraic cells. Since r is regular,
$r = r^{-\circ} = (u_1 \cup \cdots \cup u_m)^{-\circ} = u_1^{-\circ} + \cdots + u_m^{-\circ}$, by Lemma 2.4 (i).
If $u_i$ is a d-cell for $d < n$, then $u_i^{-\circ} = 0$;
if $u_i$ is an n-cell, $u_i^{-\circ} = u_i$. QED


The following notion will play an important part in the ensuing discussion. 


DEFINITION 2.11 A mereotopology M is finitely decomposable if every 
region in M is the sum of finitely many connected regions in M. 


LEMMA 2.12 ROS(R”) is finitely decomposable. 


Proof By Lemma 2.10, since cells are connected. QED 


LEMMA 2.13 Every finitely decomposable mereotopology M over a locally 
connected space X respects components; moreover, every region in M is the 
sum of its components. 


Proof Suppose $r \in M$, and s is a component of r. By Lemma 2.6, $s \in \mathrm{RO}(X)$.
Let $r_1, \ldots, r_n$ be connected elements of M such that $r = r_1 + \cdots + r_n$.
By the maximality of s and Lemma 2.4 (iv), either $r_i \le s$ or $r_i \cdot s = 0$ for all i
($1 \le i \le n$). Thus, s is the sum of those $r_i$ such that $r_i \le s$. QED


Of course, the converse of Lemma 2.13 is false: although RO(X) respects
components for any locally connected space X, it is easy to see that, for example,
$\mathrm{RO}(\mathbb{R}^n)$ is not finitely decomposable for any $n > 0$.

The mereotopology $\mathrm{ROS}(\mathbb{R}^n)$ is thus at least a plausible region-based model
of the space we inhabit. But it is not the only candidate for this job. Observe that
any $(n - 1)$-dimensional hyperplane of $\mathbb{R}^n$ cuts $\mathbb{R}^n$ into two residual domains,
which we shall call half-spaces. It is easy to see that these half-spaces are
regular open, with each being the pseudo-complement of the other. Hence, we
can speak about the sums, products and complements of half-spaces in $\mathrm{RO}(\mathbb{R}^n)$.


DEFINITION 2.14 A basic polytope in $\mathbb{R}^n$ is the product, in $\mathrm{RO}(\mathbb{R}^n)$, of finitely
many half-spaces. A polytope in $\mathbb{R}^n$ is the sum, in $\mathrm{RO}(\mathbb{R}^n)$, of any finite set of
basic polytopes. We denote the set of polytopes in $\mathbb{R}^n$ by $\mathrm{ROP}(\mathbb{R}^n)$; we call
the polytopes in $\mathrm{ROP}(\mathbb{R}^2)$ polygons and those in $\mathrm{ROP}(\mathbb{R}^3)$ polyhedra.


Thus, polytopes (in our sense) may be unbounded, disconnected, and may
have disconnected complements. Fig. 2.3 shows a selection of polygons. (In
alternative parlance, the elements of $\mathrm{ROP}(\mathbb{R}^n)$ are the regular open semi-linear
sets.) Evidently, the polyhedra constitute a more parsimonious region-based
model of space than does $\mathrm{ROS}(\mathbb{R}^3)$.

Figure 2.3. Three (differently shaded) regions in the mereotopology $\mathrm{ROP}(\mathbb{R}^2)$.

Indeed, the following construction gives us a more parsimonious spatial
ontology still. If an $(n - 1)$-dimensional hyperplane in $\mathbb{R}^n$ is defined by an
equation $a_0 + a_1 x_1 + \cdots + a_n x_n = 0$, where the $a_i$ ($0 \le i \le n$) are rational
numbers, we call it a rational hyperplane; and if a half-space is bounded by a
rational hyperplane, we call it a rational half-space. Now we define:


DEFINITION 2.15 A basic rational polytope in $\mathbb{R}^n$ is the product, in $\mathrm{RO}(\mathbb{R}^n)$,
of finitely many rational half-spaces. A rational polytope in $\mathbb{R}^n$ is the sum,
in $\mathrm{RO}(\mathbb{R}^n)$, of any finite set of basic rational polytopes. We denote the set
of rational polytopes in $\mathbb{R}^n$ by $\mathrm{ROQ}(\mathbb{R}^n)$; we call the elements of $\mathrm{ROQ}(\mathbb{R}^2)$
rational polygons and those of $\mathrm{ROQ}(\mathbb{R}^3)$ rational polyhedra.


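Evidently, $\mathrm{ROQ}(\mathbb{R}^n) \subseteq \mathrm{ROP}(\mathbb{R}^n) \subseteq \mathrm{ROS}(\mathbb{R}^n) \subseteq \mathrm{RO}(\mathbb{R}^n)$. Note that
$\mathrm{ROQ}(\mathbb{R}^n)$ is countable.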
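
Because rational half-spaces are given by finitely many rational coefficients, they can be represented exactly on a computer. The following Python sketch is a minimal illustration under stated assumptions: the function names `half_space` and `basic_polytope` are hypothetical, and each region is represented only by a point-membership test. It covers basic rational polytopes, whose product is a plain intersection; membership in a sum of basic polytopes is subtler, since the sum is the interior of the closure of the union, and is not attempted here.

```python
from fractions import Fraction


def half_space(coeffs, positive=True):
    """Open rational half-space bounded by a0 + a1*x1 + ... + an*xn = 0.

    coeffs is (a0, a1, ..., an); the half-space is the side on which the
    expression is > 0 (or < 0 when positive is False).
    """
    a = [Fraction(c) for c in coeffs]

    def contains(point):
        value = a[0] + sum(ai * Fraction(xi) for ai, xi in zip(a[1:], point))
        return value > 0 if positive else value < 0

    return contains


def basic_polytope(half_spaces):
    """Product (intersection) of finitely many half-spaces."""
    return lambda point: all(h(point) for h in half_spaces)


# The open unit square (0, 1) x (0, 1) as a basic rational polygon.
unit_square = basic_polytope([
    half_space([0, 1, 0]),    # x > 0
    half_space([1, -1, 0]),   # 1 - x > 0
    half_space([0, 0, 1]),    # y > 0
    half_space([1, 0, -1]),   # 1 - y > 0
])

print(unit_square((Fraction(1, 2), Fraction(1, 3))))  # an interior point
print(unit_square((1, Fraction(1, 2))))               # a frontier point
```

Exact rational arithmetic means that frontier points are excluded reliably, in keeping with the convention that regions are open.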


LEMMA 2.16 The collections $\mathrm{ROP}(\mathbb{R}^n)$ and $\mathrm{ROQ}(\mathbb{R}^n)$ are finitely
decomposable mereotopologies over $\mathbb{R}^n$.


Proof Basic polytopes are convex, and hence connected. QED 


As models of the space we inhabit, $\mathrm{ROP}(\mathbb{R}^3)$ and $\mathrm{ROQ}(\mathbb{R}^3)$ may seem overly
austere, for they contain no regions with curved boundaries. However, their
study turns out to be instructive, as we shall see below.


2.4 Interpretations 


So far, we have discussed various ways of selecting a collection of “regions” 
from among the subsets of R”. But this selection process only really becomes 
interesting when we consider formal languages whose variables range over these 
collections, and whose non-logical constants belong to a limited repertoire of 
spatial primitives. 

We assume familiarity with basic first-order logic: for details, see Hodges, 
1993, Ch 1. In this context, we employ the following standard notation and 
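terminology. Let $\Sigma$ be a signature consisting of (zero or more) predicates,
function-symbols and individual constants; we denote the first-order language
with signature $\Sigma$ by $L_\Sigma$. An $L_\Sigma$-formula with no free variables is called an
$L_\Sigma$-sentence. Let $\mathfrak{A}$ be a structure interpreting the symbols in $\Sigma$ over some
domain A (assumed non-empty). For any $L_\Sigma$-formula $\varphi(\bar{x})$, with $n > 0$ free
variables $\bar{x}$ and any n-tuple $\bar{a}$ from A, we write $\mathfrak{A} \models \varphi[\bar{a}]$ if $\bar{a}$ satisfies $\varphi(\bar{x})$
in $\mathfrak{A}$; similarly, for any $L_\Sigma$-sentence $\varphi$, we write $\mathfrak{A} \models \varphi$ if $\varphi$ is true in $\mathfrak{A}$. We
call $\{\psi \mid \psi \text{ an } L_\Sigma\text{-sentence and } \mathfrak{A} \models \psi\}$ the $L_\Sigma$-theory of $\mathfrak{A}$, denoted $\mathrm{Th}_\Sigma(\mathfrak{A})$.
Two structures $\mathfrak{A}$ and $\mathfrak{B}$ are elementarily equivalent (for $\Sigma$), written $\mathfrak{A} \equiv_\Sigma \mathfrak{B}$, if
$\mathrm{Th}_\Sigma(\mathfrak{A}) = \mathrm{Th}_\Sigma(\mathfrak{B})$. We write $f : \mathfrak{A} \simeq_\Sigma \mathfrak{B}$ if f is a $\Sigma$-structure isomorphism
from $\mathfrak{A}$ onto $\mathfrak{B}$ (and $\mathfrak{A} \simeq_\Sigma \mathfrak{B}$ if such an f exists). It is a simple result that if
$f : \mathfrak{A} \simeq_\Sigma \mathfrak{B}$ and $\varphi(\bar{x})$ is an $L_\Sigma$-formula, then $\mathfrak{A} \models \varphi[\bar{a}]$ implies $\mathfrak{B} \models \varphi[f(\bar{a})]$
for every tuple $\bar{a}$ from A; in particular, $\mathfrak{A} \simeq_\Sigma \mathfrak{B}$ implies $\mathfrak{A} \equiv_\Sigma \mathfrak{B}$. We write
$\mathfrak{A} \subseteq_\Sigma \mathfrak{B}$ if $\mathfrak{A}$ is a submodel of $\mathfrak{B}$ (i.e. $A \subseteq B$ and $\mathfrak{A}$ is the restriction of $\mathfrak{B}$ to A),
and $\mathfrak{A} \preceq_\Sigma \mathfrak{B}$ if $\mathfrak{A}$ is an elementary submodel of $\mathfrak{B}$ (i.e. $A \subseteq B$ and every
tuple $\bar{a}$ of A satisfies the same $L_\Sigma$-formulas in both $\mathfrak{A}$ and $\mathfrak{B}$). We say that $\mathfrak{A}$ is
elementarily embeddable in $\mathfrak{B}$ if $\mathfrak{A}$ is isomorphic to an elementary submodel
of $\mathfrak{B}$. Trivially, $\mathfrak{A} \preceq_\Sigma \mathfrak{B}$ implies $\mathfrak{A} \equiv_\Sigma \mathfrak{B}$. Reference to the signature $\Sigma$, and
the associated subscripts, is suppressed when clear from context.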

Let M be a mereotopology over some topological space X. If $\Sigma$ is a signature
whose symbols conventionally denote familiar mereological or topological
concepts, then M can always be regarded as a $\Sigma$-structure by interpreting the
symbols of $\Sigma$ in the familiar way. In particular, we take the symbols $0, 1, +, \cdot, -$
and $\le$ to have the obvious (Boolean algebra) interpretations over M; similarly,
we take the unary predicate c to denote the property of being connected, and
the binary predicate C to denote the relation which holds between two regions
if and only if their topological closures intersect. Table 2.1 gives a formal
summary. Under these interpretations, we may regard any mereotopology M
as an interpretation for the signature $\Sigma = (0, 1, +, \cdot, -, \le, c, C)$, or any subset
thereof. That is: any $L_\Sigma$-sentence has a truth-value in M, and any $L_\Sigma$-formula
$\varphi(\bar{x})$ with $n > 0$ free variables defines an n-ary relation over M, namely, the
set of n-tuples from M satisfying $\varphi(\bar{x})$. We remark that our interpretation of C
is intended as a rational reconstruction of the relation which Whitehead, 1929
called “extensive connection”, and which has historically played a prominent
role in region-based theories of space. Since Whitehead’s term risks confusion
with the standard topological notion of connectedness, we follow more recent
usage and read C(x, y) as “x contacts y”.

Some examples will help to clarify the issues that arise concerning first-order 
languages interpreted over mereotopologies. 





Symbol | Type                | Interpretation
0      | individual constant | $0^M = \emptyset$
1      | individual constant | $1^M = X$
+      | binary function     | $+^M(r, s) = ((r \cup s)^{-})^{\circ}$
·      | binary function     | $\cdot^M(r, s) = r \cap s$
−      | unary function      | $-^M(r) = X \setminus r^{-}$
≤      | binary predicate    | $\le^M = \{(r, s) \in M^2 \mid r \subseteq s\}$
c      | unary predicate     | $c^M = \{r \in M \mid r \text{ connected}\}$
C      | binary predicate    | $C^M = \{(r, s) \in M^2 \mid r^{-} \cap s^{-} \neq \emptyset\}$

Table 2.1. Interpretations of common mereotopological primitives, where M is a
mereotopology over a topological space X.


EXAMPLE 2.17 Let $\Sigma = (C, c, \le)$, and let $\psi_{\mathrm{inf}}$ be the $L_\Sigma$-sentence

$\forall x \forall y (C(x, y) \rightarrow \exists z (c(z) \wedge z \le y \wedge C(x, z)))$.

This sentence “says” that, if a region contacts another region, then it contacts
some connected part of it. Let M be any finitely decomposable mereotopology;
then $M \models \psi_{\mathrm{inf}}$. For suppose $M \models C[r, s]$, and let $s_1, \ldots, s_m$ be connected
regions of M summing to s. By Lemma 2.4(iii), $s^{-} = s_1^{-} \cup \cdots \cup s_m^{-}$, whence
$M \models C[r, s_i]$ for some i. On the other hand, it is not difficult to see that
$\mathrm{RO}(\mathbb{R}^2) \not\models \psi_{\mathrm{inf}}$. Fig. 2.4 shows two regular open regions r, s in the plane,
where r has infinitely many components, and s touches the closure of r but is
separated from each of its components.

Figure 2.4. Two elements in $\mathrm{RO}(\mathbb{R}^2)$, one with infinitely many components.
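
Over a finite mereotopology, the truth of a first-order sentence can be settled by exhaustive search. The sketch below evaluates the sentence of Example 2.17, namely: for all x, y, if C(x, y) then some connected z with z ≤ y satisfies C(x, z). Everything specific in it is an assumption made for illustration: the five-point "digital line" (open points a, b, c separated by p and q) and the function names are hypothetical. Its regular open algebra is finite, so every region is a finite sum of connected regions and the sentence is expected to hold.

```python
from itertools import combinations

# Hypothetical five-point "digital line": a, b, c are open points;
# p lies between a and b, and q lies between b and c.
X = frozenset("apbqc")
BASIS = [frozenset(s) for s in ("a", "b", "c", "apb", "bqc")]


def families(items):
    """All finite sub-families of a list, as tuples."""
    items = list(items)
    return [c for k in range(len(items) + 1) for c in combinations(items, k)]


# The open sets are exactly the unions of basis elements.
OPENS = {frozenset().union(*fam) for fam in families(BASIS)}


def interior(u):
    return frozenset().union(*(o for o in OPENS if o <= u))


def closure(u):
    return X - interior(X - u)


# Regular open sets are open, so it is enough to filter the open sets.
RO = {o for o in OPENS if o == interior(closure(o))}


def connected(u):
    """u is connected iff it has no proper non-empty relatively clopen part."""
    rel = {o & u for o in OPENS}
    return not any(v and v != u and (u - v) in rel for v in rel)


def contact(x, y):
    """C(x, y): the closures of x and y intersect."""
    return bool(closure(x) & closure(y))


def psi_inf():
    """For all x, y: C(x, y) implies some connected z <= y has C(x, z)."""
    return all(
        any(connected(z) and z <= y and contact(x, z) for z in RO)
        for x in RO for y in RO if contact(x, y)
    )


print(len(RO), psi_inf(), all(connected(r) for r in RO))
```

The final value printed shows that the check is not vacuous: the region {a, c} belongs to the algebra but is not connected.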








Example 2.17 shows, in particular, that the differences between the region-based
models of space $\mathrm{RO}(\mathbb{R}^2)$ and $\mathrm{ROS}(\mathbb{R}^2)$ are “visible” to certain first-order
languages with signatures of topological primitives. In fact, the existence of
regions with infinitely many components is not the only difference between
these mereotopologies, as the next example shows.


EXAMPLE 2.18 Let © = (c, +), and let sum be the Ly-sentence 
Va VxqVx3(ce(x1)Ac(@2)Ac(a3)Ac(a1+22+23) > (c(a1+22)Ve(x1+23))). 


Figure 2.5. Three elements in $\mathrm{RO}(\mathbb{R}^2)$.

This sentence “says” that if three connected regions have a connected sum,
then the first must form a connected sum with one of the other two. We show
in Lemma 2.56 below that, if $M$ is any of $\mathrm{ROS}(\mathbb{R}^2)$, $\mathrm{ROP}(\mathbb{R}^2)$ or $\mathrm{ROQ}(\mathbb{R}^2)$,
then $M \models \psi_{\mathrm{sum}}$. However, it turns out that $\mathrm{RO}(\mathbb{R}^2) \not\models \psi_{\mathrm{sum}}$. For let

$$\begin{aligned}
r_1 &= \{(x,y) \mid -1 < x < 0;\ -1 - x < y < 1 + x\} \\
r_2 &= \{(x,y) \mid 0 < x < 1;\ -1 - x < y < \sin(1/x)\} \\
r_3 &= \{(x,y) \mid 0 < x < 1;\ \sin(1/x) < y < 1 + x\},
\end{aligned}$$

as depicted in Fig. 2.5. It is easy to check that $r_1 + r_2 + r_3$ is the large triangle,
and so is certainly connected, but that neither $r_1 + r_2$ nor $r_1 + r_3$ is connected.
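
The "easy to check" step can be unpacked as follows. This is only a sketch, assuming (as in Table 2.1) that the sum of two regions is the interior of the closure of their union; the segment $a$, the height $y_0$ and the points $x_k$ are auxiliary names introduced here for illustration and do not appear in the original text.

```latex
% Sketch: r_1 + r_2 is not connected (r_1 + r_3 is handled symmetrically).
% Auxiliary notation (assumed here, not from the text): a, y_0, x_k.
\[
  a = \{(0,y) \mid -1 < y < 1\}, \qquad
  x_k = \frac{1}{3\pi/2 + 2\pi k} \quad (k = 0, 1, 2, \ldots), \qquad
  \sin(1/x_k) = -1, \quad x_k \to 0 .
\]
% Fix (0,y_0) in a. Then sin(1/x_k) = -1 < y_0 < 1 + x_k, so (x_k, y_0) lies in r_3
% for every k, and (x_k, y_0) -> (0, y_0).
% r_3 is open and disjoint from r_1 \cup r_2, hence disjoint from (r_1 \cup r_2)^-.
% So every neighbourhood of (0,y_0) leaves (r_1 \cup r_2)^-, and
\[
  (r_1 + r_2) \cap a \;=\; \bigl((r_1 \cup r_2)^-\bigr)^\circ \cap a \;=\; \emptyset .
\]
% The endpoints (0,1) and (0,-1) lie on the boundary of the large triangle,
% so they are not interior points of (r_1 \cup r_2)^- either.
% Therefore r_1 + r_2 misses the line x = 0, and
\[
  r_1 + r_2 \;=\; \bigl((r_1 + r_2) \cap \{x < 0\}\bigr) \,\cup\, \bigl((r_1 + r_2) \cap \{x > 0\}\bigr)
\]
% is a union of two disjoint open sets, non-empty because they include r_1 and r_2.
% For r_1 + r_3, use x_k = 1/(\pi/2 + 2\pi k), where sin(1/x_k) = 1,
% so that -1 - x_k < y_0 < sin(1/x_k) and (x_k, y_0) lies in r_2.
```

The oscillation of $\sin(1/x)$ near $x = 0$ is what prevents either of $r_2$, $r_3$ from being glued to $r_1$ on its own, while the three regions together fill out the triangle.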


We shall see in Sec. 5 that, in some sense, Examples 2.17 and 2.18 represent
the only differences between $\mathrm{RO}(\mathbb{R}^2)$ and $\mathrm{ROS}(\mathbb{R}^2)$.

Our final example illustrates a rather different set of issues concerning first- 
order mereotopological theories. We require the following fact about the topol- 
ogy of Euclidean spaces (Newman, 1964, p. 137). 


PROPOSITION 2.19 If $d_1$ and $d_2$ are non-intersecting closed sets in $\mathbb{R}^n$, and
points $p$ and $q$ are connected in $\mathbb{R}^n \setminus d_1$ and also in $\mathbb{R}^n \setminus d_2$, then $p$ and $q$ are
connected in $\mathbb{R}^n \setminus (d_1 \cup d_2)$.


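EXAMPLE 2.20 Let $\Sigma = (C, c, \cdot, -)$, and let $\psi_{\mathrm{sep}}$ be the $\mathcal{L}_\Sigma$-sentence

$$\forall x \forall y \bigl(c(x) \wedge c(y) \rightarrow (c(x \cdot y) \vee C(-x, -y))\bigr).$$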


Figure 2.6. The (complements of) two connected elements $r$ and $s$ in the regular open algebra
of a torus: $r \cdot s$ is not connected, and $(-r)^- \cap (-s)^- = \emptyset$.

This sentence “says” that the closures of the complements of any two connected
regions whose product is not connected intersect. Suppose that $r, s \in \mathrm{RO}(\mathbb{R}^n)$
are connected, with $r \cdot s$ not connected. Putting $d_1 = \mathbb{R}^n \setminus r$ and $d_2 = \mathbb{R}^n \setminus s$, we
have $d_1 \cup d_2 = \mathbb{R}^n \setminus (r \cdot s)$, whence, by Proposition 2.19, $(-r)^- \cap (-s)^- \neq \emptyset$.
Thus, if $M$ is a mereotopology over any of the spaces $\mathbb{R}^n$, $M \models \psi_{\mathrm{sep}}$. However,
$\psi_{\mathrm{sep}}$ is not true for all mereotopologies. For example, let $X$ be a torus, let $M$
be $\mathrm{RO}(X)$, and let $r, s \in M$ be such that $-r$ and $-s$ are as illustrated in
Fig. 2.6. By inspection, $r$ and $s$ are connected, $r \cdot s$ is not connected, and $-r$
does not contact $-s$. Hence, $M \models \neg\psi_{\mathrm{sep}}$.





Thus the regular open algebras of the torus and the Euclidean plane have different
first-order mereotopological theories over the signature $\{C, c, \cdot, -\}$.
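
The appeal to Proposition 2.19 in the Euclidean half of Example 2.20 is by contradiction. The following sketch spells it out, reading "connected in" as "lying in the same component of"; the points $p, q$ are chosen here purely for illustration.

```latex
% Sketch: why (-r)^- \cap (-s)^- \neq \emptyset for connected r, s in RO(R^n)
% with r \cdot s = r \cap s not connected.
\[
  d_1 = \mathbb{R}^n \setminus r = (-r)^-, \qquad
  d_2 = \mathbb{R}^n \setminus s = (-s)^-, \qquad
  d_1 \cup d_2 = \mathbb{R}^n \setminus (r \cap s).
\]
% (The identities d_1 = (-r)^- and d_2 = (-s)^- use that r and s are regular open.)
% Suppose, for contradiction, that d_1 \cap d_2 = \emptyset.
% Pick p, q in different components of r \cap s. Then:
\begin{align*}
  r \text{ connected} &\;\Longrightarrow\; p, q \text{ are connected in } \mathbb{R}^n \setminus d_1 = r, \\
  s \text{ connected} &\;\Longrightarrow\; p, q \text{ are connected in } \mathbb{R}^n \setminus d_2 = s, \\
  \text{Proposition 2.19} &\;\Longrightarrow\; p, q \text{ are connected in } \mathbb{R}^n \setminus (d_1 \cup d_2) = r \cap s,
\end{align*}
% contradicting the choice of p and q. Hence d_1 \cap d_2 \neq \emptyset.
```

On the torus the last implication is unavailable, which is exactly what Fig. 2.6 exploits.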

There is nothing privileged about the above collection of primitives: in prin- 
ciple, we could employ any signature whose symbols can be given fixed inter- 
pretations over the structures we choose to confine our attention to. Since this 
chapter deals with topological notions, we consider only signatures with fixed 
topological interpretations—that is, signatures whose interpretations are pre- 
served by homeomorphisms of the underlying topological space. For brevity, 
we speak of a “signature of topological primitives”. For investigations of region-
based theories with non-topological signatures, see, e.g. Davis et al., 1999, 
Pratt, 1999. 

Given a mereotopology $M$ and a signature $\Sigma$ of topological primitives, three
salient issues present themselves. The first concerns the expressive power of
the first-order topological language $\mathcal{L}_\Sigma$ over the mereotopology $M$. Any
$\mathcal{L}_\Sigma$-formula $\phi(\bar{x})$ with free variables $\bar{x} = x_1, \ldots, x_n$ defines an $n$-ary relation over
$M$, namely the set of $n$-tuples $\bar{r}$ satisfying $\phi(\bar{x})$ in $M$. It is therefore natural
to ask which relations can be so defined, and in particular, which primitives
can be defined in terms of which others. Of particular interest in this regard
is the property of being topologically indistinguishable from a specific object
or tuple of objects. That is, given a tuple $\bar{r}$ from $M$, we would particularly
like to know whether $\mathcal{L}_\Sigma$ is expressive enough to give a topologically complete
characterization of $\bar{r}$. The answers to these questions depend heavily on the
mereotopology $M$: Sections 3 and 4 analyse the expressive power of various
first-order topological languages for well-behaved mereotopologies over the
Euclidean plane. Sec. 6 analyses the much more difficult case of well-behaved
mereotopologies over $\mathbb{R}^3$.

The second salient issue concerns the $\mathcal{L}_\Sigma$-theory of $M$. Examples 2.17
and 2.18 show that restricting regions to be semi-algebraic (regular open) sets 
does affect the resulting first-order theory over some signatures of topological 
primitives. And the question therefore arises as to what other restrictions might 
be sensible, and what effect, if any, these restrictions have on the resulting 
first-order mereotopological theories. Most ambitiously, perhaps, we might 
ask whether the set of first-order sentences true in various mereotopologies 
can be axiomatically characterized. Sec. 5 provides an example of such an 
axiomatic characterization. As a by-product of this analysis, we show that
a wide range of plane mereotopologies share the same $\mathcal{L}_\Sigma$-theory for (most)
topological signatures $\Sigma$, and we venture to take that theory as the standard
first-order $\mathcal{L}_\Sigma$-theory of plane mereotopology. In this sense, the choice of
what, exactly, counts as a region is much less critical than we might at first have 
supposed. 

The third salient issue concerns the ontological commitments entailed by 
first-order mereotopological theories. To understand this issue, recall that a 
mereotopology M is a collection of subsets of some topological space, which 
we have chosen to regard as a $\Sigma$-structure, for some signature $\Sigma$ of topological
primitives. Any such mereotopology $M$ thus defines an $\mathcal{L}_\Sigma$-theory $\mathrm{Th}_\Sigma(M)$.
But of course, any $\Sigma$-structure $\mathfrak{A}$ with $\mathrm{Th}_\Sigma(\mathfrak{A}) = \mathrm{Th}_\Sigma(M)$ can be thought
of as a (region-based) model of space which, from the point of view of $\mathcal{L}_\Sigma$,
makes exactly the same predictions as $M$. It is therefore natural to ask which
structures these are, and what, if anything, we can say about their relationship
to $M$. Notice that the elements of such $\Sigma$-structures need not be regions
of topological spaces at all; as such they are genuinely region-based theories 
of space. In particular, we may ask whether mereotopologies in general admit of 
intrinsic characterizations making no reference to the topological spaces whose 
regions they make up. And we may further ask—particularly in the light of 
Example 2.20—what information those intrinsic characterizations yield about 
the topological spaces in question. Sec. 7 answers these, and related, questions. 

The above three issues constitute the primary agenda of mereotopology, as 
conceived here. 


3. Defining topological relations


Our task in this section is to compare the relative expressiveness of first-order
languages having different signatures of topological primitives. Our main result
is that $\mathcal{L}_C$ is at least as expressive as $\mathcal{L}_{c,\le}$ over all sensible mereotopologies.
We also show that over some mereotopologies of interest, $\mathcal{L}_{c,\le}$ is also at least
as expressive as $\mathcal{L}_C$.

We assume familiarity with the standard ($T_i$-) separation properties of topological
spaces. (Terminology varies here: we adopt the convention according
to which $T_i$-separation for $i > 2$ does not by definition imply $T_1$-separation;
and we say that a space $X$ is Hausdorff if it satisfies $T_2$-separation, regular if
it satisfies both $T_3$- and $T_1$-separation, and normal if it satisfies both $T_4$- and
$T_1$-separation.) In addition, we occasionally employ the following less familiar
separation property (Düntsch and Winter, 2005).


DEFINITION 2.21 A topological space is weakly regular if it is semi-regular
and, for any non-empty open set $u$, there exists a non-empty open set $v$ with
$v^- \subseteq u$.

We have

$$X \text{ is normal} \;\Rightarrow\; X \text{ is regular} \;\Rightarrow\; X \text{ is weakly regular} \;\Rightarrow\; X \text{ is semi-regular}.$$

The reverse implications all fail (see Düntsch and Winter, 2005 regarding weak
regularity, and Steen and Seebach, 1995 for the other cases).


3.1 Contact 
We begin by defining the part-of relation in $\mathcal{L}_C$.


LEMMA 2.22 Let $M$ be a mereotopology over a weakly regular space $X$, and
let $r_1, r_2 \in M$. Then $r_1 \le r_2$ if and only if $M \models \phi_\le[r_1, r_2]$, where $\phi_\le(x_1, x_2)$
is the $\mathcal{L}_C$-formula $\forall z\,(C(x_1, z) \rightarrow C(x_2, z))$.





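Proof If $r_1 \le r_2$ then $r_1^- \subseteq r_2^-$, so $s^- \cap r_1^- \neq \emptyset$ implies $s^- \cap r_2^- \neq \emptyset$ for any
$s$. Conversely, if $r_1 \not\le r_2$, by weak regularity, let $u$ be a non-empty, open set
such that $u^- \subseteq r_1 \cdot (-r_2)$. Since $M$ is a mereotopology, let $s \in M$ be such
that $\emptyset \neq s \subseteq u$. Then $s^- \cap r_1^- \neq \emptyset$, but $s^- \cap r_2^- = \emptyset$. QED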


In dealing with mereotopologies over weakly regular spaces, we may therefore
write the expression $u \le v$ in $\mathcal{L}_C$-formulas, as a shorthand for $\phi_\le(u, v)$. It
follows that the Boolean constants and functions $0$, $1$, $+$, $\cdot$ and $-$ are also
$\mathcal{L}_C$-definable for mereotopologies over weakly regular spaces, and we again freely
employ these symbols in $\mathcal{L}_C$-formulas as a shorthand for their definitions.

We now turn to defining the property of connectedness in $\mathcal{L}_C$. We need some
technical lemmas.
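
For concreteness, one standard way of writing the Boolean vocabulary in terms of $\le$ is sketched below. It assumes only that a mereotopology is a Boolean algebra under these operations and that equality is available as a logical symbol; the particular formulas are the usual lattice-theoretic ones and are not quoted from the text.

```latex
% Sketch: order-theoretic definitions of the Boolean constants and functions.
% Each right-hand side uses only \le (itself shorthand for \phi_\le) and equality.
\begin{align*}
  x = 0         &\;:\Longleftrightarrow\; \forall y\,(x \le y), \\
  x = 1         &\;:\Longleftrightarrow\; \forall y\,(y \le x), \\
  z = x + y     &\;:\Longleftrightarrow\; x \le z \wedge y \le z \wedge
                   \forall w\,(x \le w \wedge y \le w \rightarrow z \le w), \\
  z = x \cdot y &\;:\Longleftrightarrow\; z \le x \wedge z \le y \wedge
                   \forall w\,(w \le x \wedge w \le y \rightarrow w \le z), \\
  z = -x        &\;:\Longleftrightarrow\; x \cdot z = 0 \wedge x + z = 1 .
\end{align*}
```

The last line relies on the uniqueness of complements in a Boolean algebra; the sum and product lines simply say "least upper bound" and "greatest lower bound".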


LEMMA 2.23 Let $M$ be a mereotopology over a regular topological space $X$.
If $d \subseteq X$ is closed and $p \notin d$, there exists $r \in M$ such that $p \in r$ and $d \subseteq -r$.
In fact, there exist $r, s \in M$ such that $p \in r$, $d \subseteq s$ and $r^- \cap s^- = \emptyset$.


Proof For the first statement, by $T_3$-separation, let $u, v$ be disjoint open subsets
of $X$ such that $p \in u$ and $d \subseteq v$. Since $M$ is a mereotopology, there exists
$r \in M$ such that $p \in r \subseteq u$, whence $d \subseteq v \subseteq X \setminus r^- = -r$. The second
statement follows by two applications of the first: choose $s \in M$ such that
$p \in -s$ and $d \subseteq s$; now choose $r \in M$ such that $p \in r$ and $s^- \subseteq -r$. QED


LEMMA 2.24 Let $r, s \in \mathrm{RO}(X)$ for some topological space $X$. If $p \in r^-$
and $p \in s$, then $p \in (r \cdot s)^-$.


Proof Let $u$ be any open set containing $p$. Then $u \cap s$ is also an open set
containing $p$, whence $(u \cap s) \cap r \neq \emptyset$, since $p \in r^-$. That is, $u \cap (s \cdot r) \neq \emptyset$.
QED


LEMMA 2.25 Let $M$ be a mereotopology over a regular topological space.
For all $r_1, r_2 \in M$, $r_1^- \cap r_2^- \cap (r_1 + r_2) \neq \emptyset$ if and only if there exist $r_1', r_2' \in M$
such that $r_1' \le r_1$, $r_2' \le r_2$, $r_1'^- \cap r_2'^- \neq \emptyset$ and $(r_1' + r_2')^- \cap (-(r_1 + r_2))^- = \emptyset$.


Proof The if-direction is immediate. For the only-if-direction, suppose
$p \in r_1^- \cap r_2^- \cap (r_1 + r_2)$. By Lemma 2.23, let $s \in M$ be such that $p \in s$ and
$(-(r_1 + r_2))^- \subseteq -s$; and let $r_1' = r_1 \cdot s$ and $r_2' = r_2 \cdot s$. By Lemma 2.24,
$p \in r_1'^- \cap r_2'^-$, whence $r_1'$ and $r_2'$ have the required properties. QED


LEMMA 2.26 Let $M$ be a mereotopology which respects components. Then
$r \in M$ is connected if and only if $r_1^- \cap r_2^- \cap r \neq \emptyset$ for all non-empty, disjoint
$r_1, r_2 \in M$ such that $r_1 + r_2 = r$.


Proof Suppose $r_1$ and $r_2$ are non-empty, disjoint elements of $M$ such that
$r_1 + r_2 = r$ and $r_1^- \cap r_2^- \cap r = \emptyset$. By Lemma 2.4 (ii), $r = r_1 \cup r_2$, so
that $r$ is not connected. Conversely, suppose $r$ is not connected. Let $r_1$ be
a component of $r$ and let $r_2 = r \setminus r_1$. Since $M$ respects components, $r_1 \in M$.
Since $r_1 \subseteq r_1 \cup (r_1^- \cap r_2) \subseteq r_1^-$, $r_1 \cup (r_1^- \cap r_2)$ is connected, whence
$r_1^- \cap r_2 = \emptyset$ by maximality of components. Thus, $r_2 = r \setminus r_1^- = r \cdot (-r_1)$.
Moreover, since $r_1$ is open and $r_1 \cap r_2 = \emptyset$, we have $r_1 \cap r_2^- = \emptyset$. Therefore
$\emptyset = r_1^- \cap r_2^- \cap (r_1 \cup r_2) = r_1^- \cap r_2^- \cap r$ as required. QED


LEMMA 2.27 Let $M$ be a mereotopology over a regular topological space $X$
such that $M$ respects components, and let $r \in M$. Then $r$ is connected if and
only if $M \models \phi_c[r]$, where $\phi_c(x)$ is the $\mathcal{L}_C$-formula

$$\forall x_1 \forall x_2 \bigl(x_1 > 0 \wedge x_2 > 0 \wedge x_1 \cdot x_2 = 0 \wedge x_1 + x_2 = x \rightarrow
\exists x_1' \exists x_2' (x_1' \le x_1 \wedge x_2' \le x_2 \wedge C(x_1', x_2') \wedge \neg C(x_1' + x_2', -x))\bigr).$$

Proof Lemmas 2.25 and 2.26. QED


Together, Lemmas 2.22 and 2.27 guarantee that, for all mereotopologies over
regular topological spaces which respect components, the language $\mathcal{L}_C$ is at
least as expressive as $\mathcal{L}_{c,\le}$. We take it that all mereotopologies of interest fulfil
these conditions: that is, the above reconstructions of the part-whole relation
and the property of connectedness in $\mathcal{L}_C$ are very robust.

We present a further, and more surprising, demonstration of the expressive
power of $\mathcal{L}_C$ in mereotopologies defined over $\mathbb{R}^2$. We require the following
fact about the topology of Euclidean spaces (Newman, 1964, p. 112, cf.
Proposition 2.19).


PROPOSITION 2.28 Let $d_1$ and $d_2$ be closed sets in $\mathbb{R}^2$, at least one of which
is bounded. If $\mathbb{R}^2 \setminus d_1$, $\mathbb{R}^2 \setminus d_2$ and $d_1 \cap d_2$ are all connected, then so is
$\mathbb{R}^2 \setminus (d_1 \cup d_2)$.


LEMMA 2.29 Let $s_1, s_2, t \in \mathrm{RO}(\mathbb{R}^2)$ such that: (i) either $s_1$ is bounded or
$s_2$ is bounded; (ii) $-(s_1 + t)$, $-(s_2 + t)$ and $t$ are all connected; and (iii)
$s_1^- \cap s_2^- = \emptyset$. Then $-(s_1 + s_2 + t)$ is also connected.


Proof Set $d_i = (s_i + t)^-$ (for $i = 1, 2$). Thus, the complement of $d_i$ is $-(s_i + t)$
(for $i = 1, 2$), and the complement of $d_1 \cup d_2$ is $-(s_1 + s_2 + t)$. Moreover,
since $t$ is connected, so is $t^-$, whence $d_1 \cap d_2 = (s_1 + t)^- \cap (s_2 + t)^- =
(s_1^- \cup t^-) \cap (s_2^- \cup t^-) = (s_1^- \cap s_2^-) \cup t^- = t^-$ is connected. The result follows
by Proposition 2.28. QED





Let $\phi_c$ be as defined in Lemma 2.27, and let $\phi_{up}(y_1, y_2)$ be the $\mathcal{L}_C$-formula

$$\exists z \bigl(\phi_c(-(y_1 + z)) \wedge \phi_c(-(y_2 + z)) \wedge \phi_c(z) \wedge \neg\phi_c(-(y_1 + y_2 + z))\bigr).$$


LEMMA 2.30 Let $M$ be a mereotopology over $\mathbb{R}^2$ such that $M$ respects components
and every unbounded element in $M$ includes regions $s_1$, $s_2$ and $t$ situated
as in Fig. 2.7. Then for all $r \in M$, $r$ is bounded if and only if $M \models \phi_{b^2}[r]$,
where $\phi_{b^2}(x)$ is the $\mathcal{L}_C$-formula:

$$\forall y_1 \forall y_2 \bigl(y_1 \le x \wedge y_2 \le x \wedge \phi_{up}(y_1, y_2) \rightarrow C(y_1, y_2)\bigr).$$

(The superscript 2 in $\phi_{b^2}$ refers to the fact that this formula works for mereotopologies
over $\mathbb{R}^2$, and not, for example, $\mathbb{R}^3$.)
Figure 2.7. Expressing boundedness in $\mathcal{L}_C$: $s_1$ and $s_2$ are unbounded to the right.

Proof If $r$ does not satisfy $\phi_{b^2}(x)$ then, by Lemma 2.29, $r$ contains two
unbounded regions, so is certainly itself unbounded. Conversely, if $r$ is unbounded,
let $s_1, s_2, t \in M$ be subsets of $r$ situated as in Fig. 2.7. Thus, $s_1 \le r$,
$s_2 \le r$ and $s_1^- \cap s_2^- = \emptyset$, but at the same time, $s_1, s_2$ satisfies $\phi_{up}(y_1, y_2)$,
with $t$ a witness for the existentially quantified $z$. Hence $r$ does not satisfy
$\phi_{b^2}(x)$. QED


It is simple to verify that the mereotopologies $\mathrm{RO}(\mathbb{R}^2)$, $\mathrm{ROS}(\mathbb{R}^2)$, $\mathrm{ROP}(\mathbb{R}^2)$
and $\mathrm{ROQ}(\mathbb{R}^2)$ satisfy the conditions of Lemma 2.30. Hence, the property
of boundedness is $\mathcal{L}_C$-definable in all these mereotopologies. Nevertheless,
Lemma 2.30, unlike Lemmas 2.22 and 2.27, has a fragile character, in that it
depends on a very specific feature of the topological space $\mathbb{R}^2$; in particular,
it fails to define boundedness for the corresponding mereotopologies over $\mathbb{R}^3$.
We will see in Sec. 6 that boundedness is also $\mathcal{L}_C$-definable in well-behaved
mereotopologies over $\mathbb{R}^3$, but we have to go to much more trouble.


3.2 Reconstruction of points 


In mereotopologies, the primitive objects—that is, the entities over which 
variables range—are regions, rather than points; but it is often simple to ‘con- 
struct’ points from regions, and ‘simulate’ statements about points using state- 
ments about regions. One way to construct the point p is as a pair of regions 
whose closures intersect in the singleton {p}, as we now proceed to show. 
(There are also more sophisticated ways, described in Sec. 7.1.) 


LEMMA 2.31 Let $M$ be a mereotopology over a regular topological space,
and let $r, s \in M$. Then $r^- \cap s^-$ is a singleton if and only if $M \models \phi_q[r, s]$,
where $\phi_q(x_1, x_2)$ is the formula

$$C(x_1, x_2) \wedge \forall y_1 \forall y_2 \bigl(y_1 \le x_1 \wedge y_2 \le x_2 \wedge C(y_1, x_2) \wedge C(y_2, x_1) \rightarrow C(y_1, y_2)\bigr).$$

Furthermore, if $r^- \cap s^- = \{p\}$ and $t \in M$, then $p \in t$ if and only if
$M \models \phi_\in[r, s, t]$, where $\phi_\in(x_1, x_2, x_3)$ is the formula

$$\exists y_1 \bigl(y_1 \le x_1 \wedge C(y_1, x_2) \wedge \neg C(y_1, -x_3)\bigr);$$

likewise, $p \in t^-$ if and only if $M \models \phi_{\in^-}[r, s, t]$, where $\phi_{\in^-}(x_1, x_2, x_3)$ is the
formula

$$\forall y_1 \bigl(y_1 \le x_1 \wedge C(y_1, x_2) \rightarrow C(y_1, x_3)\bigr).$$


Proof Routine by Lemmas 2.23 and 2.24. QED 


If $M$ is a mereotopology over a topological space $X$, let us say that $M$ is complete
if, for every point $p$ in $X$, $\{p\}$ is the intersection of the closures of some pair of regions in
$M$. For example, the mereotopologies $\mathrm{ROP}(\mathbb{R}^n)$, $\mathrm{ROS}(\mathbb{R}^n)$ evidently possess
this property; by contrast, $\mathrm{ROQ}(\mathbb{R}^n)$ does not. We might say that, in a complete
mereotopology, points can be “simulated” by pairs of regions satisfying the formula
$\phi_q$. If $M$ is a complete mereotopology over a regular space, Lemma 2.31
gives us the right to include expressions such as, for example, $x_1 \cap x_3 \neq \emptyset$
or $F(x_1) \cap F(x_2) \subseteq F(x_3) \cap F(x_4)$ etc. in $\mathcal{L}_C$-formulas with the obvious
interpretation, since such expressions can evidently be replaced by bona fide
$\mathcal{L}_C$-formulas with the appropriate extension over $M$.

The following lemma illustrates how easily we can express various topological
relations in $\mathcal{L}_C$:


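LEMMA 2.32 Let $r, s \in \mathrm{ROP}(\mathbb{R}^n)$. Then $r^- \cap s^-$ is connected if and only if
$\mathrm{ROP}(\mathbb{R}^n) \models \phi_{ci}[r, s]$, where $\phi_{ci}(x, y)$ is the formula

$$\forall z\, \neg\bigl(x^- \cap y^- \cap z \neq \emptyset \;\wedge\; x^- \cap y^- \cap -z \neq \emptyset \;\wedge\; x^- \cap y^- \subseteq z \cup -z\bigr).$$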


Proof The only-if direction is immediate. So suppose $r^- \cap s^-$ is not connected;
we must find a witness for $z$ to show that $\mathrm{ROP}(\mathbb{R}^n) \models \neg\phi_{ci}[r, s]$. But, by
construction of $\mathrm{ROP}(\mathbb{R}^n)$, both $r^-$ and $s^-$ are expressible as finite unions of
closed, convex semi-linear sets; and so, therefore, is $r^- \cap s^-$. Since this latter
set is not connected, it can be written as $d \cup e$, such that $d \cap e = \emptyset$ and $d$ and
$e$ are both finite unions of non-empty, closed, convex semi-linear sets: say,
$d = d_1 \cup \cdots \cup d_l$, $e = e_1 \cup \cdots \cup e_m$. Given that such pairs of sets in $\mathbb{R}^n$ can
be separated by a hyperplane, we have half-spaces $h_{i,j}$ such that $d_i \subseteq h_{i,j}$ and
$e_j \subseteq -h_{i,j}$ for all $i, j$ ($1 \le i \le l$, $1 \le j \le m$). Then the required witness is

$$t = \sum_{1 \le i \le l}\ \prod_{1 \le j \le m} h_{i,j}.$$

QED
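
That the displayed region really is a witness can be checked mechanically. The sketch below assumes the reading of the proof in which $d_i \subseteq h_{i,j}$ and $e_j \subseteq -h_{i,j}$ for all $i, j$, and uses only that products in the regular open algebra are intersections (Table 2.1) and that De Morgan's laws hold in a Boolean algebra; the index $j'$ is introduced here for bookkeeping.

```latex
% Sketch: t = \sum_i \prod_j h_{i,j} separates d from e.
\begin{align*}
  d_i &\subseteq \bigcap_{1 \le j \le m} h_{i,j} \;=\; \prod_{1 \le j \le m} h_{i,j} \;\subseteq\; t
      && \text{for each } i, \text{ so } d \subseteq t; \\[4pt]
  -t  &= \prod_{1 \le i \le l} \Bigl( - \prod_{1 \le j \le m} h_{i,j} \Bigr)
       \;=\; \prod_{1 \le i \le l}\ \sum_{1 \le j' \le m} (-h_{i,j'})
      && \text{(De Morgan)}; \\[4pt]
  e_j &\subseteq \bigcap_{1 \le i \le l} (-h_{i,j})
       \;\subseteq\; \prod_{1 \le i \le l}\ \sum_{1 \le j' \le m} (-h_{i,j'}) \;=\; -t
      && \text{for each } j, \text{ so } e \subseteq -t .
\end{align*}
% Consequently r^- \cap s^- = d \cup e meets t (in d), meets -t (in e),
% and is included in t \cup -t, so z = t falsifies the body of the formula.
```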

3.3 Compactifications


Before discussing the expressive power of $\mathcal{L}_{c,\le}$, we introduce some additional
technical material that will be useful throughout this chapter. Recall
that a topological space is said to be locally compact if every point has a
compact neighbourhood. This property “transfers”, for Hausdorff spaces, to
mereotopologies defined over them:


LEMMA 2.33 Let $M$ be a mereotopology over a locally compact, Hausdorff
space $X$, and let $p \in X$. Then $p$ is contained within some $r \in M$ such that $r^-$
is compact.


Proof Let $p \in X$. Assuming $X$ is locally compact, let $d \subseteq X$ be compact
and $o \subseteq d$ be open such that $p \in o$. Now let $r \in M$ such that $p \in r \subseteq o \subseteq d$.
But a closed subset of a compact set is always compact, and, in a Hausdorff
space, every compact set is closed. Therefore $r^- \subseteq d^- = d$ is compact, as
required. QED


Let $X$ be a topological space, and let $\mathcal{T}$ denote the collection of open sets of
$X$. Now set $\dot{X} = X \cup \{\infty\}$, where $\infty$ is some object not in $X$. For $o \in \mathcal{T}$,
denote by $\dot{o}$ the set

$$\dot{o} = \begin{cases} o \cup \{\infty\} & \text{if } X \setminus o \text{ is compact;} \\ o & \text{otherwise,} \end{cases}$$

and denote by $\dot{\mathcal{T}}$ the set $\mathcal{T} \cup \{\dot{o} \mid o \in \mathcal{T}\}$. Then we can take $\dot{X}$ to be a topological
space whose collection of open sets is $\dot{\mathcal{T}}$. Under this topology (which we always
assume), we call $\dot{X}$ the one-point (or Alexandroff) compactification of $X$. The
object $\infty$ is called the point at infinity. The space $\dot{X}$ is always compact. If $X$
is locally compact and Hausdorff, then $\dot{X}$ is also Hausdorff, and hence normal.
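
As a small worked illustration of the dotting operation, take $X = \mathbb{R}$ with its usual topology. The four open sets $o_1, \ldots, o_4$ below are chosen here only as examples; by the Heine-Borel theorem, a subset of $\mathbb{R}$ is compact exactly when it is closed and bounded.

```latex
% Sketch: which open subsets of X = R acquire the point at infinity.
\begin{align*}
  o_1 &= \mathbb{R} \setminus [-1,1], & X \setminus o_1 &= [-1,1] \ \text{(compact)},
      & \dot{o}_1 &= o_1 \cup \{\infty\}; \\
  o_2 &= \,]0, +\infty[\,, & X \setminus o_2 &= \,]-\infty, 0] \ \text{(not compact)},
      & \dot{o}_2 &= o_2; \\
  o_3 &= \emptyset, & X \setminus o_3 &= \mathbb{R} \ \text{(not compact)},
      & \dot{o}_3 &= \emptyset; \\
  o_4 &= \mathbb{R}, & X \setminus o_4 &= \emptyset \ \text{(compact)},
      & \dot{o}_4 &= \mathbb{R} \cup \{\infty\}.
\end{align*}
```

So an open set picks up the point at infinity exactly when it contains the "far ends" of the line in both directions; the third line reflects the fact that the empty set stays empty whenever $X$ itself is non-compact.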


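NOTATION 2.34 In this chapter, we denote spheres, open balls and closed
balls in Euclidean spaces as follows

$$\begin{aligned}
S^n &= \{(a_1, \ldots, a_{n+1}) \in \mathbb{R}^{n+1} \mid a_1^2 + \cdots + a_{n+1}^2 = 1\} \\
B^n &= \{(a_1, \ldots, a_n) \in \mathbb{R}^n \mid a_1^2 + \cdots + a_n^2 < 1\} \\
D^n &= \{(a_1, \ldots, a_n) \in \mathbb{R}^n \mid a_1^2 + \cdots + a_n^2 \le 1\};
\end{aligned}$$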














and we assume the usual topology on these sets. 


Figure 2.8. Stereographic projection of $S^2$ onto the 1-point compactification of $\mathbb{R}^2$.

(Recall that, by a $d$-cell, we mean any set homeomorphic to the open
$d$-dimensional ball $B^d$.) In the special cases $X = \mathbb{R}^n$, it is well-known that $\dot{X}$
is homeomorphic to $S^n$ via the mapping:

$$\infty \mapsto (0, \ldots, 0, 1), \qquad (a_1, \ldots, a_n) \mapsto (a_1', \ldots, a_{n+1}'),$$

where

$$a_i' = \frac{4a_i}{a_1^2 + \cdots + a_n^2 + 4} \quad \text{for } 1 \le i \le n, \qquad
a_{n+1}' = \frac{a_1^2 + \cdots + a_n^2 - 4}{a_1^2 + \cdots + a_n^2 + 4}.$$

This mapping may be regarded as a stereographic projection by embedding
$\mathbb{R}^n$ in the hyperplane of $\mathbb{R}^{n+1}$ defined in Cartesian geometry by the equation
$x_{n+1} = -1$. This projection is depicted for the case $n = 2$ in Fig. 2.8. By way
of allusion to this homeomorphism:


NOTATION 2.35 Let $S^n$ denote the 1-point compactification of $\mathbb{R}^n$.
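
The formulas for the stereographic mapping can be checked directly. The sketch below verifies that the image of a point of $\mathbb{R}^n$ lies on the unit sphere, and that it is the second intersection of the sphere with the line joining the pole $(0, \ldots, 0, 1)$ to the embedded point; the abbreviations $s$ and $\lambda$ are introduced here for the calculation only.

```latex
% Sketch: verifying the stereographic projection formulas.
% Abbreviation (assumed here): s = a_1^2 + ... + a_n^2.
\begin{align*}
  \sum_{i=1}^{n} (a_i')^2 + (a_{n+1}')^2
    &= \frac{16\,s + (s - 4)^2}{(s + 4)^2}
     = \frac{s^2 + 8s + 16}{(s + 4)^2}
     = 1 .
\end{align*}
% The line through N = (0,...,0,1) and the embedded point (a_1,...,a_n,-1) is
%   N + \lambda\,((a_1,...,a_n,-1) - N) = (\lambda a_1, ..., \lambda a_n, 1 - 2\lambda).
% It meets the unit sphere when
\begin{align*}
  \lambda^2 s + (1 - 2\lambda)^2 = 1
  \;\Longleftrightarrow\; \lambda\,\bigl(\lambda (s + 4) - 4\bigr) = 0
  \;\Longleftrightarrow\; \lambda = 0 \ \text{ or } \ \lambda = \frac{4}{s + 4} .
\end{align*}
% \lambda = 0 gives N itself; \lambda = 4/(s+4) gives
\begin{align*}
  \lambda a_i = \frac{4 a_i}{s + 4} = a_i', \qquad
  1 - 2\lambda = \frac{s + 4 - 8}{s + 4} = \frac{s - 4}{s + 4} = a_{n+1}' .
\end{align*}
```

As $s \to \infty$ the image tends to $(0, \ldots, 0, 1)$, which is why the point at infinity is sent to the pole.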


LEMMA 2.36 Let $X$ be a non-compact topological space. Then the mapping
$r \mapsto \dot{r}$ is a Boolean algebra isomorphism from $\mathrm{RO}(X)$ to $\mathrm{RO}(\dot{X})$.


Proof The function $o \mapsto \dot{o}$ is monotone, because a closed subset of a compact
set is compact. Let $o_1$ and $o_2$ be open subsets of $X$, with $o = o_1 \cap o_2$. Since
$X \setminus o = (X \setminus o_1) \cup (X \setminus o_2)$ is compact if and only if both $(X \setminus o_1)$ and $(X \setminus o_2)$
are compact, we have $\infty \in \dot{o}$ if and only if $\infty \in \dot{o}_1 \cap \dot{o}_2$, whence $\dot{o} = \dot{o}_1 \cap \dot{o}_2$.

If $u$ is open in $X$, let $u^*$ denote the pseudo-complement of $u$ in $X$, and let $(\dot{u})^*$
denote the pseudo-complement of $\dot{u}$ in $\dot{X}$. We claim that, for any open set $u$
of $X$, with $v = u^*$, $(\dot{u})^* = \dot{v}$. By definition, $\dot{v}$ is open in $\dot{X}$, and we have just
shown that $\dot{u} \cap \dot{v} = \dot{\emptyset} = \emptyset$. Moreover, if $w$ is any open set in $\dot{X}$ disjoint from $\dot{u}$,
then for some open subset $w'$ of $X$, we have either $w = w'$ or $w = \dot{w}'$. Either
way, $u \cap w' = \emptyset$, whence $w' \subseteq v$ and $\dot{w}' \subseteq \dot{v}$ by monotonicity. Hence $\dot{v}$ is the
largest open subset of $\dot{X}$ disjoint from $\dot{u}$, i.e. $(\dot{u})^* = \dot{v}$.

Note that if $r \in \mathrm{RO}(X)$, we have $r = r^{**}$ and $-r = r^*$. Hence $(\dot{r})^{**} = \dot{r}$,
so that $\dot{r} \in \mathrm{RO}(\dot{X})$. Conversely, if $u' \in \mathrm{RO}(\dot{X})$, then $u' = \dot{u}$, for some open
$u \subseteq X$. But then, we have $\dot{u} = (\dot{u})^{**} = \dot{x}$, where $x = u^{**}$. Since the function
$o \mapsto \dot{o}$ is injective, $u = u^{**}$. That is, $u \in \mathrm{RO}(X)$. QED

LEMMA 2.37 Let $X$ be a topological space and $o \subseteq X$ open. If $o$ is connected
in $X$, then $\dot{o}$ is connected in $\dot{X}$. Conversely, suppose $X$ is non-compact, and for
any closed subsets $d_1$ and $d_2$ of $X$ with $X = d_1 \cup d_2$ and $d_1 \cap d_2$ compact, either
$d_1$ is compact or $d_2$ is compact. If $\dot{o}$ is connected in $\dot{X}$, then $o$ is connected
in $X$.


Proof Suppose $o$ is open in $X$. If $\dot{o}$ is not connected in $\dot{X}$, let $o_1, o_2$ be
non-empty open subsets of $X$ such that $\dot{o} = \dot{o}_1 \cup \dot{o}_2$ and $\dot{o}_1 \cap \dot{o}_2 = \emptyset$. Then
$o = o_1 \cup o_2$ and $o_1 \cap o_2 = \emptyset$, so $o$ is not connected in $X$. Conversely, suppose
$o$ is not connected in $X$, so let $o_1, o_2$ be non-empty open subsets of $X$ such
that $o = o_1 \cup o_2$ and $o_1 \cap o_2 = \emptyset$. If $X \setminus o$ is not compact, then neither
$X \setminus o_1$ nor $X \setminus o_2$ is compact, so that $\dot{o} = o = o_1 \cup o_2 = \dot{o}_1 \cup \dot{o}_2$ and
$\dot{o}_1 \cap \dot{o}_2 = \emptyset$, whence $\dot{o}$ is not connected. If, on the other hand, $X \setminus o$ is compact,
by the condition of the lemma, either $X \setminus o_1$ or $X \setminus o_2$ is compact, whence
$\dot{o} = o \cup \{\infty\} = o_1 \cup o_2 \cup \{\infty\} = \dot{o}_1 \cup \dot{o}_2$. Moreover, by repeating the first
paragraph of the proof of Lemma 2.36, we have $\dot{o}_1 \cap \dot{o}_2 = \dot{\emptyset} = \emptyset$. It follows
that $\dot{o}$ is not connected. QED


The well-known Heine-Borel theorem states that, in $\mathbb{R}^n$, a set is compact if and
only if it is closed and bounded. It is therefore easy to see that $\mathbb{R}^n$ satisfies the
condition of Lemma 2.37.


LEMMA 2.38 Let $n > 0$ and let $M$ be any mereotopology over $\mathbb{R}^n$. Then the
mapping $r \mapsto \dot{r}$ defines a structure isomorphism from $M$ to $\dot{M}$ for the signature
$(c, \le)$: that is, $M \simeq_{c,\le} \dot{M}$.


Proof Lemmas 2.36 and 2.37. QED 


LEMMA 2.39 Let $X$ be a locally compact, non-compact topological space
and $M$ a mereotopology over $X$. Define $\dot{M} = \{\dot{r} \mid r \in M\}$. Then $\dot{M}$ is a
mereotopology over $\dot{X}$. We call $\dot{M}$ the 1-point compactification of $M$. If $M$ is
finitely decomposable, then so is $\dot{M}$.


Proof Suppose that $\infty \in \dot{o}$ with $o$ open in $X$; we show that there exists some
$r \in M$ such that $\infty \in \dot{r} \subseteq \dot{o}$. Since $M$ is a mereotopology over $X$ and $X$
is locally compact, Lemma 2.33 gives us a cover of $X \setminus o$ by elements of $M$
whose closures are compact. Since $\infty \in \dot{o}$, $X \setminus o$ is compact, so that this
cover has a finite sub-cover, say $r_1, \ldots, r_n$. Let $r = -(r_1 + \cdots + r_n)$. Thus,
$X \setminus r = r_1^- \cup \cdots \cup r_n^-$ is compact and includes $X \setminus o$, whence $r$ has the required
properties. The rest of the Lemma follows from Lemma 2.37. QED


Suppose now that $X = \mathbb{R}^n$ for some $n > 0$, and let $M$ be a mereotopology
over $\mathbb{R}^n$ respecting components. Then $X$ satisfies the condition of Lemma 2.37,
so by Lemma 2.39, $\dot{M}$ is a mereotopology over $S^n$ respecting components.
Since $S^n$ denotes the 1-point compactification of $\mathbb{R}^n$, the 1-point compactification
of $\mathrm{RO}(\mathbb{R}^n)$ is thus $\mathrm{RO}(S^n)$.


NOTATION 2.40 Let $\mathrm{ROS}(S^n)$ denote the 1-point compactification of
$\mathrm{ROS}(\mathbb{R}^n)$, and similarly for $\mathrm{ROP}(S^n)$, $\mathrm{ROQ}(S^n)$.


It is often more convenient to work with $S^2$ and $S^3$ rather than $\mathbb{R}^2$ and $\mathbb{R}^3$. When
we need to make the distinction explicit, we refer to elements of $\mathrm{ROP}(\mathbb{R}^n)$
as polytopes (polyhedra, polygons) in open space and those of $\mathrm{ROP}(S^n)$ as
polytopes (polyhedra, polygons) in closed space. Note that, by Lemma 2.38,
the mereotopologies $\mathrm{RO}(\mathbb{R}^n)$, $\mathrm{ROP}(\mathbb{R}^n)$, $\mathrm{ROQ}(\mathbb{R}^n)$ and $\mathrm{ROS}(\mathbb{R}^n)$ certainly
all have the same $\mathcal{L}_{c,\le}$-theories as their respective 1-point compactifications.


3.4 Connectedness: the closed plane 


We have seen that, over most mereotopologies of interest, the language $\mathcal{L}_C$ is
as expressive as the language $\mathcal{L}_{c,\le}$. The question therefore arises as to whether
a converse reduction is possible. In this section, we show that, for well-behaved
mereotopologies over $S^2$, the answer is positive.

We assume familiarity with basic geometric topology in the plane: for details,
see Newman, 1964. Recall in this context that a Jordan arc in a topological
space $X$ is a homeomorphism from the unit interval $[0, 1]$ into $X$, and a Jordan
curve in $X$, a homeomorphism from the unit circle $S^1$ into $X$. The Jordan
curve theorem states that the locus of a Jordan curve in $\mathbb{R}^2$ separates $\mathbb{R}^2$ into
two residual domains, exactly one of which is bounded. If we regard $S^1$ as the
intersection of the plane $x_3 = 0$ with $S^2$, the Schönflies Theorem states that a
Jordan curve $\gamma : S^1 \to S^2$ may be extended to a homeomorphism $S^2 \to S^2$.
Thus, if $\gamma$ is a Jordan curve in $S^2$, the residual domains of $|\gamma|$ are 2-cells in $S^2$;
and if $\gamma$ is a Jordan curve in $\mathbb{R}^2$, the bounded residual domain of $|\gamma|$ is a 2-cell
in $\mathbb{R}^2$.

The following concepts are important in understanding the good behaviour
of the mereotopologies $\mathrm{ROS}(\mathbb{R}^2)$, $\mathrm{ROP}(\mathbb{R}^2)$ and $\mathrm{ROQ}(\mathbb{R}^2)$.


DEFINITION 2.41 Let $X$ be a topological space, $u \subseteq X$ and $p, q \in F(u)$.
An end-cut to $p$ in $u$ is a Jordan arc $f$ in $X$ such that $f(1) = p$ and $f([0,1[) \subseteq u$.
Likewise, a cross-cut from $p$ to $q$ in $u$ is a Jordan arc $f$ in $X$ such that $f(0) = p$,
$f(1) = q$ and $f(]0,1[) \subseteq u$. Let $M$ be a mereotopology over $X$. We say that $M$
has curve-selection if, for all $r \in M$ and all $p \in F(r)$, there exists an end-cut
in $r$ to $p$.


The existence of end-cuts is by no means a universal property of regular open
sets in $\mathbb{R}^n$ (for $n > 1$). However, the regions in $\mathrm{ROS}(\mathbb{R}^2)$, $\mathrm{ROP}(\mathbb{R}^2)$ and
$\mathrm{ROQ}(\mathbb{R}^2)$ are well-behaved in this regard, as the following results show.

LEMMA 2.42 Let $r \in \mathrm{ROP}(\mathbb{R}^n)$ and $p \in r^-$. Then there exists a linear
function $f : [0,1] \to \mathbb{R}^n$ such that $f(1) = p$ and $f([0,1[) \subseteq r$. If $p$ has
rational coordinates, we may choose $f$ so that it has parameters from $\mathbb{Q}$.


Proof The proposition holds for basic polytopes because their closures are
convex. It holds for all polytopes because if $r = r_1 + \cdots + r_n$, $r^- = r_1^- \cup \cdots \cup r_n^-$
by Lemma 2.4 (iii). QED


The semi-algebraic case is much more involved. However, we have the follow- 
ing theorem (van den Dries, 1998, Ch. 6, Corollary 1.5; Bochnak et al., 1998 
Theorem 2.5.5). 


PROPOSITION 2.43 (CURVE-SELECTION LEMMA) Let $S$ be a semi-algebraic
subset of $\mathbb{R}^n$ and $p \in S^-$. Then there exists a continuous semi-algebraic
function $f : [0,1] \to \mathbb{R}^n$ such that $f(1) = p$ and $f([0,1[) \subseteq S$.





Thus, the mereotopologies $\mathrm{ROS}(\mathbb{R}^2)$, $\mathrm{ROP}(\mathbb{R}^2)$ and $\mathrm{ROQ}(\mathbb{R}^2)$ all certainly
have curve-selection. Moreover, by making only minor modifications to the
relevant arguments, it can be shown that $\mathrm{ROS}(S^2)$, $\mathrm{ROP}(S^2)$ and $\mathrm{ROQ}(S^2)$
all have curve-selection too.

With these preliminaries behind us, we can turn to the expressive power of
$\mathcal{L}_{c,\le}$. We note in passing that, since $\le$ is a primitive of $\mathcal{L}_{c,\le}$, we may write the
Boolean operators and constants $+$, $\cdot$, $-$, $0$ and $1$ in $\mathcal{L}_{c,\le}$-formulas, assuming
them to be replaced by their usual definitions. In mereotopologies over the
closed plane having curve-selection, we can express the property of being a
2-cell using an $\mathcal{L}_{c,\le}$-formula. To see this, we recall that the Jordan Curve Theorem
has the following converse (see Newman, 1964 Ch. VI, Theorem 16.1).


PROPOSITION 2.44 (CONVERSE OF JORDAN’S THEOREM) Let $d$ be a
closed subset of $S^2$ such that $S^2 \setminus d$ has two components, and suppose that,
for each $p \in d$, and each component $o$ of $S^2 \setminus d$, there is an end-cut to $p$ in $o$.
Then $d$ is the locus of a Jordan curve.


Then we have: 


LEMMA 2.45 Let $M$ be any mereotopology over $S^2$ having curve-selection.
Then, for all $r \in M$, $r$ is a 2-cell if and only if $r$ is non-zero and connected
with non-zero connected complement; that is, if and only if $M \models \psi_s[r]$, where
$\psi_s(x)$ is the $\mathcal{L}_{c,\le}$-formula

$$c(x) \wedge x > 0 \wedge c(-x) \wedge -x > 0.$$

Proof If $M \models \psi_s[r]$, then $d = F(r)$ satisfies the conditions of Proposition 2.44,
since $M$ has curve-selection. The other direction is immediate. QED

Furthermore:


LEMMA 2.46 Let $M$ be a mereotopology over $\mathbb{R}^2$ having curve-selection and
also satisfying the conditions of Lemma 2.30. Then $r \in M$ is a 2-cell if and
only if $r$ satisfies the $\mathcal{L}_C$-formula

$$\phi_c(x) \wedge x > 0 \wedge \phi_c(-x) \wedge -x > 0 \wedge \phi_{b^2}(x),$$

where $\phi_c(x)$ and $\phi_{b^2}(x)$ are as defined in Lemmas 2.27 and 2.30, respectively.


Proof If $r$ satisfies the formula, then the bounded set $F(r)$ is the locus of a
Jordan curve in $S^2$ and hence in $\mathbb{R}^2$ by the same reasoning as for Lemma 2.45,
and since $r$ is the bounded residual domain of this set, it is a 2-cell. The other
direction is again immediate. QED


We now proceed to a direct comparison between $\mathcal{L}_{c,\le}$ and $\mathcal{L}_C$. Proposition 2.28
has a closed-plane variant, in which the condition that one of $d_1$ and $d_2$ is
bounded may be dropped.


PROPOSITION 2.47 Let $d_1$ and $d_2$ be closed sets in $S^2$. If $S^2 \setminus d_1$, $S^2 \setminus d_2$ and
$d_1 \cap d_2$ are all connected, then so is $S^2 \setminus (d_1 \cup d_2)$.


This leads to a closed-plane variant of Lemma 2.29: 





LEMMA 2.48 Let $s_1, s_2, t \in \mathrm{RO}(S^2)$ such that: (i) $-(s_1 + t)$, $-(s_2 + t)$ and $t$
are all connected; and (ii) $s_1^- \cap s_2^- = \emptyset$. Then $-(s_1 + s_2 + t)$ is also connected.


Proof As for Lemma 2.29, using Proposition 2.47 in place of Proposition 2.28. 
QED 


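We can now state the lemma which ensures that, for certain mereotopologies
over the closed plane, $\mathcal{L}_{c,\le}$ is as expressive as $\mathcal{L}_C$.


LEMMA 2.49 Let $M$ be any finitely decomposable mereotopology over $S^2$
having curve-selection, let $\psi_{up}(y_1, y_2)$ be the $\mathcal{L}_{c,\le}$-formula

$$\exists z \bigl(c(-(y_1 + z)) \wedge c(-(y_2 + z)) \wedge c(z) \wedge \neg c(-(y_1 + y_2 + z))\bigr);$$

and let $\psi_C(x_1, x_2)$ be the $\mathcal{L}_{c,\le}$-formula

$$\exists y_1 \exists y_2 \bigl(y_1 \le x_1 \wedge y_2 \le x_2 \wedge \psi_{up}(y_1, y_2)\bigr).$$

Then, for all $r_1, r_2 \in M$, $r_1^- \cap r_2^- \neq \emptyset$ if and only if $M \models \psi_C[r_1, r_2]$.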








The if-direction is given by Lemma 2.48. The only-if-direction is given by
Lemma 2.83, which we present at the end of Sec. 4.3 (at which point we will
be in a better position to give a succinct proof). In the meanwhile, however,
we observe that, for the mereotopologies $\mathrm{ROQ}(S^2)$, $\mathrm{ROP}(S^2)$ and $\mathrm{ROS}(S^2)$
at least, the only-if direction of Lemma 2.49 is almost immediate. (The reader
may wish to treat it as an exercise.)

Putting together Lemmas 2.22, 2.27 and 2.49, we see that $\mathcal{L}_{c,\le}$ is exactly as
expressive as $\mathcal{L}_C$ in well-behaved mereotopologies over the closed plane $S^2$.

As a final example of the expressiveness of the language $\mathcal{L}_C$, we observe
that it can distinguish between $\mathbb{R}^2$ and its 1-point compactification.


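THEOREM 2.50 Let $M$ be any of $\mathrm{ROS}(\mathbb{R}^2)$, $\mathrm{ROP}(\mathbb{R}^2)$ or $\mathrm{ROQ}(\mathbb{R}^2)$. Then
$M \not\simeq_C \dot{M}$.


Proof Recall the $\mathcal{L}_C$-formula $\phi_{b^2}(x)$ defined in Lemma 2.30, and expressing
the property of being bounded over $M$. Evidently, $M \not\models \forall x\, \phi_{b^2}(x)$. But it is
an easy consequence of Lemma 2.48 that $\dot{M} \models \forall x\, \phi_{b^2}(x)$. QED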
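
The two claims in this proof can be unpacked as follows. This is a sketch only: it assumes that the connectedness formula of Lemma 2.27 also defines connectedness in the compactified mereotopology (which is a mereotopology over $S^2$ respecting components), and it uses $\phi_{b^2}$ and $\phi_{up}$ as working names for the boundedness formula of Lemma 2.30 and its auxiliary formula.

```latex
% Sketch of the proof of Theorem 2.50.
% (a) M \not\models \forall x\,\phi_{b^2}(x):
%     by Lemma 2.30, \phi_{b^2} defines boundedness in M, and the region
%     1 = R^2 belongs to M and is unbounded.
% (b) \dot{M} \models \forall x\,\phi_{b^2}(x):
%     let r, y_1, y_2 \in \dot{M} with y_1 \le r, y_2 \le r and
%     \dot{M} \models \phi_{up}[y_1, y_2], witnessed by z = t. Then
\begin{align*}
  & -(y_1 + t),\ \ -(y_2 + t),\ \ t \ \text{ are connected}, \qquad
    -(y_1 + y_2 + t) \ \text{ is not connected} \\
  & \quad\Longrightarrow\quad y_1^- \cap y_2^- \neq \emptyset
    \qquad \text{(contrapositive of Lemma 2.48)} \\
  & \quad\Longrightarrow\quad \dot{M} \models C[y_1, y_2] .
\end{align*}
% So every r in \dot{M} satisfies the boundedness formula. Since M and \dot{M}
% disagree on a sentence in the signature {C}, no C-isomorphism between them exists.
```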


Theorem 2.50 stands in sharp contrast to the situation with the signature $\{c, \le\}$
reported in Lemma 2.38.


4. Expressiveness of first-order languages in plane 
mereotopologies 


In the previous section, we examined the relative expressive power of the
languages $\mathcal{L}_C$ and $\mathcal{L}_{c,\le}$ for various mereotopologies, in particular those defined
over $\mathbb{R}^2$ and $S^2$. This section characterizes that expressive power in a more
‘absolute’ way. We employ the following terminology:


DEFINITION 2.51 Let $X$ be a topological space and let $\bar{u} = u_1, \ldots, u_n$,
$\bar{v} = v_1, \ldots, v_n$ be $n$-tuples of subsets of $X$. We say that $\bar{u}$ and $\bar{v}$ are similarly
situated (in $X$), and write $\bar{u} \sim_X \bar{v}$, if there is a homeomorphism of $X$ onto
itself mapping $\bar{u}$ to $\bar{v}$. If $X$ is clear from context, we omit reference to it, and
simply write $\bar{u} \sim \bar{v}$. Now let $M$ be a mereotopology over $X$ and $\Sigma$ a signature
of topological primitives. For any $\mathcal{L}_\Sigma$-formula $\phi$ with free variables $\bar{x}$, we say
that $\phi$ is topologically complete (in $M$ over $\Sigma$) if any pair of tuples of the
appropriate arity satisfying $\phi(\bar{x})$ in $M$ are similarly situated in $X$.


Readers familiar with basic geometric topology will recognize that the
mereotopologies $\mathrm{ROS}(S^2)$, $\mathrm{ROP}(S^2)$ and $\mathrm{ROQ}(S^2)$ are all (finitely) “triangulable” (in
the sense of van den Dries, 1998). Moreover, the observations of Sec. 3.2
strongly suggest that triangulations in these mereotopologies can be
combinatorially described using first-order formulas with $C$ as their only primitive. And
since combinatorially isomorphic triangulations are similarly situated, it should
be entirely unsurprising that every tuple in these mereotopologies satisfies a
topologically complete $\mathcal{L}_C$-formula (and hence also a topologically complete
$\mathcal{L}_{c,\le}$-formula). That is: every tuple of regions in any of the mereotopologies
$\mathrm{ROS}(S^2)$, $\mathrm{ROP}(S^2)$ and $\mathrm{ROQ}(S^2)$ can be completely topologically described
by an $\mathcal{L}_C$-formula (or by an $\mathcal{L}_{c,\le}$-formula). Results of this general kind were
proved, independently, by Kuijpers et al., 1995, Papadimitriou et al., 1999
and Pratt and Schoop, 2000, by a variety of methods. Our objective here is a
systematic and general investigation of this topic, using an approach which will
prove useful in Sections 5 and 7.


4.1 Connected partitions 


We have seen that, given a collection $\Sigma$ of topological primitives, any
mereotopology can be regarded as a $\Sigma$-structure by interpreting the symbols in $\Sigma$
in the standard way. And the question then naturally arises as to whether we
can obtain a converse to this observation. That is: under what conditions is
a given $\Sigma$-structure isomorphic to some mereotopology, or perhaps to some
mereotopology belonging to a certain class? Since this question will preoccupy
us in the sequel, some of the results below will be presented at a higher level
of generality than their immediate applications warrant.

Accordingly, throughout Sections 4.1 and 4.2, $\mathfrak{A}$ shall denote an arbitrary
structure interpreting the signature $\{0, 1, +, \cdot, -, c\}$, such that the reduct of
$\mathfrak{A}$ to the signature $\{0, 1, +, \cdot, -\}$ is a Boolean algebra. To avoid notational
clutter, if $a, b \in A$, we write $0$, $-a$, $a + b$ etc., rather than the more correct
$0^{\mathfrak{A}}$, $-^{\mathfrak{A}}(a)$, $+^{\mathfrak{A}}(a, b)$ etc. In addition, abusing terminology slightly, we call an
element $a \in A$ connected if $\mathfrak{A} \models c[a]$; and we say that $\mathfrak{A}$ is finitely
decomposable if, for every $a \in A$, there exist connected elements $a_1, \dots, a_n$ of $\mathfrak{A}$
such that $a = a_1 + \cdots + a_n$. Of course, in case $\mathfrak{A}$ is a mereotopology $M$, this
usage is consistent with that adopted above. As usual in the context of Boolean
algebras, we take a partition in $\mathfrak{A}$ to be a tuple of non-zero, pairwise disjoint
elements summing to 1. If $\bar{a}$ is any tuple from $\mathfrak{A}$ (not necessarily a partition),
and $\bar{b}$ a partition in $\mathfrak{A}$, we say that $\bar{a}$ can be refined to $\bar{b}$ if every element of $\bar{a}$
can be written as the sum of (zero or more) elements of $\bar{b}$.


DEFINITION 2.52 A partition $\bar{a} = a_1, \dots, a_n$ in $\mathfrak{A}$ such that $a_i$ is connected
for all $i$ ($1 \le i \le n$) is called a connected partition.


Let $\psi_{con}$ denote the $\mathcal{L}_{c,\le}$-sentence

$\forall x \forall y\, (c(x) \wedge c(y) \wedge x \cdot y \neq 0 \to c(x + y))$.


Thus, $\psi_{con}$ “says” that the sum of two overlapping connected regions is
connected.


LEMMA 2.53 Let $M$ be any mereotopology. Then $M \models \psi_{con}$.


Proof A restatement of Lemma 2.4 (iv). QED


CLAIM 2.54 Suppose $\mathfrak{A}$ is finitely decomposable, and $\mathfrak{A} \models \psi_{con}$. Then every
tuple in $\mathfrak{A}$ can be refined to a connected partition.








Proof Given elements $a_1, \dots, a_n$, collect all the non-zero products
$b_1, \dots, b_N$ of the form $\pm a_1 \cdot \ldots \cdot \pm a_n$. For each $j$ ($1 \le j \le N$), let
$b_{j,1}, \dots, b_{j,N_j}$ be connected elements of $\mathfrak{A}$ summing to $b_j$. If, for any two of
these elements, say $b_{j,k}$ and $b_{j,k'}$, we have $b_{j,k} \cdot b_{j,k'} > 0$, then we can replace them
by their sum $b_{j,k} + b_{j,k'}$ (which is connected, because $\mathfrak{A} \models \psi_{con}$). Proceeding
in this way, we obtain the desired refinement. QED


Note that, in particular, every tuple in any finitely decomposable mereotopology 
can be refined to a connected partition. 
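
The refinement procedure in the proof of Claim 2.54 can be tried out on a finite stand-in for $\mathfrak{A}$. The sketch below is an illustration only, under assumptions that are not part of the text: regions are sets of cells of a finite graph, the Boolean operations are the set operations, and a region counts as connected when its cells induce a connected subgraph. The names `components` and `refine_to_connected_partition` are hypothetical.

```python
def components(region, adj):
    """Split a set of cells into its maximal connected pieces (depth-first search)."""
    remaining, pieces = set(region), []
    while remaining:
        start = remaining.pop()
        piece, stack = {start}, [start]
        while stack:
            cell = stack.pop()
            for nb in adj[cell]:
                if nb in remaining:
                    remaining.remove(nb)
                    piece.add(nb)
                    stack.append(nb)
        pieces.append(frozenset(piece))
    return pieces


def refine_to_connected_partition(regions, universe, adj):
    """Refine a tuple of regions to a connected partition of the universe.

    Step 1: group cells by membership pattern; each group is a non-zero
            product of the form (+/-)a_1 . ... . (+/-)a_n.
    Step 2: decompose every such product into connected pieces.
    Because the pieces chosen here are maximal, they are already pairwise
    disjoint, so the merging step of the proof is not needed in this model.
    """
    products = {}
    for cell in universe:
        pattern = tuple(cell in r for r in regions)
        products.setdefault(pattern, set()).add(cell)
    parts = []
    for product in products.values():
        parts.extend(components(product, adj))
    return parts


# Example (assumed toy data): five cells in a row, two regions at the ends.
path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
partition = refine_to_connected_partition([{0, 1}, {3, 4}], range(5), path)
```

Every input region is then a union of zero or more of the returned parts, which is the discrete counterpart of "can be refined to".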

Let us restrict attention now to finitely decomposable mereotopologies over
$S^2$ having curve-selection.


LEMMA 2.55 Let $M$ be a mereotopology over $\mathbb{R}^2$ or $S^2$ having curve-selection.
If $r_1$, $r_2$ and $r_3$ are pairwise disjoint, connected elements of $M$, then there exist
at most two points lying on the frontiers of all three regions.


Proof We suppose that $p_1$, $p_2$ and $p_3$ are distinct points all lying on the frontiers
of $r_1$, $r_2$ and $r_3$ and derive a contradiction. Choose points $q_1, q_2, q_3$ such that
$q_i \in r_i$ ($i = 1, 2, 3$). By curve-selection, draw three end-cuts in $r_i$, say $\gamma_{i,1}$, $\gamma_{i,2}$
and $\gamma_{i,3}$, from $q_i$ to $p_1$, $p_2$ and $p_3$, respectively. It is easy to see that, within each
$r_i$ ($1 \le i \le 3$), the $\gamma_{i,j}$ can be chosen so that they intersect only at $q_i$. But since
the $r_i$ are disjoint, each $\gamma_{i,j}$ intersects any other $\gamma_{i',j'}$ only in $p_j$ or $q_i$. And it is
well known that this is impossible (see the right-hand graph in Fig. 2.10). QED


For $n \ge 2$, let $\psi^n_{sum}$ denote the $\mathcal{L}_{c,\le}$-formula

$\forall x_1 \ldots \forall x_n \Big( c(x_1 + \cdots + x_n) \wedge \bigwedge_{1 \le i \le n} c(x_i) \to \bigvee_{2 \le i \le n} c(x_1 + x_i) \Big)$.

(The formula $\varphi_{sum}$ of Example 2.18 is just $\psi^3_{sum}$.) Thus, $\psi^n_{sum}$ “says” that, if $n$
connected regions have a connected sum, the first must form a connected sum
with at least one of the others.


LEMMA 2.56 Let $M$ be a finitely decomposable mereotopology over $S^2$
having curve-selection. Then $M \models \psi^n_{sum}$ for all $n > 1$.


Proof Let $r_1, \dots, r_n$ be connected with $r_1 + \cdots + r_n$ also connected. Assume
first that the $r_i$ are pairwise disjoint. Let $p \in r_1$ and $q \in r_2 + \cdots + r_n$. By
the connectedness of $r_1 + \cdots + r_n$, draw a Jordan arc $\gamma$ from $p$ to $q$ lying
within $r_1 + \cdots + r_n$. By Lemma 2.55, only finitely many points can lie on the
frontiers of more than two of the $r_i$, and we may certainly ensure that $\gamma$ avoids
all such points. By renumbering if necessary, we may assume that $\gamma$ visits a
point $p' \in r_1^- \cap r_2^- \cap (r_1 + \cdots + r_n)$. But by the construction of $\gamma$, $p' \notin r_i^-$ for
all $i > 2$, whence $p' \in -r_i$ for all such $i$. Therefore, $p' \in r_1^- \cap r_2^- \cap (r_1 + r_2)$,
whence $r_1 + r_2$ is connected. Finally, we relax the assumption that the $r_i$
are pairwise disjoint. Since $M$ is finitely decomposable, we have that each
element of $\bar{r}$ is the sum of zero or more members of a tuple $\bar{s}$ of pairwise
disjoint connected elements with the same sum. The result then follows easily
by repeated applications of Lemma 2.53. QED


In the sequel, we abbreviate the formula

$x_1 + x_2 = x \wedge x_1 > 0 \wedge x_2 > 0 \wedge x_1 \cdot x_2 = 0 \wedge c(x_1) \wedge c(x_2)$

by $x_1 \oplus x_2 = x$; thus, $x_1 \oplus x_2 = x$ “says” that $x$ can be partitioned into
non-zero, disjoint connected regions $x_1$ and $x_2$. Now let $\psi_{break}$ denote the
$\mathcal{L}_{c,\le}$-formula

$\forall x \forall y_1 \forall y_2 \big( (c(x) \wedge c(y_1) \wedge c(y_2) \wedge c(x + y_1) \wedge c(x + y_2) \wedge x \cdot y_1 = 0 \wedge x \cdot y_2 = 0 \wedge x \neq 0) \to$
$\exists x_1 \exists x_2 (x_1 \oplus x_2 = x \wedge c(x_1 + y_1) \wedge c(x_1 + y_2) \wedge c(x_2 + y_1) \wedge c(x_2 + y_2)) \big)$.


Thus, $\psi_{break}$ “says” that, if $r$, $s_1$, $s_2$ are connected regions such that $r$ is
non-zero, disjoint from $s_1$ and $s_2$, and forms a connected sum with both $s_1$ and $s_2$,
then $r$ can be partitioned into connected, non-zero regions $r_1$, $r_2$ such that each
of $r_1$ and $r_2$ forms a connected sum with each of $s_1$ and $s_2$. Fig. 2.9a illustrates
this configuration; note that $-r$ need not be connected.


LEMMA 2.57 Let $M$ be a finitely decomposable mereotopology over $S^2$ having
curve-selection. Then $M \models \psi_{break}$.


Proof Let $r$, $s_1$, $s_2$ be as above. We may assume that $s_1$ and $s_2$ are non-zero,
since otherwise, similar or easier arguments apply. Refer to Fig. 2.9b. For
$i = 1, 2$, since $r + s_i$ is connected, by Lemma 2.4 (ii), $r^- \cap s_i^- \cap (r + s_i) \neq \emptyset$.
In fact, since the removal of finitely many points from the open set $r + s_i$ does
not disconnect it, we can choose four distinct points $p_i$, $q_i$ ($i = 1, 2$) such that
$p_i, q_i \in r^- \cap s_i^- \cap (r + s_i)$. Since $M$ has curve-selection and $r$ is connected, it
is easy to see that, by exchanging $q_1$ and $q_2$ if necessary, we can draw cross-cuts
$\gamma$ from $p_1$ to $p_2$ and $\delta$ from $q_1$ to $q_2$ such that $|\gamma|$ and $|\delta|$ are disjoint. Since $S^2$ is
normal and $M$ a mereotopology, we can cover $|\delta|$ with elements of $M$ whose
closures are disjoint from $|\gamma|$. By compactness of $|\delta|$, this cover has a finite
subcover, $t_1, \dots, t_N$, say. Let $t = r \cdot (t_1 + \cdots + t_N)$; evidently, $q_1$ and $q_2$ lie
on the frontier of the same component $t'$ of $t$. Likewise, $p_1$ and $p_2$ lie on the
frontier of the same component of $r \cdot -t'$: call this component $r_1 \in M$, and let
$r_2 = r \cdot -r_1$. It is easy to check that $r_1$ and $r_2$ have the required properties. QED


Figure 2.9. The configuration of $\psi_{break}$ (panels a and b).


4.2 Neighbourhood graphs 


As before, $\mathfrak{A}$ shall denote an arbitrary structure interpreting the signature
$\{0, 1, +, \cdot, -, c\}$, such that the reduct of $\mathfrak{A}$ to $\{0, 1, +, \cdot, -\}$ is a Boolean algebra.
Recall the notion of connected partition introduced in Definition 2.52.


DEFINITION 2.58 Let $\bar{a} = a_1, \dots, a_n$ be a connected partition in $\mathfrak{A}$. We say
that $\bar{a}$ is a $c^h$-partition if, for every $I \subseteq \{1, \dots, n\}$ such that $|I| < h$, the
element $-\left(\sum_{i \in I} a_i\right)$ is connected.


If $\mathfrak{A} \models c(1)$, then a $c^1$-partition in $\mathfrak{A}$ is the same thing as a connected partition.
Furthermore, if $\mathfrak{A}$ is in fact a mereotopology over $S^2$ having curve-selection,
then, by Lemma 2.45, a $c^2$-partition in $\mathfrak{A}$ is the same thing as a partition
consisting entirely of 2-cells. It is $c^3$-partitions, however, that will mainly preoccupy
us in the sequel.

We assume familiarity with basic graph theory: for details, see Diestel, 1991
Ch. 1. Recall in this context that a graph is a pair $G = (V, E)$ where $V$ is
a finite set (called vertices) and $E$ is a set of 2-element subsets of $V$ (called
edges). We denote $V$ by $V(G)$ and $E$ by $E(G)$. Note that, on this definition,
graphs have no “loops” or “multiple edges”. If $G$ is a graph and $U$ a proper
subset of $V(G)$, we denote by $G \setminus U$ the result of deleting all the nodes in $U$
from $G$; and if $e = (v, v') \in E(G)$, we denote by $G/e$ the result of contracting
$G$ by merging $v$ and $v'$ into a single (new) node $v''$, such that $(v'', w)$ is an edge
of $G/e$ just in case either $(v, w)$ or $(v', w)$ is an edge of $G$. If a graph $H$ can be
obtained from $G$ by a sequence of deletions and contractions, then $H$ is said to
be a minor of $G$. Finally we take the terms path, cycle, connected, component
to be defined in the standard way. In particular, recall that, for $h > 0$, $G$ is said
to be $h$-connected if $G \setminus U$ is connected for every $U \subset V(G)$ such that $|U| < h$.
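
The graph operations just recalled can be made concrete in a few lines. The following is a minimal sketch, not a reference implementation: edges are represented as two-element `frozenset`s, the function names are hypothetical, and `is_h_connected` tests the definition above by brute force over all vertex sets $U$ with $|U| < h$ (restricted to proper subsets, as in the definition of $G \setminus U$).

```python
from itertools import combinations


def delete(V, E, U):
    """G \\ U: remove the vertices in U together with every edge touching them."""
    V2 = set(V) - set(U)
    return V2, {e for e in E if e <= V2}


def contract(V, E, e, new):
    """G / e: merge the two endpoints of edge e into the fresh vertex `new`."""
    v, w = tuple(e)
    V2 = (set(V) - {v, w}) | {new}
    E2 = set()
    for f in E:
        g = frozenset(new if x in (v, w) else x for x in f)
        if len(g) == 2:      # the contracted edge itself would become a loop
            E2.add(g)        # the set silently merges parallel edges
    return V2, E2


def is_connected(V, E):
    """Depth-first search; the empty graph is treated as connected."""
    V = set(V)
    if not V:
        return True
    start = next(iter(V))
    seen, stack = {start}, [start]
    while stack:
        x = stack.pop()
        for f in E:
            if x in f:
                (y,) = f - {x}
                if y not in seen:
                    seen.add(y)
                    stack.append(y)
    return seen == V


def is_h_connected(V, E, h):
    """G \\ U is connected for every proper subset U of V with |U| < h."""
    V = set(V)
    for size in range(min(h, len(V))):
        for U in combinations(V, size):
            if not is_connected(*delete(V, E, U)):
                return False
    return True


# Example (assumed toy data): the 4-cycle.
C4_V = {0, 1, 2, 3}
C4_E = {frozenset(p) for p in [(0, 1), (1, 2), (2, 3), (3, 0)]}
```

On the 4-cycle, deleting any single vertex leaves a path, while deleting two opposite vertices leaves two isolated vertices; contracting one edge yields a triangle.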


DEFINITION 2.59 Let $\bar{a} = a_1, \dots, a_n$ be a tuple from $\mathfrak{A}$. If $a_i + a_j$ is
connected for $1 \le i < j \le n$, we say that $a_i$ and $a_j$ are neighbours. The
neighbourhood graph of $\bar{a}$, denoted $N_{\bar{a}}$, is the graph with nodes $\{a_1, \dots, a_n\}$
and edges $\{(a_i, a_j) \mid a_i \text{ and } a_j \text{ are neighbours}\}$.
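
As an illustration of Definition 2.59, the sketch below computes a neighbourhood graph in a finite toy model. The model is an assumption made for the example and is not part of the text: regions are sets of cells of a finite graph, the sum of two regions is their union, and a region is connected when its cells induce a connected subgraph. Nodes of the resulting graph are represented by the indices of the regions in the tuple; the names `connected` and `neighbourhood_graph` are hypothetical.

```python
from itertools import combinations


def connected(region, adj):
    """True if the cells of `region` induce a connected subgraph of `adj`."""
    region = set(region)
    if not region:
        return True
    start = next(iter(region))
    seen, stack = {start}, [start]
    while stack:
        cell = stack.pop()
        for nb in adj[cell]:
            if nb in region and nb not in seen:
                seen.add(nb)
                stack.append(nb)
    return seen == region


def neighbourhood_graph(regions, adj):
    """Edges {i, j} such that the sum of regions[i] and regions[j] is connected."""
    return {
        frozenset((i, j))
        for i, j in combinations(range(len(regions)), 2)
        if connected(set(regions[i]) | set(regions[j]), adj)
    }


# Example (assumed toy data): six cells arranged in a ring.
ring = {i: [(i - 1) % 6, (i + 1) % 6] for i in range(6)}
triangle = neighbourhood_graph([{0, 1}, {2, 3}, {4, 5}], ring)
```

Three arcs covering the ring are pairwise neighbours, so their neighbourhood graph is a triangle.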


CLAIM 2.60 Suppose $\mathfrak{A} \models \psi_{con}$ and $\mathfrak{A} \models \psi^n_{sum}$ for all $n \ge 2$. Let $\bar{a} =
a_1, \dots, a_n$ be a tuple of connected elements of $\mathfrak{A}$, such that $a_{n-1} + a_n$ is
connected. Let $\bar{a}' = a_1, \dots, a_{n-2}, (a_{n-1} + a_n)$. Then $N_{\bar{a}'} = N_{\bar{a}}/(a_{n-1}, a_n)$.


Proof For $1 \le j < n - 1$, $a_j + (a_{n-1} + a_n)$ is connected if and only if
$a_j + a_{n-1}$ is connected or $a_j + a_n$ is connected. QED


CLAIM 2.61 Suppose $\mathfrak{A} \models \psi_{con}$ and $\mathfrak{A} \models \psi^n_{sum}$ for all $n \ge 2$. Let $\bar{a} =
a_1, \dots, a_n$ be a tuple of connected elements of $\mathfrak{A}$, with $a = a_1 + \cdots + a_n$. Then
$a$ is connected if and only if $N_{\bar{a}}$ is a connected graph.


Proof The if-direction follows easily from the fact that $\mathfrak{A} \models \psi_{con}$. For the
only-if direction, note that the claim is trivial if $n = 1$, so assume $n > 1$, and that
the claim holds for tuples of fewer than $n$ elements. Since $\mathfrak{A} \models \psi^n_{sum}$, there exists
$i$ ($1 \le i < n$) such that $a_i$ and $a_n$ are neighbours. By renumbering if necessary,
assume $i = n - 1$, and let $\bar{a}'$ be as in Claim 2.60, so that $N_{\bar{a}'} = N_{\bar{a}}/(a_{n-1}, a_n)$.
But $N_{\bar{a}'}$ is connected by inductive hypothesis, whence $N_{\bar{a}}$ is connected too.
QED


CLAIM 2.62 Suppose $\mathfrak{A} \models \psi_{con}$ and $\mathfrak{A} \models \psi^n_{sum}$ for all $n \ge 2$. Let $\bar{a}$ be a
connected partition in $\mathfrak{A}$, and let $h \ge 1$. Then $\bar{a}$ is a $c^h$-partition if and only if
$N_{\bar{a}}$ is an $h$-connected graph.


Proof Immediate by Claim 2.61. QED
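
The equivalence in Claim 2.62 can be checked by brute force in a finite toy model. The sketch below rests on assumptions that are not part of the text: regions are sets of cells of a finite graph, sums are unions, complements are taken within the set of all cells, and a region is connected when its cells induce a connected subgraph (in such a model the sentences $\psi_{con}$ and $\psi^n_{sum}$ can be verified directly). The function names are hypothetical, and only index sets $I$ that are proper subsets are tested, which avoids the degenerate case of an empty complement.

```python
from itertools import combinations


def connected(nodes, adj):
    """True if `nodes` induce a connected subgraph of `adj`."""
    nodes = set(nodes)
    if not nodes:
        return True
    start = next(iter(nodes))
    seen, stack = {start}, [start]
    while stack:
        x = stack.pop()
        for y in adj[x]:
            if y in nodes and y not in seen:
                seen.add(y)
                stack.append(y)
    return seen == nodes


def is_ch_partition(parts, universe, adj, h):
    """Definition 2.58: removing fewer than h parts leaves a connected complement."""
    for size in range(min(h, len(parts))):
        for chosen in combinations(parts, size):
            removed = set().union(*chosen)
            if not connected(set(universe) - removed, adj):
                return False
    return True


def neighbourhood_is_h_connected(parts, adj, h):
    """h-connectedness of the neighbourhood graph (nodes are part indices)."""
    n = len(parts)
    nbrs = {
        i: [j for j in range(n)
            if j != i and connected(parts[i] | parts[j], adj)]
        for i in range(n)
    }
    for size in range(min(h, n)):
        for U in combinations(range(n), size):
            if not connected(set(range(n)) - set(U), nbrs):
                return False
    return True


# Example (assumed toy data): the six faces of a cube, each face adjacent
# to every face except itself and the opposite one.
opposite = {0: 1, 1: 0, 2: 3, 3: 2, 4: 5, 5: 4}
cube = {f: [g for g in range(6) if g not in (f, opposite[f])] for f in range(6)}
faces = [{f} for f in range(6)]
```

For the cube faces, both tests agree for every $h$ from 1 to 5: the partition passes for $h \le 4$ and fails for $h = 5$, since removing four faces can leave two opposite ones.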


CLAIM 2.63 Suppose $\mathfrak{A} \models c(1)$, $\mathfrak{A} \models \psi_{con}$, $\mathfrak{A} \models \psi^n_{sum}$ for all $n \ge 2$,
and $\mathfrak{A} \models \psi_{break}$. Then every connected partition in $\mathfrak{A}$ can be refined to a
$c^3$-partition.


Proof We make free use of Claim 2.61. Let $\bar{a}$ be a connected partition. We
show first that $\bar{a}$ can be refined to a $c^2$-partition. Choose an element $a$ of $\bar{a}$
such that the number $k$ of components of the graph $N_{\bar{a}} \setminus \{a\}$ is maximal. And
let there be $m > 0$ elements $a$ for which this maximum value is attained. If $\bar{a}$
is not already a $c^2$-partition, then $k > 1$. Let $H_1$, $H_2$ be distinct components
of $N_{\bar{a}} \setminus \{a\}$. Since $N_{\bar{a}}$ is connected, there exist $b_1 \in H_1$, $b_2 \in H_2$ such that
$a + b_1$ and $a + b_2$ are connected. Since $\mathfrak{A} \models \psi_{break}$, let $a_1$, $a_2$ be non-zero,
connected, disjoint elements summing to $a$ with $a_1 + b_1$, $a_1 + b_2$, $a_2 + b_1$ and
$a_2 + b_2$ all connected; and let $\bar{b}$ be the connected partition which results from
replacing $a$ by $a_1$ and $a_2$. Evidently, for $i = 1, 2$, $N_{\bar{b}} \setminus \{a_i\}$ has strictly fewer
than $k$ components. That is, the number of elements $b$ in $\bar{b}$ such that $N_{\bar{b}} \setminus \{b\}$
has $k$ components is strictly less than $m$. Proceeding in this way, we eventually
obtain a $c^2$-partition.


Now let $\bar{a}$ be a $c^2$-partition. We show that $\bar{a}$ can be refined to a $c^3$-partition. If
$\bar{a}$ is not a $c^3$-partition, choose a pair of distinct elements $a$ and $a'$ such that the
number $k$ of components of the graph $N_{\bar{a}} \setminus \{a, a'\}$ is maximal; and let there be
$m > 0$ unordered pairs $(a, a')$ for which this maximum value is attained. Let
$H_1$, $H_2$ be distinct components of $N_{\bar{a}} \setminus \{a, a'\}$. Since $\bar{a}$ is a $c^2$-partition, there
exist $b_1 \in H_1$, $b_2 \in H_2$ such that $a + b_1$ and $a + b_2$ are connected. And since
$\mathfrak{A} \models \psi_{break}$, let $a_1$, $a_2$ be non-zero, connected, disjoint elements summing to
$a$ with $a_1 + b_1$, $a_1 + b_2$, $a_2 + b_1$ and $a_2 + b_2$ all connected; and let $\bar{b}$ be the
connected partition which results from replacing $a$ by $a_1$ and $a_2$. Evidently, for
$i = 1, 2$, $N_{\bar{b}} \setminus \{a_i, a'\}$ has strictly fewer than $k$ components. Moreover, suppose
$a''$ is any other element of $\bar{a}$ (distinct from $a$ and $a'$) such that $N_{\bar{a}} \setminus \{a, a''\}$ also
has $k$ components. We claim that $N_{\bar{b}} \setminus \{a_1, a''\}$ and $N_{\bar{b}} \setminus \{a_2, a''\}$ cannot both
have $k$ components. Working for the moment on this assumption, we see that
the number of pairs $b$, $b'$ in $\bar{b}$ such that $N_{\bar{b}} \setminus \{b, b'\}$ has $k$ components is strictly
less than $m$. Proceeding in this way, we eventually obtain a $c^3$-partition.


It remains only to verify that the graphs $N_{\bar{b}} \setminus \{a_1, a''\}$ and $N_{\bar{b}} \setminus \{a_2, a''\}$
encountered above do not both have $k$ components. If $a \in A$ and $B \subseteq A$,
let us say that $a$ is a neighbour of $B$ if $a$ is a neighbour of some element of $B$.
Let the components of $N_{\bar{a}} \setminus \{a, a''\}$ be $H_1, \dots, H_k$. Since $\bar{a}$ is a $c^2$-partition, we
have that, for all $i$ ($1 \le i \le k$), $a$ is a neighbour of $H_i$, and therefore either $a_1$
or $a_2$ is a neighbour of $H_i$. Hence, we can re-order the $H_i$ if necessary so that,
for some $p$, $q$ with $0 \le p < q \le k + 1$, $a_1$ is a neighbour of $H_i$ if and only if
$i < q$ and $a_2$ is a neighbour of $H_i$ if and only if $p < i$. Thus, the components
of $N_{\bar{b}} \setminus \{a_1, a''\}$ are $H_1, \dots, H_p, (\{a_2\} \cup H_{p+1} \cup \cdots \cup H_k)$, and the components
of $N_{\bar{b}} \setminus \{a_2, a''\}$ are $(\{a_1\} \cup H_1 \cup \cdots \cup H_{q-1}), H_q, \dots, H_k$. If these number $k$
in each case, we have $p = k - 1$ and $q = 2$. But $a'$ lies in one of the $H_i$, and $a_1$
and $a_2$ were chosen so that they are both neighbours of this $a'$. Hence $a_1$ and $a_2$
are both neighbours of $H_i$, whence $p < q - 1$. This yields $k \le 1$, contradicting
our assumption that $\bar{a}$ is not a $c^3$-partition. QED


We finish with a technical result which will be required later.


DEFINITION 2.64 If $\bar{a} = a_1, \dots, a_n$ is a connected partition in $\mathfrak{A}$ such that,
for any neighbour $a_j$ of $a_1$, $-(a_1 + a_j)$ is connected, we say that $\bar{a}$ is radial
about $a_1$.


Note incidentally that a $c^3$-partition is radial about each of its members.


CLAIM 2.65 Suppose $\mathfrak{A} \models c(1)$, $\mathfrak{A} \models \psi_{con}$, $\mathfrak{A} \models \psi^n_{sum}$ for all $n \ge 2$, and
$\mathfrak{A} \models \psi_{break}$. Let $n > 1$ and let $\bar{a} = a_1, \dots, a_n$ be a connected partition in
$\mathfrak{A}$ with $-a_1$ connected. Then $\bar{a}$ can be refined to a $c^2$-partition $a_1, b_2, \dots, b_N$,
radial about $a_1$, in which $a_1$ has at least three neighbours.


Proof Similar to the above. QED


We conclude with a further corollary of Claim 2.61. We employ the following
fact from graph theory, whose proof we leave to the reader.


PROPOSITION 2.66 If $G$ is a finite 2-connected graph of order at least 2, and
$v \in V(G)$, then there exists a $w \in V(G)$ such that $\{v, w\} \in E(G)$, and the
removal of both $v$ and $w$ from $G$ leaves a connected graph.


COROLLARY 2.67 Let $M$ be a finitely decomposable mereotopology over $S^2$
having curve-selection, and let $\bar{r} = r_1, \dots, r_n$ be a partition in $M$ consisting
entirely of 2-cells. Then, by re-numbering if necessary, we have, for all $k$
($1 \le k < n$), $r_1 + \cdots + r_k$ is a 2-cell.


That is: partitions of the closed plane into 2-balls are always ‘shellable’. The
analogous result for three-dimensional space fails (Rudin, 1958).


4.3 Partition graphs 


We now prove that, if $\bar{r}$ is a $c^3$-partition in a finitely decomposable
mereotopology over $S^2$ having curve-selection, then the neighbourhood graph of $\bar{r}$
fixes its topological properties completely.

We assume familiarity with the basic theory of plane graphs: for details,
see Diestel, 1991 Ch. 4. In this context, suppose that $e \subseteq S^2$ is the locus of
a Jordan arc. Then $e$ has two endpoints; all other points are called interior
points, and we denote the set of these interior points by $(e)$. (Of course, $(e)$ is
not the topological interior of the set $e$ in $S^2$; but no confusion should arise in
this regard.) A plane graph is a pair $G = (V, E)$, where $V$ is a finite subset of
$S^2$ and $E$ is a collection of sets $e \subseteq S^2$ such that $e$ is the locus of a Jordan arc,
satisfying the following conditions for all $v \in V$ and all $e, e' \in E$:


1 if $e \in E$ and $p$ is an endpoint of $e$, then $p \in V$;
2 $v \notin (e)$, and if $e \neq e'$ then $(e) \cap (e') = \emptyset$;
3 if $e \neq e'$, then $e$ and $e'$ do not join the same pair of endpoints.


Figure 2.10. The non-planar graphs $K_5$ and $K_{3,3}$.


The elements of $V$ are called the vertices of $G$, and the elements of $E$, the edges
of $G$; an edge $e \in E$ is said to join the vertices at its endpoints. We denote $V$
by $V(G)$, $E$ by $E(G)$ and $V \cup \bigcup E$ by $|G|$. The components of $S^2 \setminus |G|$ are
called the faces of $G$, and we denote the set of these faces by $F(G)$. A plane
graph is semi-algebraic if its edges are the loci of semi-algebraic Jordan arcs;
similarly for the terms piecewise linear and rational piecewise linear. Notice
that, on our definition, plane graphs have no “loops” or “multiple edges”. (Some
authors prefer the term simple graph.) A plane graph will be regarded as an
abstract graph in the obvious way, and we carry over notation and terminology
accordingly. Conversely, if $G = (V, E)$ is an abstract graph, a drawing of $G$
is a plane graph $G' = (V', E')$ for which there exists a function $\varepsilon$ mapping $V$
1-1 onto $V'$ and $E$ 1-1 onto $E'$ such that for all $(v, v') \in E$, $\varepsilon((v, v'))$ joins
$\varepsilon(v)$ and $\varepsilon(v')$. We call $\varepsilon$ an embedding. If $G$ has a drawing, $G$ is planar. Not
all abstract graphs are planar, of course: the graphs $K_5$ and $K_{3,3}$ illustrated in
Fig. 2.10 are familiar non-planar graphs. Indeed, this fact has a converse:


PROPOSITION 2.68 (KURATOWSKI, WAGNER) A graph is planar if and
only if it has no minor isomorphic to either $K_5$ or $K_{3,3}$.
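
The non-planarity of $K_5$ and $K_{3,3}$ is taken as known in the text. For orientation, the usual edge-counting argument is sketched below; it is the standard textbook reasoning rather than anything specific to this chapter, and it assumes Euler's formula $|V| - |E| + |F| = 2$ for connected plane graphs, together with the observation that every edge lies on the boundary of at most two faces.

```latex
\begin{align*}
\text{every face has at least 3 boundary edges:}\quad
  & 3|F| \le 2|E| \;\Longrightarrow\; |E| \le 3|V| - 6, \\
K_5:\quad
  & |E| = 10 > 3 \cdot 5 - 6 = 9, \\
\text{no triangles, so at least 4 boundary edges:}\quad
  & 4|F| \le 2|E| \;\Longrightarrow\; |E| \le 2|V| - 4, \\
K_{3,3}:\quad
  & |E| = 9 > 2 \cdot 6 - 4 = 8.
\end{align*}
```

In both cases the graph has too many edges for any drawing to satisfy Euler's formula.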


We further assume familiarity with the notion of duality for plane graphs.
Let $G$ and $G'$ be plane graphs. We say that $G'$ is a geometrical dual of $G$ if
there are bijections $f_F : F(G) \to V(G')$ and $f_E : E(G) \to E(G')$ such that,
for all $f \in F(G)$ and $e \in E(G)$:


1 $f_F(f) \in f$;
2 $f_E(e) \cap e$ is a single point interior to both $f_E(e)$ and $e$, and $f_E(e) \cap e' = \emptyset$
for all $e' \neq e$.


In our terminology, not every plane graph has a dual, because we do not allow
graphs to contain loops or multiple edges. However, we rely below on the
following sufficient condition (Wilson, 1979, p. 76).


PROPOSITION 2.69 Every 3-connected plane graph has a dual.


The following fact is also well-known.


LEMMA 2.70 Let $G$ and $G'$ be connected plane graphs such that $G'$ is a
geometrical dual of $G$. Then there is a bijection $f_V : V(G) \to F(G')$ such
that, for all $v \in V(G)$, $v \in f_V(v)$. Hence, $G$ is a dual of $G'$.


Proof Every face of $G'$ contains at least one vertex of $G$ by construction; it
contains at most one by Euler’s formula $|F(G)| - |E(G)| + |V(G)| = 2$ applied
to $G$ and $G'$. QED
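
The counting behind the appeal to Euler's formula in the proof of Lemma 2.70 can be spelled out as follows. This is a sketch of the standard argument, using only the bijections $f_F$ and $f_E$ from the definition of geometrical dual.

```latex
\begin{align*}
|V(G')| &= |F(G)|, \qquad |E(G')| = |E(G)|
        && \text{(bijections $f_F$ and $f_E$)} \\
|F(G')| &= 2 + |E(G')| - |V(G')|
        && \text{(Euler's formula for $G'$)} \\
        &= 2 + |E(G)| - |F(G)| \\
        &= |V(G)|
        && \text{(Euler's formula for $G$)}
\end{align*}
```

So $G'$ has exactly as many faces as $G$ has vertices. Since every face of $G'$ contains at least one vertex of $G$, and the faces are pairwise disjoint, each face contains exactly one. For instance, a plane drawing of the cube graph has 8 vertices, 12 edges and 6 faces, and its dual, the octahedron graph, has 6 vertices, 12 edges and 8 faces.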


Finally, duals are unique, in the following sense (Diestel, 1991, p. 88).


PROPOSITION 2.71 Let $G$ be a plane graph and let $G'$ and $G''$ be plane
graphs which are both geometrical duals of $G$. Then there is a homeomorphism
$h : S^2 \to S^2$ mapping $G'$ to $G''$. In fact, $h$ can be chosen such that, for all
$v \in V(G)$, if $f'$ and $f''$ are the faces of $G'$ and $G''$, respectively, containing $v$, then
$h$ maps $f'$ to $f''$.


Now let us apply these ideas to the graphs whose faces are $c^3$-partitions in
well-behaved, closed-plane mereotopologies.


LEMMA 2.72 Let $X$ be a topological space, and let $r$, $s$ be disjoint elements
of $\mathrm{RO}(X)$ with $p \in F(r) \setminus F(s)$. Then $p \in F(-(r + s))$.


Proof By Lemma 2.4 (ii), $p \notin r + s$. QED


LEMMA 2.73 Let $M$ be a mereotopology over $S^2$ having curve-selection, and
let $\bar{r} = r_1, \dots, r_n$ be a $c^3$-partition in $M$. For all $i$, $j$ ($1 \le i < j \le n$),
$F(r_i) \cap F(r_j)$ is connected.


Proof We may assume that $n \ge 3$. Since $\bar{r}$ is certainly a $c^2$-partition, every
$F(r_i)$ ($1 \le i \le n$) is a Jordan curve by Lemma 2.45. Suppose, for
contradiction, that $F(r_i) \cap F(r_j)$ is not connected, and let $p, q \in F(r_i) \cap F(r_j)$
be separated in $F(r_i)$ by $\{p', q'\} \subseteq F(r_i) \setminus F(r_j)$. By Lemma 2.72, $p', q' \in
F(-(r_i + r_j))$, so that, by the connectedness of $-(r_i + r_j)$, we can draw a
cross-cut $\gamma'$ (Definition 2.41) from $p'$ to $q'$ in $-(r_i + r_j) \subseteq -r_i$. By the
connectedness of $r_j$, we can likewise draw a cross-cut $\gamma$ from $p$ to $q$ in $r_j \subseteq -r_i$.
But $-r_i$ is a 2-cell, whence $\gamma$ and $\gamma'$ are easily seen to intersect at an interior
point, which is impossible, since $r_j \cap -(r_i + r_j)$ is empty. QED


LEMMA 2.74 Let $M$ be a mereotopology over $S^2$ having curve-selection, and
let $\bar{r} = r_1, \dots, r_n$ ($n \ge 4$) be a $c^3$-partition in $M$. Then there exists a unique
plane graph $G$ drawn in $S^2$ such that the collection of sets $\{r_1, \dots, r_n\}$ are
exactly $F(G)$ and the collection of sets $\{F(r_i) \cap F(r_j) \mid 1 \le i < j \le
n,\ r_i + r_j \text{ is connected}\}$ are exactly $E(G)$.


Proof Let $i$, $j$, $k$ be distinct integers in the range $[1, n]$. Since $\bar{r}$ is a
$c^3$-partition, $r_j^- \cup r_k^- = S^2 \setminus -(r_j + r_k)$ does not separate the non-empty sets $r_i$
and $-(r_i + r_j + r_k)$, whence $F(r_i) \cap (F(r_j) \cup F(r_k))$ is not the whole of the
Jordan curve $F(r_i)$. And since, by Lemma 2.73, $F(r_i) \cap F(r_j)$ is a connected
subset of $F(r_i)$, $F(r_i) \cap F(r_j)$ is either a point or the locus of a Jordan arc.
Indeed, $F(r_i)$ must include at least three Jordan arcs of the form $F(r_i) \cap F(r_j)$
for various $j$ distinct from $i$. Let the vertices of $G$ be the endpoints of all Jordan
arcs of the form $F(r_i) \cap F(r_j)$, and let the edges of $G$ be the segments of the
various $F(r_i)$ connecting them. To show that $G$ is a plane graph, we must
establish that if $F(r_i) \cap F(r_j)$ is a Jordan arc $\gamma$, then for all $k$ ($1 \le k \le n$) with
$k \neq i, j$, $F(r_k)$ contains no interior points of $\gamma$. For otherwise, let $p' \in F(r_k)$ be
an interior point of $\gamma$, and pick any $q' \in F(r_i) \setminus F(r_j)$. Then $p' \in F(-(r_i + r_j))$
and also, by Lemma 2.72, $q' \in F(-(r_i + r_j))$. If we now choose $p$ and $q$ in
$F(r_i) \cap F(r_j)$ separating $p'$ and $q'$ on the Jordan curve $F(r_i)$, the derivation
of a contradiction proceeds as in Lemma 2.73. Hence no point of $F(r_k)$ is an
interior point of $\gamma$, as required. Moreover, no two Jordan arcs in $E$ can have
the same end-points, since $\bar{r}$ is a $c^3$-partition. It follows that $G$ is a plane graph
as required. Evidently, $F(G) = \{r_1, \dots, r_n\}$ and $E(G)$ is the collection of
sets $F(r_i) \cap F(r_j)$ for $1 \le i < j \le n$ which are Jordan arcs.


It therefore remains only to show that $F(r_i) \cap F(r_j)$ is a Jordan arc if and only if
$r_i + r_j$ is connected. Note that $r_i \cup r_j$ is trivially not connected. By Lemma 2.4
(ii), $r_i \cup r_j \subseteq r_i + r_j \subseteq r_i \cup r_j \cup (F(r_i) \cap F(r_j))$, and the removal of a single point
from a connected, open set does not render it disconnected. Hence, if $r_i + r_j$ is
connected, $F(r_i) \cap F(r_j)$ is neither empty nor a singleton, and hence is a Jordan
arc. Conversely, suppose $F(r_i) \cap F(r_j)$ is a Jordan arc. We have already shown
that, if $p$ is an interior point of this arc, $p \notin \bigcup_{k \neq i,j} r_k^- = \left(\sum_{k \neq i,j} r_k\right)^-$. That is,
$p \in -\sum_{k \neq i,j} r_k = r_i + r_j$. Hence $F(r_i) \cap F(r_j) \cap (r_i + r_j)$ is non-empty,
whence $r_i + r_j$ is connected. QED


DEFINITION 2.75 Let $M$ be a mereotopology over $S^2$ having curve-selection,
and let $\bar{r} = r_1, \dots, r_n$ ($n \ge 4$) be a $c^3$-partition in $M$. We call
the unique plane graph $G$ satisfying the conditions of Lemma 2.74 the partition
graph of $\bar{r}$.


Warning: the neighbourhood graph and the partition graph of a $c^3$-partition are
not the same sort of thing. The former is an abstract graph whose nodes are
regions and whose edges are pairs of regions; the latter is a plane graph, whose
nodes are points and whose edges are the loci of Jordan arcs.


LEMMA 2.76 Let $M$ be a mereotopology over $S^2$ having curve-selection, let
$\bar{r} = r_1, \dots, r_n$ ($n \ge 4$) be a $c^3$-partition in $M$, and let $G$ be its partition graph.
Then there is a plane embedding $\varepsilon$ of $N_{\bar{r}}$ such that $\varepsilon(N_{\bar{r}})$ is a geometrical dual
of $G$ and, for all $i$ ($1 \le i \le n$), $\varepsilon(r_i) \in r_i$.


Proof Almost immediate from the definition of partition graph. QED


From Claim 2.62, $c^3$-partitions have 3-connected neighbourhood graphs. But
3-connected graphs have the crucial property that all their drawings are
topologically the same.


PROPOSITION 2.77 (WHITNEY) Let $G$ and $G'$ be 3-connected
plane graphs and $f : G \to G'$ a graph isomorphism. Then $f$ can be extended
to a homeomorphism $h : S^2 \to S^2$.


Let $M$ be a finitely decomposable mereotopology over $S^2$, and let $\bar{r} =
r_1, \dots, r_n$ and $\bar{s} = s_1, \dots, s_n$ be $n$-tuples from $M$. We are interested in the
case where the mapping $r_i \mapsto s_i$ is a graph isomorphism from $N_{\bar{r}}$ to $N_{\bar{s}}$, that
is, where, for all $i$, $j$ ($1 \le i < j \le n$), $r_i + r_j$ is connected if and only if $s_i + s_j$
is connected. We say in this case that $\bar{r}$ and $\bar{s}$ have the same neighbourhood
structure.
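
The condition of having the same neighbourhood structure is a purely index-wise comparison, as the following sketch illustrates. It is set in a finite toy model that is an assumption of the example, not part of the text: regions are sets of cells of a finite graph, sums are unions, and a region is connected when its cells induce a connected subgraph. The function names are hypothetical.

```python
from itertools import combinations


def connected(region, adj):
    """True if the cells of `region` induce a connected subgraph of `adj`."""
    region = set(region)
    if not region:
        return True
    start = next(iter(region))
    seen, stack = {start}, [start]
    while stack:
        cell = stack.pop()
        for nb in adj[cell]:
            if nb in region and nb not in seen:
                seen.add(nb)
                stack.append(nb)
    return seen == region


def same_neighbourhood_structure(rs, ss, adj):
    """r_i + r_j is connected exactly when s_i + s_j is, for all i < j."""
    if len(rs) != len(ss):
        return False
    return all(
        connected(rs[i] | rs[j], adj) == connected(ss[i] | ss[j], adj)
        for i, j in combinations(range(len(rs)), 2)
    )


# Example (assumed toy data): four cells in a ring.
ring = {i: [(i - 1) % 4, (i + 1) % 4] for i in range(4)}
rs = [{0}, {1}, {2}, {3}]
rotated = [{1}, {2}, {3}, {0}]     # same cyclic order, shifted by one
swapped = [{0}, {2}, {1}, {3}]     # two cells exchanged
```

Here `rs` and `rotated` have the same neighbourhood structure, whereas `rs` and `swapped` do not, because cells 0 and 1 are adjacent while cells 0 and 2 are not. Note that the order of the tuples matters: the comparison is between $r_i$ and $s_i$ for the same index $i$.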


THEOREM 2.78 Let $M$ be a finitely decomposable mereotopology over $S^2$
having curve-selection. Then any two $c^3$-partitions in $M$ having the same
neighbourhood structure are similarly situated in $S^2$.


Proof It is straightforward to verify that, if $n \le 3$, all $n$-element $c^3$-partitions
in $M$ are similarly situated in $S^2$. Thus, we may assume that $n \ge 4$. Let $\bar{r} =
r_1, \dots, r_n$ and $\bar{s} = s_1, \dots, s_n$ be $c^3$-partitions with the same neighbourhood
structure, and let $G$ and $H$ be their respective partition graphs. By Lemma 2.76,
let $G^*$ and $H^*$ be embeddings of $N_{\bar{r}}$ and $N_{\bar{s}}$, geometrically dual to $G$ and $H$,
respectively, let $p_i$ be the vertex of $G^*$ contained in $r_i$ and let $q_i$ be the vertex of
$H^*$ contained in $s_i$ for all $i$ ($1 \le i \le n$). Hence, there is a graph isomorphism $f :
G^* \to H^*$ mapping $p_i$ to $q_i$. Since $G^*$ and $H^*$ are 3-connected, Proposition 2.77
guarantees that $f$ can be extended to a homeomorphism $h : S^2 \to S^2$. Then
$h(G)$ and $H$ are both geometrical duals of the plane graph $h(G^*) = H^*$, such
that, for all $i$ ($1 \le i \le n$) the faces $h(r_i)$ and $s_i$ contain the vertex $h(p_i) = q_i$.
By Proposition 2.71, let $h'$ be a homeomorphism mapping $h(G)$ to $H$ such that
$h'(h(r_i)) = s_i$. Thus, $\bar{r}$ and $\bar{s}$ are similarly situated. QED


We finish this discussion of partition graphs with some “obvious” lemmas
concerning connected partitions in $\mathrm{ROP}(S^2)$ and related mereotopologies.
Readers irritated by proofs of such evident truths may skip to Theorem 2.82.


LEMMA 2.79 Let $G$ be a plane graph such that $G$ has no isolated vertices,
and every edge of $G$ lies on the boundary of (at least) 2 faces of $G$. Then the
members of $F(G)$ are regular open, and form a connected partition in $\mathrm{RO}(S^2)$.
Moreover, if $G'$ is another such plane graph, with $|G| \subseteq |G'|$, then, for every
$f \in F(G)$, $f = \sum\{f' \in F(G') \mid f' \subseteq f\}$.


Figure 2.11. Only the left-hand graph defines a connected partition in $\mathrm{RO}(S^2)$.


Proof Let $G = (V, E)$, and suppose $f \in F(G)$ and $p \in F(f)$. Since $G$
has no isolated vertices, there exists $e \in E$ such that $p \in e$ and hence some
$f' \in F(G)$, distinct from $f$, such that $e \subseteq F(f')$. Since $f'$ is disjoint from
$f$, $p \in (S^2 \setminus f^-)^- = S^2 \setminus (f^-)^0$, i.e. $p \notin (f^-)^0$. Thus, the open set $f$
satisfies $(f^-)^0 \subseteq f$, and so is regular open. By Lemma 2.4 (ii), $\bigcup F(G) \subseteq
\sum F(G) \subseteq (\bigcup F(G))^- = \bigcup\{f^- \mid f \in F(G)\} = S^2$. But by Lemma 2.3,
$\sum F(G)$ is the unique regular open set lying between $\bigcup F(G)$ and its closure;
i.e. $\sum F(G) = 1$. Hence, the elements of $F(G)$ form a connected partition in
$\mathrm{RO}(S^2)$. The last part of the lemma then follows from Lemma 2.3, since, if
$f \in F(G)$, then both $f$ and $\sum\{f' \in F(G') \mid f' \subseteq f\}$ are regular open sets
sandwiched between $\bigcup\{f' \in F(G') \mid f' \subseteq f\}$ and its closure. QED


Of course, the converse of Lemma 2.79 is false: the configuration of
Example 2.18 shows that not every connected partition in $\mathrm{RO}(S^2)$ is the set of faces
of some plane graph.


LEMMA 2.80 If $G$ is a piecewise linear plane graph such that $G$ has no isolated
vertices and every edge of $G$ lies on the boundary of exactly 2 faces, then the
faces of $G$ form a connected partition in $\mathrm{ROP}(S^2)$.


Proof Let $L_1, \dots, L_m$ be straight lines extending (in both directions) each of
the line segments making up $G$. Let $G'$ be the graph whose nodes are the points
of intersection of the $L_i$ (including $\infty$) and whose edges are the segments of the
$L_i$ joining them; and let $P$ be the set of non-zero products $s_1 \cdot \ldots \cdot s_m$, where
$s_i$ is one of the residual half-planes of $L_i$ for $1 \le i \le m$. By simple set-algebra,
$\bigcup P = \bigcup F(G')$; and since every $r \in P$ is connected, and every $f \in F(G')$
is a maximal connected subset of $S^2 \setminus |G'|$, $r \cap f \neq \emptyset$ implies $r \subseteq f$. Hence
every $f \in F(G')$ is a union of elements of $P$. But since these elements are
non-empty, open and disjoint and $f$ is connected, $f$ simply is some element of
$P$, and hence is an element of $\mathrm{ROP}(S^2)$. Since $|G| \subseteq |G'|$, the result follows
by the last part of Lemma 2.79. QED


Lemma 2.80 does have a converse:


LEMMA 2.81 If $\bar{r}$ is a connected partition in $\mathrm{ROP}(S^2)$, then $\bar{r}$ is the set of
faces of some piecewise linear plane graph $G$; moreover, for any such plane
graph $G$, $G$ has no isolated vertices, and every edge of $G$ lies on the boundary
of exactly 2 faces.


Proof By Claim 2.63, refine $\bar{r} = r_1, \dots, r_n$ to a $c^3$-partition $\bar{t} = \{t_1, \dots, t_N\}$,
and let $G_0$ be the partition graph of $\bar{t}$. Suppose, by renumbering if necessary,
that $r_1 = t_1 + \cdots + t_m$. Note that, if $e \in E(G_0)$, we have, for all $j$ ($1 \le j \le N$),
$(e) \subseteq t_j^-$ or $(e) \cap t_j^- = \emptyset$. Hence if $(e) \not\subseteq r_1$, then $(e) \cap \bigcup_{m < j \le N} t_j^- \neq \emptyset$,
whence $e \subseteq t_j^-$ for some $j$ ($m < j \le N$).


Let $G_1$ be the graph obtained from $G_0$ by first removing any edge $e$ such that
$(e) \subseteq r_1$, and then removing any vertex $v$ such that $v \in r_1$. Since $r_1$ is open,
the endpoints of every remaining arc are among the remaining vertices, so $G_1$
really is a plane graph. Moreover, if $m < j \le N$, then $t_j^- \cap r_1 = \emptyset$, so that
none of the vertices and edges removed from $G_0$ intersects $t_j$; hence $t_j$ is a
face of $G_1$. Therefore, the set of points

$S = \bigcup\{t_j \in F(G_0) \mid 1 \le j \le m\} \cup \bigcup\{e \in E(G_0) \mid (e) \subseteq r_1\} \cup \{v \in V(G_0) \mid v \in r_1\}$

must be the union of some faces of $G_1$. Trivially, $S \subseteq r_1$. We claim that
$r_1 \subseteq S$. For if $p \in S^2$, exactly one of the following three cases holds: (i) $p \in t_j$
for some $j$; (ii) $p \in V(G_0)$; or (iii) $p \in (e)$ for some $e \in E(G_0)$. In case (i),
either $p \in S$ or $p \notin r_1$, according as $j \le m$ or $j > m$. In case (ii), trivially, either $p \in S$
or $p \notin r_1$. In case (iii), if $p \notin S$, then $p \in (e) \not\subseteq r_1$, whence $p \in e \subseteq t_j^-$ for
some $j$ ($m < j \le N$), whence $p \in \left(\sum_{m < j \le N} t_j\right)^- = S^2 \setminus r_1$. This proves
that $r_1 \subseteq S$. Thus, $r_1 = S$ is the union of a number of faces of $G_1$. But $r_1$ is
by assumption connected, so $r_1$ is a face of $G_1$. Proceeding in the same way
for $r_2, \dots, r_n$, we obtain the desired graph $G = G_n$. QED


Lemmas 2.80 and 2.81 concern the mereotopology ROP($S^2$), but almost
exactly similar arguments can be given for ROS($S^2$) and ROQ($S^2$). We omit
the details, which are routine. Summarizing, we have:


THEOREM 2.82 A tuple $\bar r$ of subsets of $S^2$ is a connected partition in ROS($S^2$)
(alternatively: ROP($S^2$), ROQ($S^2$)) if and only if it is the set of faces of a
semi-algebraic (respectively: piecewise linear, rational piecewise linear) graph with
no isolated vertices and every edge lying on the boundary of two faces.


We now have at our disposal the means to give the promised proof of the 
only-if direction of Lemma 2.49. We therefore digress from the main business 
of this section to do so. 




Figure 2.12. Proof of Lemma 2.83.


LEMMA 2.83 Let M be a finitely decomposable mereotopology over $S^2$ having
curve-selection, and let $r_1, r_2$ be elements of M such that $r_1^- \cap r_2^- \ne \emptyset$. Then
there exist elements $r_1', r_2', t$ of M such that: (i) $r_1' \le r_1$ and $r_2' \le r_2$; (ii)
$-(r_1' + t)$, $-(r_2' + t)$ and $t$ are all connected; (iii) $-(r_1' + r_2' + t)$ is not
connected.


Proof We may suppose without loss of generality that $r_1$ and $r_2$ are elements in
a $c^3$-partition and that $r_1 + r_2 \ne 1$. For otherwise, by Claims 2.54 and 2.63, we
can easily find a $c^3$-partition consisting of at least three elements, and containing
distinct elements $r_i' \le r_i$ ($i = 1, 2$) such that $r_1'^- \cap r_2'^- \ne \emptyset$; now proceed with
$r_i$ replaced by $r_i'$.

Let $s = -(r_1 + r_2)$. From the properties of $c^3$-partitions, $s$, $r_1 + s$ and
$r_2 + s$ are all connected. Moreover, $s$ is the sum of various cells of the
$c^3$-partition, whence, by Claim 2.61, each $r_i$ ($i = 1, 2$) forms a connected sum
with at least one of these cells, $s_i$, say. Now, by Lemma 2.73, $F(r_i) \cap F(s_i)$ is
connected, and is certainly not empty or a singleton (by the connectedness of
$r_i + s_i$), so that $F(r_i) \cap F(s_i)$ includes the locus of some Jordan arc $\gamma_i$. (Note
that $|\gamma_i| \subseteq F(r_i) \cap F(s)$.) Now choose internal points $p_i$, $q_i$ and $p_i'$ of $\gamma_i$, for
$i = 1, 2$, in the order shown in Fig. 2.12. Since M has curve-selection, let $\alpha$ be
a cross-cut in $s$ from $p_1$ to $p_2$, $\alpha'$ a cross-cut in $s$ from $p_1'$ to $p_2'$, and $\beta$ a cross-cut
in $s$ from $q_1$ to $q_2$, drawn as shown. Since $|\beta|$ is compact, we can cover it with
finitely many small elements of M whose closures are disjoint from $|\alpha| \cup |\alpha'|$.
Taking the sum of these elements to be $t''$, let $t'$ be the component of $t'' \cdot s$
which includes $(|\beta| \setminus \{q_1, q_2\})$. Note that, by Lemma 2.13, we may freely take
components of elements of M.

Since M is finitely decomposable, consider the components $d_1, \ldots, d_k$ of
$-(r_1 + r_2 + t') = s \cdot -t'$. Since the sets $t', d_1, \ldots, d_k$ are all connected,
and their sum is $s$, which is also connected, Claim 2.61 implies that $t' + d_j$ is
connected for all $j$ ($1 \le j \le k$). Since $r_1^- \cap r_2^-$ contains a point $p$, and M has
curve-selection, let $\delta_i$ be a cross-cut in $r_i$ from $p$ to $q_i$. Then $|\delta_1| \cup |\delta_2| \cup |\beta|$
separates $|\alpha|$ and $|\alpha'|$ (see Fig. 2.12). But $|\delta_1| \cup |\delta_2| \cup |\beta| \subseteq (r_1 + r_2 + t')^-$;
that is, $\alpha$ and $\alpha'$ lie in different components of $-(r_1 + r_2 + t')$. Without loss of
generality, suppose $\alpha$ lies in $d_1$ and $\alpha'$ in $d_2$. Now let $t = t' + d_3 + \cdots + d_k$.

Since all the $t' + d_j$ are connected, $t$ is connected. Moreover, it is easy
to see that $p_i \in d_1 + r_i$ and $p_i' \in d_2 + r_i$ ($i = 1, 2$), whence the four sets
$d_j + r_i$ ($i = 1, 2$; $j = 1, 2$) are all connected. It follows that $-(t + r_1) =
r_2 + d_1 + d_2$ and $-(t + r_2) = r_1 + d_1 + d_2$ are both connected. On the other
hand, $-(t + r_1 + r_2) = d_1 + d_2$ is not connected. Setting $r_1' = r_1$ and $r_2' = r_2$
gives us the desired elements $r_1', r_2', t$. QED


4.4 Expressive power of first-order languages in plane 
mereotopologies 


We are now in a position to give an absolute characterization of the expressive
power of the languages $L_{c,\le}$ and $L_C$ over certain mereotopologies of interest.
Recall the concept of topologically complete formula given in Definition 2.51.
The following notation will be useful in constructing topologically complete
formulas.


NOTATION 2.84 Given a fixed Boolean algebra, a Boolean matrix is a
rectangular matrix whose entries are the elements 1 and 0. If $\bar r$ is an n-tuple, $\bar s$
an N-tuple, and A a Boolean matrix with N rows and n columns, we write
$\bar r = \bar s A$ to indicate that each element of $\bar r$ is the sum of certain elements of $\bar s$ as
indicated by the elements of A via normal matrix multiplication. Similarly, we
write $\bar x = \bar z A$ in first-order formulas to abbreviate the obvious conjunction of
Boolean algebra equations.
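
As a rough illustration of this notation, the sketch below models regions as finite sets of atoms (a toy Boolean algebra of subsets, not a mereotopology), so that Boolean sum becomes set union. The function name `tuple_times_matrix` and the sample data are assumptions made for the example only.

```python
from typing import FrozenSet, List, Sequence

Region = FrozenSet[str]  # toy stand-in for an element of the Boolean algebra


def tuple_times_matrix(s: Sequence[Region],
                       A: Sequence[Sequence[int]]) -> List[Region]:
    """Compute r = sA: r[j] is the sum (here: union) of all s[i] with A[i][j] == 1."""
    assert len(A) == len(s), "A needs one row per element of s"
    n_cols = len(A[0]) if A else 0
    result = []
    for j in range(n_cols):
        r_j: Region = frozenset()
        for i, s_i in enumerate(s):
            if A[i][j] == 1:
                r_j = r_j | s_i  # Boolean sum in the toy algebra
        result.append(r_j)
    return result


# A 3-tuple s and a 3x2 Boolean matrix A give a 2-tuple r.
s = [frozenset({"a"}), frozenset({"b"}), frozenset({"c"})]
A = [[1, 0],
     [1, 1],
     [0, 1]]
r = tuple_times_matrix(s, A)
```

Here the first column of A selects the first two elements of the N-tuple and the second column selects the last two, so each entry of the result is a sum of the selected elements.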


THEOREM 2.85 Let M be any finitely decomposable mereotopology over $S^2$
having curve-selection, and let $\Sigma$ be the signature $(c, \le, +, \cdot, -)$. Every tuple
from M satisfies some (purely existential) $L_\Sigma$-formula which is topologically
complete in M over $S^2$.


Proof Writing $\bar z$ for $z_1, \ldots, z_N$, let $\psi^N_{c^3}(\bar z)$ be the formula:

$$\bigwedge_{1 \le i \le N} (c(z_i) \wedge z_i > 0) \wedge \bigwedge_{1 \le i < j \le N} (c(-(z_i + z_j)) \wedge z_i \cdot z_j = 0) \wedge \sum_{1 \le i \le N} z_i = 1.$$

Thus, $M \models \psi^N_{c^3}[\bar s]$ if and only if $\bar s$ is an N-element $c^3$-partition. If
$\bar s = s_1, \ldots, s_N$ is a $c^3$-partition in M, let $\psi_{\bar s}(\bar z)$ be the formula:

$$\bigwedge \{c(z_i + z_j) \mid 1 \le i < j \le N \text{ and } s_i + s_j \text{ is connected}\} \wedge \bigwedge \{\neg c(z_i + z_j) \mid 1 \le i < j \le N \text{ and } s_i + s_j \text{ is not connected}\},$$

where $\bar z$ is the tuple of variables $z_1, \ldots, z_N$. Thus, $\psi_{\bar s}(\bar z)$ encodes the
neighbourhood structure of $\bar s$. Now let $\bar r = r_1, \ldots, r_n$ be any tuple of elements of M.
By Claim 2.63, there exists a $c^3$-partition $\bar s = s_1, \ldots, s_N$ in M and a Boolean
matrix A such that $\bar r = \bar s A$. Writing $\bar x$ for $x_1, \ldots, x_n$, let $\psi_{\bar r}(\bar x)$ be the formula

$$\exists \bar z (\psi^N_{c^3}(\bar z) \wedge \psi_{\bar s}(\bar z) \wedge \bar x = \bar z A).$$

Certainly, $M \models \psi_{\bar r}[\bar r]$. And if $\bar r'$ is a tuple from M such that $M \models \psi_{\bar r}[\bar r']$,
let $\bar s' = s_1', \ldots, s_N'$ be corresponding witnesses for the existentially quantified
variables $\bar z$. Then $s_1, \ldots, s_N$ and $s_1', \ldots, s_N'$ are $c^3$-partitions in $S^2$ which have
the same neighbourhood structure, and hence which are similarly situated in
$S^2$, by Theorem 2.78. It follows that $\bar r$ and $\bar r'$ are similarly situated in $S^2$ too.
Thus, $\psi_{\bar r}(\bar x)$ is topologically complete in M over $S^2$. QED
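
The formula encoding the neighbourhood structure is determined purely by which pairwise sums are connected. A minimal sketch of that bookkeeping, assuming the connectivity data is supplied as a set of index pairs; the function name, the ASCII connectives `&` and `~`, and the sample data are hypothetical, and the sample is purely symbolic rather than taken from an actual partition:

```python
from itertools import combinations
from typing import Set, Tuple


def neighbourhood_conjuncts(n: int, connected: Set[Tuple[int, int]]) -> str:
    """Build the conjunction of c(zi + zj) or ~c(zi + zj) over all pairs i < j.

    `connected` lists the pairs (i, j), i < j, whose sum is connected.
    """
    parts = []
    for i, j in combinations(range(1, n + 1), 2):
        atom = f"c(z{i} + z{j})"
        parts.append(atom if (i, j) in connected else f"~{atom}")
    return " & ".join(parts)


# Symbolic data only: four cells whose connected pairwise sums form a 4-cycle.
formula = neighbourhood_conjuncts(4, {(1, 2), (2, 3), (3, 4), (1, 4)})
```

The result has one conjunct per unordered pair of variables, so an N-element partition yields N(N-1)/2 conjuncts.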





COROLLARY 2.86 Let M be any finitely decomposable mereotopology over
$S^2$ having curve-selection. Every tuple from M satisfies some $L_C$-formula
which is topologically complete in M over $S^2$.


Proof Theorem 2.85 and Lemmas 2.22, 2.27 and 2.49. QED


Thus, for certain well-behaved mereotopologies over $S^2$, both $L_C$ and $L_{c,\le}$ are,
as we might put it, "topologically fully descriptive".

We now turn to the question of expressive power in mereotopologies over
$\mathbb{R}^2$. We need some auxiliary lemmas.


LEMMA 2.87 Let $\bar r = r_1, \ldots, r_n$ be a $c^3$-partition in any mereotopology M
over $S^2$ having curve-selection. Let $p, p' \in S^2$ be such that, for all $i$ ($1 \le i \le n$),
$p \in r_i^-$ if and only if $p' \in r_i^-$. Then there is a homeomorphism $h : S^2 \to S^2$
mapping $p$ to $p'$ and fixing each $r_i$.


Proof Obvious, by viewing $\bar r$ as $F(G)$ for some plane graph G. QED


LEMMA 2.88 Let $\bar r = r_1, \ldots, r_n$ and $\bar r' = r_1', \ldots, r_n'$ be similarly situated
$c^3$-partitions in any mereotopology M over $S^2$ having curve-selection. Let $p \in S^2$
be such that, for all $i$ ($1 \le i \le n$), $p \in r_i^-$ if and only if $p \in r_i'^-$. Then there is
a homeomorphism $h : S^2 \to S^2$ fixing $p$ and mapping $\bar r$ to $\bar r'$.

Proof Let $h' : S^2 \to S^2$ be some homeomorphism mapping $\bar r$ to $\bar r'$. Then,
for all $i$ ($1 \le i \le n$), $h'(p) \in r_i'^-$ if and only if $p \in r_i^-$. By Lemma 2.87, let
$h'' : S^2 \to S^2$ be a homeomorphism fixing each $r_i'$ and mapping $h'(p)$ to $p$.
Then $h := h'' \circ h'$ has the required properties. QED


THEOREM 2.89 Let M be any finitely decomposable mereotopology over $\mathbb{R}^2$
such that M has curve-selection. Every tuple from M satisfies some
$L_C$-formula which is topologically complete in M over $\mathbb{R}^2$.


Proof Given any tuple $s_1, \ldots, s_N$ from M, let $\phi^b_{\bar s}(\bar z)$ be the $L_C$-formula

$$\bigwedge \{\phi_b(z_i) \mid 1 \le i \le N \text{ and } s_i \text{ is bounded}\} \wedge \bigwedge \{\neg\phi_b(z_i) \mid 1 \le i \le N \text{ and } s_i \text{ is not bounded}\},$$

where $\bar z$ is the tuple of variables $z_1, \ldots, z_N$, and $\phi_b$ is as in Lemma 2.30. Thus,
$\phi^b_{\bar s}(\bar z)$ encodes the pattern of boundedness in the tuple $\bar s$. Now, given a tuple
$\bar r$, let $\bar s$ be an N-element $c^3$-partition in M refining $\bar r$, and let A be a Boolean
matrix satisfying $\bar r = \bar s A$. Using the translation from $L_{c,\le}$ to $L_C$ established by
Lemmas 2.22 and 2.27, let $\phi^N_{c^3}(\bar z)$ and $\phi_{\bar s}(\bar z)$ be the $L_C$-formulas corresponding
to $\psi^N_{c^3}(\bar z)$ and $\psi_{\bar s}(\bar z)$ in the proof of Theorem 2.85. Writing $\bar x$ for $x_1, \ldots, x_n$,
let $\phi_{\bar r}(\bar x)$ be the formula

$$\exists \bar z (\phi^N_{c^3}(\bar z) \wedge \phi_{\bar s}(\bar z) \wedge \phi^b_{\bar s}(\bar z) \wedge \bar x = \bar z A).$$

Certainly, $M \models \phi_{\bar r}[\bar r]$; and if $\bar r'$ is a tuple from M such that $M \models \phi_{\bar r}[\bar r']$,
let $\bar s' = s_1', \ldots, s_N'$ again be corresponding witnesses for the existentially
quantified variables $\bar z$. Then $s_1, \ldots, s_N$ and $s_1', \ldots, s_N'$ are $c^3$-partitions in $S^2$
which have the same neighbourhood structure, so that by Theorem 2.78 and
Lemma 2.88, there is a homeomorphism $h : S^2 \to S^2$ fixing $\infty$ and mapping
each $s_i$ to $s_i'$. Hence $\bar s$ and $\bar s'$ are similarly situated in $\mathbb{R}^2$, whence $\bar r$ and $\bar r'$ are
similarly situated in $\mathbb{R}^2$ too. QED







Thus, for well-behaved mereotopologies over $\mathbb{R}^2$, $L_C$ is, as we might put it,
"topologically fully descriptive".


4.5 Homogeneous mereotopologies 


Up to this point, we have been concerned only to show that certain relations 
can be defined by first-order formulas with signatures of topological primitives. 
We turn now briefly to the question of which relations cannot be so defined. 

At first glance, one might assume that languages with purely topologi- 
cal primitives can express only topological concepts in mereotopologies over 
which they are interpreted. However, this assumption is correct only if the
mereotopologies in question have a certain property. Recall that, for a fixed
topological space X, we write $\bar u \sim \bar v$ to mean that the tuples of subsets $\bar u$ and
$\bar v$ are similarly situated in X (Definition 2.51).


DEFINITION 2.90 Let M be a mereotopology over X. We say M is
homogeneous (over X) if, given any tuples $\bar r, \bar s$ from M with $\bar r \sim \bar s$ and any element
$r \in M$, there exists an element $s \in M$ with $\bar r, r \sim \bar s, s$. Let $M'$ also be a
mereotopology over X, with $M' \subseteq M$. We say $M'$ is homogeneously embedded in
M (over X) if, given any tuple $\bar r$ from $M'$, and any $r \in M$, there exists $s \in M'$
with $\bar r, r \sim \bar r, s$.


LEMMA 2.91 Let X be either $\mathbb{R}^2$ or $S^2$, and let M be any of ROS(X),
ROP(X) or ROQ(X). Then M is homogeneous.


Proof Assume M = ROS($S^2$); the other cases are identical. Let $\bar r, \bar s$ be
tuples from M, and let $r \in M$. Let $\bar t$ be a connected partition refining $\bar r, r$,
which by Theorem 2.82 is the set of faces of some semi-algebraic plane graph
G. If $h : S^2 \to S^2$ is a homeomorphism mapping $\bar r$ to $\bar s$, then h maps G to a
plane graph H. But then it is not difficult to show that the edges of H can be
deformed into a semi-algebraic plane graph $H'$, and moreover, that this may
be done in such a way that existing semi-algebraic edges are unaffected. By
Theorem 2.82, the faces of the resulting graph are elements of M; hence we
have a homeomorphism mapping $\bar r$ to $\bar s$ and taking r to some element s of M.
QED


Homogeneity and homogeneous embedding are important because of the fol- 
lowing facts. 


LEMMA 2.92 Let M be a homogeneous mereotopology over a topological
space X, and fix a signature $\Sigma$ of topological primitives. If $\bar r$ and $\bar s$ are tuples of
M which are similarly situated in X, then $\bar r$ and $\bar s$ satisfy the same $L_\Sigma$-formulas
in M.


Proof We show by induction on the complexity of $\phi(\bar x) \in L_\Sigma$ that, if $\bar r$ and
$\bar s$ are tuples of the appropriate arity which are similarly situated in X, then
$M \models \phi[\bar r]$ implies $M \models \phi[\bar s]$. The base case follows from the fact that the
primitives in $\Sigma$ have topological interpretations. The only non-trivial recursive
case is where $\phi(\bar x) = \exists y\, \psi(\bar x, y)$. If $M \models \phi[\bar r]$, there exists $r \in M$ such that
$M \models \psi[\bar r, r]$, and by homogeneity, if $\bar r \sim \bar s$, there exists $s \in M$ such that
$\bar r, r \sim \bar s, s$, whence $M \models \psi[\bar s, s]$ by inductive hypothesis, so that $M \models \phi[\bar s]$ as
required. QED














Lemma 2.92 gives an upper bound on the expressive power of first-order lan- 
guages with signatures of topological primitives interpreted over homogeneous 
mereotopologies: such languages cannot distinguish between similarly situated
tuples. It thus provides a partial converse to Theorems 2.85 and 2.89. It also
yields an easy proof that, over well-behaved open-plane mereotopologies, $L_{c,\le}$
cannot express the property of being bounded:


THEOREM 2.93 Let M be a mereotopology over $\mathbb{R}^2$ such that M is
homogeneous, and suppose M has curve-selection and contains a region r similarly
situated in $\mathbb{R}^2$ to the open unit disc $B^2$. Then there exists no formula $\psi(x)$ of
$L_{c,\le}$ such that, for all $r \in M$, r is bounded if and only if $M \models \psi[r]$.


Proof Suppose such a formula $\psi(x)$ exists. Then $M \models \psi[r]$, and by
Lemma 2.38, $\dot M \models \psi[\dot r]$. Since M has curve-selection, by Proposition 2.44
both $\dot r$ and its complement $-(\dot r)$ in $\dot M$ are 2-cells in $S^2$, and hence are similarly
situated. By Lemma 2.92, $\dot M \models \psi[-(\dot r)]$, and so by Lemma 2.38, $M \models \psi[-r]$.
This contradicts the fact that $-r$ is unbounded. QED


Finally, we return to the relationship between ROS(X), ROP(X) and ROQ(X).


LEMMA 2.94 Let X be either $\mathbb{R}^2$ or $S^2$. Then ROQ(X) is homogeneously
embedded in ROP(X), which is in turn homogeneously embedded in ROS(X).


Proof Virtually identical to the proof of Lemma 2.91. QED 


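The following result is well-known (see, for example, Hodges, 1993, p. 55).


PROPOSITION 2.95 (TARSKI-VAUGHT) Let $\mathfrak{A}, \mathfrak{B}$ be structures with
$\mathfrak{A} \subseteq \mathfrak{B}$, and suppose that, for any n-tuple $\bar a$ from A and any formula $\phi(\bar x)$ of the
form $\exists y\, \psi(\bar x, y)$ such that $\mathfrak{B} \models \phi[\bar a]$, there exists $a \in A$ such that $\mathfrak{B} \models \psi[\bar a, a]$.
Then $\mathfrak{A} \preceq \mathfrak{B}$.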





LEMMA 2.96 Let $M, M'$ be mereotopologies over a topological space X, with
M homogeneous and $M'$ homogeneously embedded in M. Fix a signature of
topological primitives. Then $M' \preceq M$.


Proof By assumption, $M' \subseteq M$. Let $\bar r$ be an n-tuple of elements of $M'$, and
let $\phi(\bar x)$ be any formula of the form $\exists y\, \psi(\bar x, y)$ such that $M \models \phi[\bar r]$. Then there
exists $r \in M$ such that $M \models \psi[\bar r, r]$. Since $M'$ is homogeneously embedded
in M, there exists $s \in M'$ such that $\bar r, r \sim \bar r, s$. Since M is homogeneous,
$M \models \psi[\bar r, s]$ by Lemma 2.92. The result then follows by Proposition 2.95.
QED





Hence, for X either $\mathbb{R}^2$ or $S^2$, and over any signature $\Sigma$ of topological
primitives, we have ROQ(X) $\preceq$ ROP(X) $\preceq$ ROS(X). In particular, these
three structures have identical $L_\Sigma$-theories. We show in the sequel that this
is no accident: almost any 'reasonable' mereotopology over $S^2$ has the same
$L_\Sigma$-theory. Anticipating these results, we employ the following notation and
terminology.


DEFINITION 2.97 Let $\Sigma$ be a signature of topological primitives. We call the
theory $\mathrm{Th}_\Sigma$(ROS($S^2$)) the standard $L_\Sigma$-theory (of closed plane
mereotopology), and denote it $T_\Sigma$.


5. Axiomatization 


In this section, we provide an axiomatic characterization of $T_{c,\le}$, the standard
$L_{c,\le}$-theory of closed plane mereotopology. The material is essentially that
of Pratt and Schoop, 1998. The axiom system in question will help us to
identify mereotopologies over $S^2$ having the standard $L_{c,\le}$-theory.

As before, we write $\psi^n_{c^3}(\bar z)$ for the $L_{c,\le}$-formula stating that $\bar z$ forms
an n-element $c^3$-partition, and $x = u \oplus v$ for the $L_{c,\le}$-formula stating that
u and v are disjoint, non-zero, connected regions summing to x. Let M be a
mereotopology over $S^2$ having curve-selection. Consider a triple r, s, t from
M satisfying the formula $\psi^3_{c^3}(x, y, z)$. By Lemma 2.45, each of these regions
is a 2-cell, and it is easy to see that the closures of any two of these intersect in
a Jordan arc. (Formally, this follows by Lemma 2.73.) Now let $\psi_{split}$ denote
the $L_{c,\le}$-formula

$$\forall x \forall y \forall z (\psi^3_{c^3}(x, y, z) \to \exists u \exists v (u \oplus v = x \wedge c(u + y) \wedge \neg c(u + z) \wedge c(v + z) \wedge \neg c(v + y))).$$

















Informally, $\psi_{split}$ "says" that, given two 2-cells r and s whose frontiers intersect
in a Jordan arc, r can be partitioned into two connected regions using a cross-cut
whose end-points are the end-points of that Jordan arc (Fig. 2.13a).


DEFINITION 2.98 A mereotopology M is splittable if $M \models \psi_{split}$.

The following lemma is unsurprising.


LEMMA 2.99 The mereotopologies ROS($S^2$), ROP($S^2$) and ROQ($S^2$) are
splittable.


Proof Almost immediate from Theorem 2.85, Lemma 2.42 and
Proposition 2.43. QED


However, not all finitely decomposable mereotopologies over $S^2$ having
curve-selection are splittable. If an (n-1)-dimensional hyperplane in $\mathbb{R}^n$ is defined by
an equation $x_i = 0$, where $1 \le i \le n$, we call it an axis-oriented hyperplane;
and if a half-space is bounded by an axis-oriented hyperplane, we call it an
axis-oriented half-space.


EXAMPLE 2.100 Define ROX($S^n$) to be the Boolean sub-algebra of RO($S^n$)
generated by the axis-oriented half-spaces. It is easy to see that ROX($S^n$) is a
finitely decomposable mereotopology over $S^n$ having curve-selection. However,
ROX($S^n$) $\not\models \psi_{split}$, as is clear in the case n = 2 by inspection of
Fig. 2.13b).


Figure 2.13. a) The configuration of $\psi_{split}$: r and s are disjoint 2-cells with $r^- \cap s^-$ a Jordan
arc; r is broken into $r_1$ and $r_2$. b) A pair of regions in ROX($\mathbb{R}^2$) violating $\psi_{split}$.





Thus, whereas RO($S^2$) has, as it were, too many regions for the standard
theory, ROX($S^2$) has too few. As we have observed, RO($S^2$) is not finitely
decomposable, and lacks curve-selection, while ROX($S^2$) is not splittable.
It transpires that these represent the only ways of failing to exhibit the
standard theory of closed plane mereotopology. Specifically, we show in this
section that all splittable, finitely decomposable mereotopologies over $S^2$ having
curve-selection have the same $L_{c,\le}$-theory. Our strategy is to pick one
splittable, finitely decomposable mereotopology over $S^2$ having curve-selection
(ROP($S^2$) will do) and characterize its theory axiomatically. We then merely
need to check that our axiom system is correct for all such mereotopologies.


5.1 The axioms 


Our axiom system comprises three parts: a general inference system, a set
of proper axioms and an $\omega$-rule. (i) The general inference system is simply
any complete Hilbert system for first-order logic, restricted to the signature
$\{+, \cdot, -, \le, c\}$. (ii) The proper axioms are as follows:


1 the usual axioms of Boolean algebra, and the axiom $0 \ne 1$;

2 the axiom $\psi_{con}$ (Lemma 2.53);

3 where $n \ge 2$, the axioms $\psi^n_{sum}$ (Lemma 2.56);

4 the axiom

$$\neg\exists x_1 \ldots \exists x_5 \Bigl( \bigwedge_{1 \le i \le 5} (c(x_i) \wedge x_i \ne 0) \wedge \bigwedge_{1 \le i < j \le 5} (c(x_i + x_j) \wedge x_i \cdot x_j = 0) \Bigr);$$

5 the axiom

$$\neg\exists x_1 \ldots \exists x_6 \Bigl( \bigwedge_{1 \le i \le 6} (c(x_i) \wedge x_i \ne 0) \wedge \bigwedge_{1 \le i < j \le 6} x_i \cdot x_j = 0 \wedge \bigwedge_{1 \le i \le 3,\ 4 \le j \le 6} c(x_i + x_j) \Bigr);$$

6 the axioms c(0) and c(1);

7 the axiom $\psi_{break}$ (Lemma 2.57);

8 the axiom $\psi_{split}$ (Definition 2.98).


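(iii) The final component of our axiom system is the $\omega$-rule. If $n \ge 1$, we let
$\psi^n_c(x)$ stand for the formula

$$\exists z_1 \ldots \exists z_n \Bigl( \bigwedge_{1 \le i \le n} c(z_i) \wedge (x = z_1 + \cdots + z_n) \Bigr).$$

Thus, $\psi^n_c(x)$ "says" that x can be formed by summing n connected regions.
The $\omega$-rule is then the (infinitary) rule of inference:

$$\frac{\{\forall x (\psi^n_c(x) \to \phi(x)) \mid n \ge 1\}}{\forall x\, \phi(x)}$$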

Let $\Phi$ be a set of $L_{c,\le}$-sentences. A proof with premises $\Phi$ in the above system
is a sequence of $L_{c,\le}$-formulas $\{\phi_\alpha\}_{\alpha < \beta}$, for some ordinal $\beta$ (not necessarily
finite) such that every $\phi_\alpha$ is either (i) an element of $\Phi$ or (ii) an axiom or (iii)
the result of applying a rule of inference to some formulas $\phi_\gamma$ with $\gamma < \alpha$. If
$\psi$ is the last line of some such proof, we write $\Phi \vdash \psi$. If $\Phi = \{\phi\}$ we write
$\phi \vdash \psi$, and if $\Phi = \emptyset$ we write $\vdash \psi$ and call $\psi$ a theorem. Let us denote the set
of theorems by $T_{Ax}$. The main result of this section is:





THEOREM 2.101 $T_{Ax}$ is the complete $L_{c,\le}$-theory of any finitely
decomposable, splittable mereotopology over $S^2$ having curve-selection.

Proof Lemmas 2.103 and 2.105, below. QED


Of course, this entails that all such mereotopologies, considered as
$\{c, \le\}$-structures, are elementarily equivalent.

The $\omega$-rule is less unfamiliar than one might at first think. Essentially, it says
that if a property holds of every region which is the sum of finitely many
connected regions, then it simply holds of every region. This conditional
is obviously true in a finitely decomposable mereotopology. Thus, a proof
involving the $\omega$-rule is analogous to an argument of the kind encountered in
elementary algebra textbooks in which one proves a property of all polynomials
by showing that it holds of all polynomials of some arbitrary degree n.
Nevertheless, the inclusion of an infinitary proof rule does mean that we ought to
check the deduction theorem.


LEMMA 2.102 Let $\phi$ be an $L_{c,\le}$-sentence and $\psi$ an $L_{c,\le}$-formula such that
$\phi \vdash \psi$. Then $\vdash \phi \to \psi$.


Proof By assumption, there is a proof $\{\phi_\alpha\}_{\alpha < \beta + 1}$ with premises $\{\phi\}$ and
last line $\phi_\beta = \psi$. Without loss of generality, we may assume that the first
(actually, zeroth) line of the proof $\phi_0$ is $\phi$. We proceed by induction on $\beta$.
The case $\beta = 0$ is trivial, since $\vdash \phi \to \phi$. If $\beta > 0$, then either $\phi_\beta$ is an
axiom or is derived from applying a rule of inference to earlier lines of the
proof. The only interesting case is where $\phi_\beta = \forall x\, \pi$ is derived by the $\omega$-rule
from the formulas $\forall x (\psi^n_c(x) \to \pi)$ occurring earlier in the proof. But the
inductive hypothesis then yields $\vdash \phi \to \forall x (\psi^n_c(x) \to \pi)$, for each n, whence
$\vdash \forall x (\psi^n_c(x) \to (\phi \to \pi))$. The $\omega$-rule then yields $\vdash \forall x (\phi \to \pi)$, whence
$\vdash \phi \to \forall x\, \pi$ (note that $\phi$ is a sentence), as required. QED


We remark in passing that the axiom c(0) is actually redundant: it can be derived 
from the other axioms and proof rules. 


5.2 Correctness 


In this section, we establish the easy half of Theorem 2.101. 


LEMMA 2.103 If M is a splittable, finitely decomposable mereotopology over
$S^2$ having curve-selection, then $M \models T_{Ax}$.


Proof We follow the enumeration in Sec. 5.1, showing that the proper axioms
are all true in M and that the $\omega$-rule is truth-preserving.


1 M is a mereotopology.

2 Lemma 2.53.

3 Lemma 2.56.

4 Suppose $r_1, \ldots, r_5$ are connected, non-empty and pairwise disjoint, and
that any pair of them have a connected sum. By Lemma 2.26, choose
points $p_i \in r_i$ and $q_{i,j} \in F(r_i) \cap F(r_j) \cap (r_i + r_j)$ ($1 \le i < j \le 5$).
For each $i$ ($1 \le i \le 5$), draw end-cuts in $r_i$ from $p_i$ to all the points
$q_{i,j}$ and $q_{j,i}$; it is easy to see that these can be chosen so that any pair of
these end-cuts intersect only in the point $p_i$. Ignoring the points $q_{i,j}$, we
have a plane drawing of the graph $K_5$, which is known to be non-planar
(Fig. 2.10).

5 As for axiom 4, but with $K_{3,3}$ instead of $K_5$.

6 Trivial.

7 Lemma 2.57.

8 M is splittable.


The $\omega$-rule is obviously truth-preserving, because M is finitely decomposable.
QED
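
The non-planarity of $K_5$ and $K_{3,3}$ invoked for axioms 4 and 5 can be checked against the standard edge bounds that follow from Euler's formula: a simple planar graph with v >= 3 vertices has at most 3v - 6 edges, and at most 2v - 4 edges if it contains no triangles. The sketch below tests only this necessary condition; it is not a general planarity test, and the function name is an assumption of the example.

```python
def exceeds_planar_edge_bound(vertices: int, edges: int,
                              triangle_free: bool = False) -> bool:
    """Return True if the edge count rules out planarity via Euler's bound.

    A False result is inconclusive: the bound is necessary, not sufficient.
    """
    if vertices < 3:
        return False  # the bounds below only apply for at least 3 vertices
    bound = 2 * vertices - 4 if triangle_free else 3 * vertices - 6
    return edges > bound


k5_edges = 5 * 4 // 2   # complete graph on 5 vertices: 10 edges
k33_edges = 3 * 3       # complete bipartite graph on 3 + 3 vertices: 9 edges

k5_nonplanar = exceeds_planar_edge_bound(5, k5_edges)
# K_{3,3} is bipartite, hence triangle-free, so the sharper bound applies.
k33_nonplanar = exceeds_planar_edge_bound(6, k33_edges, triangle_free=True)
```

Note that $K_{3,3}$ passes the general 3v - 6 bound (9 <= 12), which is why the triangle-free variant is needed for it.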


5.3 Completeness 


In this section, we establish the difficult half of Theorem 2.101. We make
use of the omitting types theorem: for details, see, e.g. Hodges, 1993, pp. 333.
Let $\mathfrak{A}$ be a structure, $\Phi(x)$ a set of formulas with free variable x, and T a set
of sentences. We say that $\mathfrak{A}$ omits $\Phi(x)$ if, for all $a \in A$, $\mathfrak{A} \not\models \Phi[a]$. We
say that T locally omits $\Phi(x)$ if, for every formula $\theta(x)$ with free variable
x such that $\theta$ is consistent with T, there exists $\phi(x) \in \Phi(x)$ such that
$T \nvdash \forall x (\theta(x) \to \phi(x))$. The following theorem is a well-known strengthening of
the completeness theorem for first-order logic.


PROPOSITION 2.104 (OMITTING TYPES THEOREM) If a consistent theory
T locally omits a set of formulas $\Phi(x)$, then T has a countable model
omitting $\Phi(x)$.


With these preliminaries behind us, we can proceed with our completeness 
proof. 


LEMMA 2.105 If $\phi$ is an $L_{c,\le}$-sentence, and ROP($S^2$) $\models \phi$, then $\phi \in T_{Ax}$.


Proof Suppose that $\phi \notin T_{Ax}$. We are required to prove that ROP($S^2$) $\models \neg\phi$.
Let T be the set of all and only those $L_{c,\le}$-sentences $\psi$ such that $\neg\phi \vdash \psi$.
By Lemma 2.102, T is a consistent set of sentences, and from the $\omega$-rule, T
locally omits the type $\{\neg\psi^n_c(x) \mid n > 0\}$. By Proposition 2.104, there exists
a countable model $\mathfrak{A} \models T$ omitting that type. Fix the structure $\mathfrak{A}$ for the
remainder of this proof.

We now proceed in three stages. Stage 1 establishes some basic facts about
$\mathfrak{A}$; Stage 2 shows that $\mathfrak{A}$ can be embedded in the $\{c, \le\}$-structure ROP($S^2$);
Stage 3 shows that the embedding we have chosen is in fact elementary.


Stage 1: Axioms 1 ensure that the reduct of $\mathfrak{A}$ to the signature $\{+, \cdot, -, \le\}$ is
a Boolean algebra. Such structures were discussed in Sec. 4.1, where various
terminology and notational conventions were introduced. We carry these over
to the present proof. Using that terminology, another way of saying that $\mathfrak{A}$
omits the type $\{\neg\psi^n_c(x) \mid n > 0\}$ is to say that $\mathfrak{A}$ is finitely decomposable.


By Axioms 2, 3, 6 and 7, all the claims in Sections 4.1 and 4.2 hold of $\mathfrak{A}$. In
particular, every tuple can be refined to a connected partition, and thence to a
$c^2$- and a $c^3$-partition. Furthermore, we have


CLAIM 2.106 Let $\bar b = b_1, \ldots, b_n$ be a connected partition in $\mathfrak{A}$. Then the
neighbourhood graph of $\bar b$ is planar.


Proof By Proposition 2.68, if the neighbourhood graph G of $\bar b$ is not planar,
it contains either $K_5$ or $K_{3,3}$ as a minor. But then there is a sequence of
contractions of G resulting in a graph H which has either $K_5$ or $K_{3,3}$ as a
sub-graph. By repeated applications of Claim 2.60 (re-numbering the $b_i$ as
necessary), there is a connected partition $\bar s$ in $\mathfrak{A}$ whose neighbourhood graph
contains $K_5$ or $K_{3,3}$ as a sub-graph. But this is impossible by Axioms 4 and 5.
QED


Stage 2: Since $\mathfrak{A}$ is countable, let $A = \{a_1, a_2, \ldots\}$. Let $N_0 = 1$ and let $\bar c^{(0)}$ be
the 1-tuple whose element is the unit of the Boolean algebra $\mathfrak{A}$. Trivially, $\bar c^{(0)}$ is
a $c^3$-partition. For $n \ge 0$, suppose that the $c^3$-partition
$\bar c^{(n)} = c^{(n)}_1, \ldots, c^{(n)}_{N_n}$ in $\mathfrak{A}$ has been defined; then, by Claim 2.63, let
$\bar c^{(n+1)} = c^{(n+1)}_1, \ldots, c^{(n+1)}_{N_{n+1}}$ be a $c^3$-partition in $\mathfrak{A}$ refining the tuple
$c^{(n)}_1, \ldots, c^{(n)}_{N_n}, a_{n+1}$. It is then obvious that,
for each $n > 0$, $\bar c^{(n)}$ refines the tuple $a_1, \ldots, a_n$ and also the tuple $\bar c^{(m)}$ for all
$m$ ($0 \le m < n$). We fix the enumerations $a_1, a_2, \ldots$ and $\bar c^{(0)}, \bar c^{(1)}, \ldots$ for the
remainder of Stage 2.


For brevity, denote ROP($S^2$) by S. We now map each initial segment $a_1, \ldots, a_n$
of A into S. Let $\omega^{(n)}$ be the set of functions
$g^{(n)} : \{c^{(n)}_1, \ldots, c^{(n)}_{N_n}\} \to S$ satisfying the conditions:


G1: the regions $g^{(n)}(c^{(n)}_1), \ldots, g^{(n)}(c^{(n)}_{N_n})$ form a connected partition;


G2: for all $i, j$ ($1 \le i < j \le N_n$), $g^{(n)}(c^{(n)}_i) + g^{(n)}(c^{(n)}_j)$ is connected if and
only if $c^{(n)}_i + c^{(n)}_j$ is connected.

We remark that, in G2, we have $g^{(n)}(c^{(n)}_i), g^{(n)}(c^{(n)}_j) \in S$ and $c^{(n)}_i, c^{(n)}_j \in A$.
Hence, different senses of "+" and "connected" apply in the two cases.


CLAIM 2.107 For all $n \in \mathbb{N}$, $\omega^{(n)} \ne \emptyset$.


Proof For the proof of this claim, we shall drop the n-sub- and superscripts and
write N for $N_n$ and $c_i$ for $c^{(n)}_i$. Let G be the neighbourhood graph on $c_1, \ldots, c_N$.
By Claim 2.106, G is planar. By Axioms 6 and Claim 2.61, G is connected.
Let H be a drawing of G in $S^2$ (under some mapping $\epsilon : V(G) \to V(H)$);
we may assume that H is piecewise linear. By Proposition 2.69, let $H^*$ be a
geometric dual of H, which we may likewise assume to be piecewise linear. By
Lemma 2.70, every vertex of H lies in exactly one face of $H^*$. It follows that
every edge of $H^*$ is on the boundary of two faces; moreover, $H^*$ by construction
contains no isolated nodes. By Theorem 2.82, the faces of $H^*$ form a connected
partition in S. So define $g(c_i)$ to be the face of $H^*$ containing the H-vertex
$\epsilon(c_i)$. Properties G1 and G2 are then almost immediate. QED


CLAIM 2.108 Let $I \subseteq \{1, \ldots, N_n\}$, and let $g^{(n)} \in \omega^{(n)}$. Then $\sum_{i \in I} c^{(n)}_i$ is
connected if and only if $\sum_{i \in I} g^{(n)}(c^{(n)}_i)$ is connected.


Proof Claim 2.61 and property G2. QED
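
Claim 2.108 reduces the connectedness of a sum of cells to a property that both sides share through G2. Assuming (as the appeal to Claim 2.61 suggests, though that claim is not restated here) that a sum of cells of a connected partition is connected exactly when those cells induce a connected subgraph of the neighbourhood graph, the check can be sketched as a graph search. The function name and the sample graph are hypothetical.

```python
from collections import deque
from typing import Dict, Iterable, Set


def sum_is_connected(neighbours: Dict[int, Set[int]],
                     cells: Iterable[int]) -> bool:
    """Check whether the chosen cells induce a connected subgraph.

    `neighbours[i]` is the set of cells j such that cell i + cell j is connected.
    """
    chosen = set(cells)
    if not chosen:
        return True  # the empty sum is 0, and c(0) is among the axioms
    start = next(iter(chosen))
    seen = {start}
    queue = deque([start])
    while queue:
        current = queue.popleft()
        # Only follow edges that stay inside the chosen index set.
        for nxt in neighbours[current] & chosen:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen == chosen


# Hypothetical neighbourhood graph: four cells arranged in a ring.
ring = {1: {2, 4}, 2: {1, 3}, 3: {2, 4}, 4: {1, 3}}
adjacent_pair = sum_is_connected(ring, {1, 2})
opposite_pair = sum_is_connected(ring, {1, 3})
```

Because the test depends only on the graph, any two partitions with the same neighbourhood graph give the same answer for every index set, which is the content of the claim under the stated assumption.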
Suppose $n > m \ge 0$, so that $\bar c^{(n)}$ refines $\bar c^{(m)}$. For all $i$ ($1 \le i \le N_m$)
let $c_{i,1}, \ldots, c_{i,M_i}$ be the collection of elements of $\bar c^{(n)}$ which sum to $c^{(m)}_i$. If
$g^{(n)} \in \omega^{(n)}$, then we may define the restriction of $g^{(n)}$ to $\bar c^{(m)}$, written $g^{(n)}|_m$,
as follows:

$$g^{(n)}|_m(c^{(m)}_i) = g^{(n)}(c_{i,1}) + \cdots + g^{(n)}(c_{i,M_i}).$$


CLAIM 2.109 Let $g^{(n)} \in \omega^{(n)}$ with $0 \le m < n$. Then $g^{(n)}|_m \in \omega^{(m)}$.


Proof We must prove that G1 and G2 hold of $g^{(n)}|_m$. G1 is trivial. For G2,
we note that, by construction,

$$g^{(n)}|_m(c^{(m)}_i) + g^{(n)}|_m(c^{(m)}_j) = g^{(n)}(c_{i,1}) + \cdots + g^{(n)}(c_{i,M_i}) + g^{(n)}(c_{j,1}) + \cdots + g^{(n)}(c_{j,M_j}).$$

By Claim 2.108, this element of S is connected if and only if the element of A

$$c_{i,1} + \cdots + c_{i,M_i} + c_{j,1} + \cdots + c_{j,M_j} = c^{(m)}_i + c^{(m)}_j$$

is connected. Hence G2 holds as required. QED

CLAIM 2.110 Let $g \in \omega^{(n)}$. Then there exists a $g' \in \omega^{(n+1)}$ such that
$g'|_n = g$.


Proof Choose any $g'' \in \omega^{(n+1)}$. By Claim 2.109, $g''|_n \in \omega^{(n)}$. Letting
$\bar r = g(c_1), \ldots, g(c_{N_n})$ and $\bar s = g''|_n(c_1), \ldots, g''|_n(c_{N_n})$, we see that $\bar r$ and
$\bar s$ are $c^3$-partitions in S with the same neighbourhood graphs, namely the
neighbourhood graph of $c_1, \ldots, c_{N_n}$. By Theorem 2.78, let $h : S^2 \to S^2$ be
a homeomorphism taking $\bar s$ to $\bar r$. Thus, $h \circ g''$ maps $\bar c^{(n+1)}$ to the faces of
a plane graph G in $S^2$ whose edges include the frontiers of the elements $\bar r$.
Now let $h' : S^2 \to S^2$ be a deformation making all the curved edges of G
piecewise linear, while leaving any already piecewise linear edges unaffected.
By Theorem 2.82, $g' = h' \circ h \circ g'' \in \omega^{(n+1)}$ maps $\bar c^{(n+1)}$ to an $N_{n+1}$-tuple in
S and it is easy to see that $g'$ satisfies the conditions of the claim. QED
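
The restriction operation used in Claims 2.109 and 2.110 simply sums the images of the finer cells that make up each coarser cell. A toy sketch, assuming regions are modelled as finite sets of atoms so that Boolean sum becomes set union; the names `restrict` and `refinement`, and all the data, are hypothetical:

```python
from typing import Dict, FrozenSet, List

Region = FrozenSet[str]  # toy stand-in for an element of S


def restrict(g_fine: Dict[str, Region],
             refinement: Dict[str, List[str]]) -> Dict[str, Region]:
    """Restrict a map on a finer partition to a coarser one.

    `refinement[d]` lists the finer cells that sum to the coarser cell d;
    the image of d is the sum (here: union) of the images of those cells.
    """
    g_coarse = {}
    for coarse_cell, fine_cells in refinement.items():
        image: Region = frozenset()
        for cell in fine_cells:
            image = image | g_fine[cell]
        g_coarse[coarse_cell] = image
    return g_coarse


# Finer partition {c1, c2, c3} refining the coarser partition {d1, d2}.
g_fine = {
    "c1": frozenset({"f1"}),
    "c2": frozenset({"f2"}),
    "c3": frozenset({"f3", "f4"}),
}
refinement = {"d1": ["c1", "c2"], "d2": ["c3"]}
g_coarse = restrict(g_fine, refinement)
```

In this toy model the restricted images are pairwise disjoint and cover the same atoms as before, loosely mirroring why condition G1 is immediate for the restricted map.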


By Claim 2.110, there exists a sequence of embeddings:

$$g^{(0)}, g^{(1)}, g^{(2)}, \ldots$$

such that, for all $n$ ($0 \le n$), $g^{(n)}$ maps $\bar c^{(n)}$ to S, and, for all $m, n$ ($0 \le m < n$),
$g^{(n)}|_m = g^{(m)}$.

Now let $a \in A$ be such that $a = c^{(n)}_{i_1} + \cdots + c^{(n)}_{i_k}$. Then we define

$$g(a) = g^{(n)}(c^{(n)}_{i_1}) + \cdots + g^{(n)}(c^{(n)}_{i_k}).$$

The fact that $g^{(n)}|_m = g^{(m)}$ whenever $0 \le m < n$ means that this mapping is
well defined. It is easy to see that $g : A \to S$ is a Boolean algebra isomorphism;
moreover, by Claim 2.108, $g(a)$ is connected if and only if a is connected. That
is, we have proved:


CLAIM 2.111 $\mathfrak{A}$ can be isomorphically embedded in ROP($S^2$), regarded as
a $\{c, \le\}$-structure.


In view of Claim 2.111, and in order to simplify notation, we might as well take
$\mathfrak{A}$ to be a substructure of ROP($S^2$). Note that the previously distinct uses of
the Boolean functions and the term "connected" become unambiguous, as do
"connected partition", "$c^n$-partition", "neighbour", and so on. Moreover, since
$A \subseteq S$, we may meaningfully talk about the frontier $F(a)$ of any $a \in A$, and
apply all the results established previously about elements of ROP($S^2$). For
example, by Lemma 2.73, if $r_1, \ldots, r_n$ is a $c^3$-partition in A radial about $r_1$ such
that $r_1$ has at least 2 neighbours, then, for any neighbour $r_i$ of $r_1$, $F(r_1) \cap F(r_i)$
is a Jordan arc. Recall that, for tuples $\bar r$ and $\bar s$ from ROP($S^2$), we write $\bar r \sim \bar s$
if $\bar r$ and $\bar s$ are similarly situated (in $S^2$).


Stage 3: In the previous stage, we established that $\mathfrak{A}$ can be chosen to be a
substructure of ROP($S^2$). In this stage, we show that, in that case, $\mathfrak{A}$ is in fact
an elementary substructure of ROP($S^2$).


Figure 2.14. The "hub" $a_1$ of a radial partition.


CLAIM 2.112 Let aj,...,@, € A be ac?-partition radial about a; such that 
ay has atleast 3 neighbours. Letri, ro € S be disjoint 2-cells witha, = r1 +ro. 
Then there exist c1, c2 € A such that a1, ... , an, C1, C2 ~ Q1, ..., An, T1, T2. 


Proof Since $a_1, r_1, r_2$ are 2-cells with $a_1$ equal to the disjoint sum of $r_1$ and
$r_2$, $r_1$ and $r_2$ must be separated by a cross-cut $\gamma$ in $a_1$. For any neighbour
$a_i$ of $a_1$, $F(a_1) \cap F(a_i)$ is a Jordan arc. Let $p \in F(a_1)$. By inspection, $p$
lies on either one or two Jordan arcs of the form $F(a_1) \cap F(a_i)$ where $a_i$ is
a neighbour of $a_1$. We define the character of $p$, written $\chi(p)$, to be the set of
those $i$ ($2 \leq i \leq n$) such that $a_i$ is a neighbour of $a_1$ and $p \in F(a_i)$ (Fig. 2.14a).
Note that $\chi(p)$ has either 1 or 2 elements. If $\chi(p)$ has one element, then $p$ lies
on some Jordan arc $F(a_1) \cap F(a_i)$, but not at its endpoints. If $\chi(p)$ has two
elements, then since $a_1$ has at least three neighbours, $\chi(p)$ determines $p$. Now
let $\gamma$ be a cross-cut in $a_1$. We define the character of $\gamma$, written $\chi(\gamma)$, to be the set
of characters of its endpoints. (See Fig. 2.14b and Fig. 2.14c for examples.) It
is routine to show that, if $\gamma_1$ and $\gamma_2$ are two such cross-cuts and $\chi(\gamma_1) = \chi(\gamma_2)$,
there is a homeomorphism of the closed plane onto itself taking $a_i$ to itself for
all $i$ ($1 \leq i \leq n$) and taking $\gamma_1$ to $\gamma_2$. So, to prove the lemma, it suffices to
establish that, if $\gamma_1$ is any cross-cut in $a_1$, there exist disjoint 2-cells $c_1, c_2 \in A$
with $a_1 = c_1 + c_2$ such that the cross-cut $\gamma_2$ separating $c_1$ and $c_2$ in $a_1$ satisfies

$$\chi(\gamma_1) = \chi(\gamma_2).$$

Let the endpoints of $\gamma_1$ be $p$ and $q$. We prove the result for the special case where
$\chi(\gamma_1)$, $\chi(p)$ and $\chi(q)$ all contain two elements; the other cases are dealt with
similarly. Fig. 2.15a shows the sub-case where $\chi(p)$ and $\chi(q)$ are non-disjoint;
Fig. 2.15b shows the sub-case where $\chi(p)$ and $\chi(q)$ are disjoint.
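
The notion of character can be illustrated with a small combinatorial sketch. It is only a toy model, under the assumption (not made in this form in the text) that the arcs in which the neighbours meet the hub are listed in the cyclic order in which they occur around the frontier of the hub; the function names are hypothetical. It shows why at least three neighbours are needed for a two-element character to determine a point.

```python
from collections import Counter


def junction_characters(neighbours):
    """Characters of the junction points on the frontier of the hub a_1.

    `neighbours` lists the indices i of the neighbours a_i of the hub, in the
    (assumed) cyclic order in which their arcs occur around the frontier.
    Junction k is the point where arc k meets arc k+1, taken cyclically.
    """
    n = len(neighbours)
    return [frozenset({neighbours[k], neighbours[(k + 1) % n]})
            for k in range(n)]


def crosscut_character(char_p, char_q):
    """Character of a cross-cut: the set of characters of its two endpoints."""
    return frozenset({frozenset(char_p), frozenset(char_q)})


def junctions_determined(neighbours):
    """True if every two-element character picks out exactly one junction."""
    counts = Counter(junction_characters(neighbours))
    return all(count == 1 for count in counts.values())
```

With three neighbours, `junctions_determined([2, 3, 4])` holds, since the three junctions have pairwise distinct characters. With only two neighbours, both junctions have the same character, so the character no longer determines the point.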


The sub-case of Fig. 2.15a is trivial: Axiom 8 with $a_1$ substituted for $x$ and $a_i$
for $y$ immediately guarantees the existence of $c_1, c_2 \in A$ partitioning $a_1$, and
hence separated by a cross-cut $\gamma_2$; moreover the connectivity conditions on $c_1$
and $c_2$ mean that $\gamma_1$ and $\gamma_2$ have the same endpoints, so that $\chi(\gamma_1) = \chi(\gamma_2)$.

Figure 2.15. The construction of a cross-cut with a given character.


The sub-case of Fig. 2.15b requires a little more work. However, two applications
of Axiom 8 guarantee the existence in $A$ of the regions $a_1'$, $a_1''$, as in
Fig. 2.15c. Axiom 7 then guarantees that the region labelled $a_1'$ in Fig. 2.15c
can be split into two regions as shown in Fig. 2.15d. Summing together appropriate
subdivisions of $a_1$ produces $c_1, c_2 \in A$ separated by an arc $\gamma_2$ satisfying

$$\chi(\gamma_1) = \chi(\gamma_2).$$ QED


The rest of this section is devoted to showing that we can relax the conditions 
of Claim 2.112. 


CLAIM 2.113 Let $n > 1$ and let $a_1, \ldots, a_n \in A$ be a partition such that $a_1$ is
a 2-cell. Let $r_1, r_2 \in S$ be disjoint 2-cells with $a_1 = r_1 + r_2$. Then there exist
$c_1, c_2 \in A$ such that $a_1, \ldots, a_n, c_1, c_2 \sim a_1, \ldots, a_n, r_1, r_2$.


Proof Immediate given claims 2.65 and 2.112. QED 


CLAIM 2.114 Let $n > 1$ and let $a_1, \ldots, a_n \in A$ be a partition such that $a_1$
is a 2-cell. Let $r \in S$ be such that $r \leq a_1$. Then there exists $c \in A$ such that
$a_1, \ldots, a_n, c \sim a_1, \ldots, a_n, r$.


Proof By the construction of $S = \mathrm{ROP}(S^2)$, we can partition $a_1$ into 2-cells
$r_1, \ldots, r_m$ such that $r$ can be expressed as the sum of various $r_i$. It suffices to
show that there are $c_1, \ldots, c_m \in A$ such that

$$a_1, \ldots, a_n, r_1, \ldots, r_m \sim a_1, \ldots, a_n, c_1, \ldots, c_m.$$

We proceed by induction on $m$. If $m = 1$, then $r_1 = a_1$ and we are done. If
$m > 1$, by Corollary 2.67, we can renumber the $r_i$ if necessary so that $r_1$ and
$r_1' = r_2 + \cdots + r_m$ are 2-cells. By Claim 2.113, there exist $c_1, c_1' \in A$ such
that $a_1, \ldots, a_n, r_1, r_1' \sim a_1, \ldots, a_n, c_1, c_1'$. Let $h$ be a homeomorphism of the
closed plane onto itself mapping $a_i$ to itself, $r_1$ to $c_1$ and $r_1'$ to $c_1'$. By exactly
the same argument as for Lemma 2.91, $h$ can be chosen so that $h(r_i) \in S$ for
all $i$ ($2 \leq i \leq m$). But then the $h(r_i)$ partition the 2-cell $c_1'$ into 2-cells. So
consider the partition $c_1', c_1, a_2, \ldots, a_n$. By inductive hypothesis, there exist
$c_2, \ldots, c_m \in A$ such that

$$c_1', c_1, a_2, \ldots, a_n, h(r_2), \ldots, h(r_m) \sim c_1', c_1, a_2, \ldots, a_n, c_2, \ldots, c_m.$$

The result then follows. QED


CLAIM 2.115 Let $n > 1$ and let $a_1, \ldots, a_n \in A$ be a $c^0$-partition. Let $r \in S$.
Then there exists $c \in A$ such that $a_1, \ldots, a_n, c \sim a_1, \ldots, a_n, r$.


Proof Write $r$ as the sum $r \cdot a_1 + \cdots + r \cdot a_n$. By considering these terms
separately, we use Claim 2.114 and an induction similar to that used in the proof
of Claim 2.114. The details are routine. QED


CLAIM 2.116 Let $n > 0$ and let $a_1, \ldots, a_n \in A$. Let $r \in S$. Then there exists
$c \in A$ such that $a_1, \ldots, a_n, r \sim a_1, \ldots, a_n, c$.


Proof Immediate given Claims 2.54, 2.63 and 2.115. QED


CLAIM 2.117 $\mathfrak{A} \preceq \mathrm{ROP}(S^2)$.


Proof We certainly have $\mathfrak{A} \subseteq \mathrm{ROP}(S^2)$. Let $n > 0$ and let $\phi(x_1, \ldots, x_n)$
be any formula of the form $\exists y\, \psi(x_1, \ldots, x_n, y)$. Let $a_1, \ldots, a_n \in A$ be such that
$\mathrm{ROP}(S^2) \models \phi[a_1, \ldots, a_n]$. Then there exists $r \in S$ such that
$\mathrm{ROP}(S^2) \models \psi[a_1, \ldots, a_n, r]$. By Claim 2.116, there exists $c \in A$ such that
$a_1, \ldots, a_n, r \sim a_1, \ldots, a_n, c$. By Lemmas 2.91 and 2.92,
$\mathrm{ROP}(S^2) \models \psi[a_1, \ldots, a_n, c]$. The claim then follows by Proposition 2.95. QED


By Claim 2.117, $\mathfrak{A}$ and $\mathrm{ROP}(S^2)$ have the same theory. But by construction,
$\mathfrak{A} \models \neg\phi$, whence $\mathrm{ROP}(S^2) \models \neg\phi$, which completes the proof of Lemma 2.105.


QED





COROLLARY 2.118 All splittable, finitely decomposable mereotopologies over
$S^2$ with curve-selection have the same $L_{c,\leq}$-theory, and hence also the same
$L_C$-theory.


Thus, while Examples 2.17, 2.18 and 2.100 show that there certainly are elementarily
inequivalent mereotopologies over $\mathbb{R}^2$ and $S^2$, Corollary 2.118 indicates
that there is nothing like the free-for-all one might initially expect. At least
for the signatures $\{c, \leq\}$ and $\{C\}$, the reference to Ty as the standard first-
order mereotopological theory of the closed plane is justified. Corollary 2.174
generalizes this result to apply to any signature of topological primitives.

For reasons of simplicity (which we trust the reader will appreciate) we have
provided an axiomatization of well-behaved plane mereotopologies only for the
language $L_{c,\leq}$. It should be clear from the foregoing discussion, however, that
an analogous result could be obtained for the language $L_C$, which, as we noted,
is more expressive over $\mathrm{ROP}(\mathbb{R}^2)$. Such an axiomatization was developed
in Schoop, 1999.

Of course, it is one thing to have an axiomatic characterization of the $L_{c,\leq}$-
theory of $\mathrm{ROP}(S^2)$, quite another to determine whether a given $L_{c,\leq}$-sentence
is a member of it. The question therefore arises as to the computational
characteristics of this problem. Dornheim, 1998 showed (in effect) that this theory is
undecidable and hence (since it is a complete theory), not r.e. It follows that the
$\omega$-rule (or some equivalent mechanism) is indispensable in this axiomatization.
In fact, Schaefer and Štefankovič, 2004 showed (in effect) that the decision
problem for $\mathrm{Th}_{c,\leq}(\mathrm{ROP}(S^2))$ is at least as hard as that of second-order
arithmetic. Specifically, Schaefer and Štefankovič effectively encode second-order
arithmetic in a first-order language with variables ranging over 2-cells in $\mathbb{R}^2$
and primitive predicates expressing the so-called RCC-relations (see Randell
et al., 1992, Egenhofer, 1991); but it is easy to see that that theory can in turn be
effectively encoded in $\mathrm{Th}_{c,\leq}(\mathrm{ROP}(S^2))$. Schaefer and Štefankovič also consider
the complexity of the quantifier-free fragment of their logic, a problem closely
related to the well-known problem of recognizing so-called string-graphs (see
e.g. Erlich et al., 1976, Kratochvíl, 1988), and show that it is in NEXPTIME.
In Schaefer et al., 2003, this bound is improved to NP, a very surprising result.


6. Spatial mereotopology 


In this section, we extend the main results of Sec. 4 to the spatial
mereotopology $\mathrm{ROP}(\mathbb{R}^3)$. This material is a tidied up version of Pratt and Schoop,
2002.





6.1 Facts about $\mathrm{ROP}(\mathbb{R}^3)$ and $\mathrm{ROP}(S^3)$


Recall that a 2-manifold is a Hausdorff space locally homeomorphic at every
point to the open disc $B^2$, and that a surface is a connected 2-manifold.


LEMMA 2.119 Let $X$ be either $\mathbb{R}^n$ or $S^n$, and let $M$ be a mereotopology over
$X$ having curve-selection. If $r \in M$ with $r$ and $-r$ both connected, then $F(r)$
is connected.


Proof Consider the case $X = \mathbb{R}^n$. Let $r \in M$ be connected and non-empty
with connected, non-empty complement, and suppose the closed set $F(r)$ is
not connected. Let $d_1$ and $d_2$ be closed sets partitioning $F(r)$, and let $p \in r$,
$q \in -r$. Since $r$ is connected with connected complement, it is easy to see that
the conditions of Proposition 2.19 are fulfilled, so that $p$ and $q$ are connected in
$\mathbb{R}^n \setminus (d_1 \cup d_2)$. But this is absurd given that $d_1 \cup d_2 = F(r)$. The case $X = S^n$
follows easily. QED


LEMMA 2.120 Let $r \in \mathrm{ROP}(S^3)$ be such that $r$ and $-r$ are non-empty and
connected, and $F(r)$ is not a surface. Then the graph $K^5$ can be drawn
in $F(r)$.


Proof It is easy to see that $F(r)$ can be finitely triangulated. Call any point
where $F(r)$ is not locally homeomorphic to $B^2$ a bad point; and call any edge
of the triangulation all of whose points are bad a bad edge. By the properties of
triangulations, any bad point either occurs on a bad edge or else is an isolated
bad point at a vertex of the triangulation.


If there is a bad edge, then more than two triangles must share this edge, and the
embedding of $K^5$ in $F(r)$ proceeds as shown in Fig. 2.16a. Assume, then, that
there are no bad edges, but that some vertex $p$ of the triangulation is an isolated
bad point. Call two triangles with $p$ as a vertex neighbours if they share an edge
having $p$ as a vertex. Since all edges are good, these triangles can clearly be
arranged into disjoint cycles such that each triangle belongs to the same cycle
as its two neighbours. Choose one such cycle. By applying a homeomorphism
if necessary, we may assume that this triangle-cycle forms a cone with vertex
$p$ as shown in Fig. 2.16b. Since there are only finitely many triangles in the
triangulation, we can ensure that we choose a triangle-cycle such that the points
inside the tip of the cone either all belong to $r$ or all belong to $-r$. Let $s$ be
either $r$ or $-r$ depending on which of these possibilities is realized. Note that,
since $r$ is non-empty and connected with non-empty, connected complement,
so is $s$.


Let $t \in \mathrm{ROP}(S^3)$ be a small element representing the tip of the cone, indicated
by the light dotted lines in Fig. 2.16b. Removing $t$ from $s$ visibly does not
disconnect $s$, so that $s \cdot -t$ is connected; moreover, $t$ shares some face with $-s$,
so that $t + -s = -(s \cdot -t)$ is also connected. Thus, $s \cdot -t$ is non-empty and
connected with non-empty connected complement, whence, by Lemma 2.119,
$F(s \cdot -t)$ is connected. Moreover, since $p$ is bad, there must be at least two
triangle-cycles with $p$ as vertex; whence $p \in F(s \cdot -t)$. Thus we may choose a
point $q$ on the base rim of $t$ and connect it to $p$ by a Jordan arc $\alpha$ in $F(s)$ such
that the locus of $\alpha$ is disjoint from $F(t)$ except for its endpoints, as shown in
Fig. 2.16c. The embedding of $K^5$ in $F(s) = F(r)$ then proceeds as depicted.


QED 
Figure 2.16. Embedding $K^5$ in non-surfaces (Proof of Lemma 2.120).


One notable difference between $S^2$ and $S^3$ is that the Schönflies Theorem,
which holds in the former, fails in the latter. In fact, the pathological “region”
known as Alexander’s horned sphere (Alexander, 1924a) is the best-known
counterexample: the frontier of this region is homeomorphic to $S^2$, but its
exterior is not simply connected, and is certainly therefore not homeomorphic
to $B^3$. Nevertheless, Alexander, 1924b also proved a Schönflies-type result for
polyhedra, which, in our notation, can be written as follows. (See also Moise,
1977, Ch. 17.)


PROPOSITION 2.121 Let $r \in \mathrm{ROP}(S^3)$ be such that $F(r)$ is homeomorphic
to $S^2$. Then both $r^-$ and $(-r)^-$ are homeomorphic to $D^3$.


To avoid cumbersome locutions in the sequel, we define: 


DEFINITION 2.122 Let $X$ be either $\mathbb{R}^n$ or $S^n$. A ball in $X$ is a subset of $X$
similarly situated in $X$ to the unit ball $B^n$. A polyhedral ball in $X$ is a ball
which is an element of $\mathrm{ROP}(X)$.


Thus, if $r \in \mathrm{ROP}(S^3)$ with $F(r)$ homeomorphic to $S^2$, then $r$ and $-r$ are both
balls in $S^3$. Furthermore, if $r \in \mathrm{ROP}(\mathbb{R}^3)$ with $F(r)$ homeomorphic to $S^2$ (and
hence bounded), then exactly one of $r$ and $-r$ is a ball in $\mathbb{R}^3$. We note in
passing:


LEMMA 2.123 If $r \in \mathrm{RO}(S^3)$ is a (polyhedral) ball in $S^3$, then so is $-r$.

Proof By definition, $r$ is similarly situated in $S^3$ to $u = B^3(0, 1)$. By considering
a spherical inversion, $u$ is similarly situated in $S^3$ to $-u$. QED


The following well-known theorem will also prove useful in the sequel (see, 
e.g. Massey, 1967, p. 10). 


PROPOSITION 2.124 (CLASSIFICATION THEOREM FOR SURFACES)
Every compact surface is homeomorphic to either (i) $S^2$ or (ii) the sum of
finitely many connected tori or (iii) the sum of finitely many projective planes.

6.2 Expressing familiar spatial concepts in $L_C$


Our next task is to show that certain familiar concepts defined on the
mereotopology $\mathrm{ROP}(\mathbb{R}^3)$ can be expressed using $L_C$-formulas. As a preliminary,
recall the discussion of Sec. 3.2, which showed that: (i) expressions such as
$x^- \cap y^- \cap z^- \neq \emptyset$ etc. can be regarded as $L_C$-formulas; and (ii) there is an
$L_C$-formula with free variables $x$, $y$ which we may read as “$x^- \cap y^-$ is connected”.

Now suppose $r$ and $s$ are elements of $\mathrm{ROP}(\mathbb{R}^3)$, and consider, for example,
the set $F(r) \setminus F(s)$. Evidently, this set is connected if and only if it is
piecewise-linear arc-connected, and therefore if and only if any two points in
it are contained within some connected set of the form $r^- \cap t^- \subseteq F(r) \setminus F(s)$
with $t \in \mathrm{ROP}(\mathbb{R}^3)$. It follows from the discussion of Sec. 3.2 that there is an
$L_C$-formula satisfied by a pair of regions $r, s$ if and only if $F(r) \setminus F(s)$ is
connected. In the sequel, then, we write, without further commentary, expressions
such as $c(F(x) \setminus F(y))$ etc. as $L_C$-formulas having the obvious interpretations.


LEMMA 2.125 There exists an $L_C$-formula $\phi_{K^5}(x)$ such that, for all
$r \in \mathrm{ROP}(\mathbb{R}^3)$, $\mathrm{ROP}(\mathbb{R}^3) \models \phi_{K^5}[r]$ if and only if $K^5$ is embeddable in $F(r)$.


Proof The graph $K^5$ is evidently embeddable in $F(r)$ if and only if there exist
polyhedra $v_i$ ($1 \leq i \leq 5$) and $e_{ij}$ ($1 \leq i < j \leq 5$), all disjoint from $r$ and from
each other, satisfying the following conditions:

1 for all $i$ ($1 \leq i \leq 5$), $v_i^- \cap r^-$ is a singleton;

2 for all $i, j$ ($1 \leq i < j \leq 5$), $e_{ij}^- \cap r^-$ is connected;

3 for all $i, j, i', j'$ ($1 \leq i < j \leq 5$, $1 \leq i' < j' \leq 5$), $\{i, j\} \cap \{i', j'\} = \emptyset$
implies $e_{ij}^- \cap e_{i'j'}^- \cap r^- = \emptyset$, and $\{i, j\} \cap \{i', j'\} = \{k\}$ implies
$e_{ij}^- \cap e_{i'j'}^- \cap r^- = v_k^- \cap r^-$.

(Note incidentally that the polyhedra $e_{ij}$ are not themselves required to be
connected, only the sets $e_{ij}^- \cap r^- = F(e_{ij}) \cap F(r)$.) But the above conditions
are expressible in $L_C$ over $\mathrm{ROP}(\mathbb{R}^3)$. QED
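
To get a feel for the size of the third condition, the following sketch enumerates the pairs of distinct edges of $K^5$ and sorts them into those with no common endpoint and those with exactly one. It is an illustration only: the names are hypothetical, and it lists unordered pairs of distinct edges, whereas the condition in the proof quantifies over all index pairs.

```python
from itertools import combinations

VERTICES = range(1, 6)
# The 10 edges of K^5, written as index pairs (i, j) with i < j.
EDGES = list(combinations(VERTICES, 2))


def condition_3_cases():
    """Split the unordered pairs of distinct edges into two lists.

    `disjoint` holds pairs (e, f) with no common endpoint.
    `sharing` holds triples (e, f, k), where k is the single common endpoint.
    """
    disjoint, sharing = [], []
    for e, f in combinations(EDGES, 2):
        common = set(e) & set(f)
        if not common:
            disjoint.append((e, f))
        else:
            (k,) = common  # two distinct edges share at most one endpoint
            sharing.append((e, f, k))
    return disjoint, sharing
```

Each of the resulting pairs corresponds to one instance of the third condition, either an emptiness requirement or a requirement that the two edges meet only at the shared vertex.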


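LEMMA 2.126 There exists an $L_C$-formula $\phi_{p^*}(x)$ such that, for all
$r \in \mathrm{ROP}(\mathbb{R}^3)$:

1 if $F(r)$ is connected and unbounded, then $\mathrm{ROP}(\mathbb{R}^3) \models \phi_{p^*}[r]$;

2 if $F(r)$ is homeomorphic to $S^2$, then $\mathrm{ROP}(\mathbb{R}^3) \not\models \phi_{p^*}[r]$.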





Proof Let $\phi_{p^*}(x)$ be

$$\exists y_1 \exists y_2 (y_1 \cdot x = 0 \wedge y_2 \cdot x = 0 \wedge c(F(x) \cap F(y_1) \cap F(y_2)) \wedge
c(F(x) \setminus F(y_1)) \wedge c(F(x) \setminus F(y_2)) \wedge \neg c(F(x) \setminus (F(y_1) \cup F(y_2)))).$$

Figure 2.17. Arrangement of $F(r) \cap F(s_1)$ and $F(r) \cap F(s_2)$ on $G$ (Proof of Lemma 2.126).

Thus, $\phi_{p^*}(x)$ “says” that there exist polyhedra $y_1$ and $y_2$, disjoint from $x$, such
that the sets $F(x) \cap F(y_1) \cap F(y_2)$, $F(x) \setminus F(y_1)$ and $F(x) \setminus F(y_2)$ are all
connected, but the set $F(x) \setminus (F(y_1) \cup F(y_2))$ is not.


Suppose $F(r)$ is connected and unbounded. Let $r$ be a Boolean combination
of finitely many half-spaces, corresponding to a finite set of planes, say,
$P_1, \ldots, P_m$; it is then easy to see that $F(r) \subseteq P_1 \cup \cdots \cup P_m$. Since $F(r)$ is
unbounded, we can draw in $F(r)$ a rectangular figure $G$, unbounded on one
side (dotted lines in Fig. 2.17), such that $G$ intersects only one of the $P_i$. Let
$s_1, s_2 \in \mathrm{ROP}(\mathbb{R}^3)$ be laminas, infinitely extended in one direction, and placed
on $G$ (on the outside of $r$) so that $F(r) \cap F(s_1)$ and $F(r) \cap F(s_2)$ are arranged
as shown. Since $F(r)$ is connected, $F(r) \setminus F(s_1)$ and $F(r) \setminus F(s_2)$ are also
connected; and since $G$ lies on just one of the $P_i$, $F(r) \setminus (F(s_1) \cup F(s_2))$ is not
connected. Thus $\mathrm{ROP}(\mathbb{R}^3) \models \phi_{p^*}[r]$. The second part of the Lemma follows by
Proposition 2.47. QED





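Let $\phi_c(x)$ be the $L_C$-formula defined in Lemma 2.27 and satisfied by
$r \in \mathrm{ROP}(\mathbb{R}^3)$ if and only if $r$ is connected, and let $\phi_J(x)$ abbreviate the formula

$$x \neq 0 \wedge x \neq 1 \wedge \phi_c(x) \wedge \phi_c(-x).$$

LEMMA 2.127 For all $r \in \mathrm{ROP}(\mathbb{R}^3)$, $r$ satisfies $\phi_J(x) \wedge \neg\phi_{K^5}(x) \wedge \neg\phi_{p^*}(x)$
if and only if $F(r)$ is homeomorphic to $S^2$.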


Proof Suppose $F(r)$ is homeomorphic to $S^2$. Certainly, by Proposition 2.121,
$\mathrm{ROP}(\mathbb{R}^3) \models \phi_J[r]$; by Lemma 2.125, $\mathrm{ROP}(\mathbb{R}^3) \models \neg\phi_{K^5}[r]$; by Lemma 2.126,
$\mathrm{ROP}(\mathbb{R}^3) \models \neg\phi_{p^*}[r]$. Conversely, suppose that $r$ satisfies
$\phi_J(x) \wedge \neg\phi_{K^5}(x) \wedge \neg\phi_{p^*}(x)$. By Lemma 2.119, $F(r)$ is connected, and by the first part of
Lemma 2.126, $F(r)$ is bounded. Moreover, $K^5$ cannot be embedded in $F(r)$,
by Lemma 2.125. Hence $F(r) = F(\tilde{r})$ is a compact surface, by Lemma 2.120.
The result then follows from Proposition 2.124. QED


LEMMA 2.128 Let $r \in \mathrm{ROP}(\mathbb{R}^3)$ satisfy $\phi_J(x) \wedge \neg\phi_{K^5}(x) \wedge \phi_{p^*}(x)$. Then
$r$ is unbounded.


Proof Suppose for contradiction that $r$ is bounded, so that we also have
$r \in \mathrm{ROP}(S^3)$. By Lemma 2.120, $F(r)$ is a surface. Moreover, since $r$ is bounded,
$F(r)$ is compact, and since $K^5$ cannot be drawn in $F(r)$, $F(r)$ is homeomorphic
to $S^2$ by Proposition 2.124. But since $\mathrm{ROP}(\mathbb{R}^3) \models \phi_{p^*}[r]$, this contradicts the
second part of Lemma 2.126. Hence, $r$ is unbounded. QED


LEMMA 2.129 There exists an $L_C$-formula $\phi_{bd}(x)$ such that, for all
$r \in \mathrm{ROP}(\mathbb{R}^3)$, $\mathrm{ROP}(\mathbb{R}^3) \models \phi_{bd}[r]$ if and only if $r$ is bounded.


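Proof Let $\phi_{bd}(x)$ be the formula

$$\exists y \exists z (x \leq y \wedge y \cdot z = 0 \wedge
\phi_J(y) \wedge \neg\phi_{K^5}(y) \wedge \neg\phi_{p^*}(y) \wedge \phi_J(z) \wedge \neg\phi_{K^5}(z) \wedge \phi_{p^*}(z)).$$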


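If $r$ is bounded, let $s \in \mathrm{ROP}(\mathbb{R}^3)$ be a ball in $\mathbb{R}^3$ such that $r \leq s$; and let
$t \in \mathrm{ROP}(\mathbb{R}^3)$ be a half-space disjoint from $s$. By Lemma 2.126,
$\mathrm{ROP}(\mathbb{R}^3) \models \neg\phi_{p^*}[s]$ and $\mathrm{ROP}(\mathbb{R}^3) \models \phi_{p^*}[t]$. Thus, $s$ and $t$ are suitable witnesses for $y$ and
$z$ in $\phi_{bd}(x)$, so that $\mathrm{ROP}(\mathbb{R}^3) \models \phi_{bd}[r]$.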


Conversely, suppose that $\mathrm{ROP}(\mathbb{R}^3) \models \phi_{bd}[r]$. Let $s$ and $t$ be witnesses for
$y$ and $z$. By Lemma 2.127, $F(s)$ is homeomorphic to $S^2$, whence, by
Proposition 2.121, exactly one of $s$ and $-s$ is a ball in $\mathbb{R}^3$. By Lemma 2.128, $t$ is
unbounded, and so intersects the complement of every ball in $\mathbb{R}^3$. Therefore
$-s$ is not a ball in $\mathbb{R}^3$, so $s$ is. Hence, $r$ is bounded. QED











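THEOREM 2.130 There exists a formula $\phi_B(x)$ such that, for all $r \in \mathrm{ROP}(\mathbb{R}^3)$,
$\mathrm{ROP}(\mathbb{R}^3) \models \phi_B[r]$ if and only if $r$ is a polyhedral ball in $\mathbb{R}^3$.


Proof Let $\phi_B(x)$ be

$$\phi_J(x) \wedge \neg\phi_{K^5}(x) \wedge \neg\phi_{p^*}(x) \wedge \phi_{bd}(x),$$

and apply Lemmas 2.127 and 2.129. QED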

Thus, with a little effort, we can define certain familiar topological notions
over $\mathrm{ROP}(\mathbb{R}^3)$ using $L_C$-formulas. The following technical material, which
is devoted to defining some decidedly unfamiliar topological notions over
$\mathrm{ROP}(\mathbb{R}^3)$, will be used in the sequel. We recall the discussion of
compactifications in Sec. 3.3, and consider the mapping $r \mapsto \tilde{r}$ from $\mathrm{ROP}(\mathbb{R}^3)$ to its
1-point compactification $\mathrm{ROP}(S^3)$. By Lemmas 2.36 and 2.37, this mapping is
a Boolean algebra isomorphism and preserves the properties of connectedness
and non-connectedness. For technical reasons, we will occasionally need to
consider properties of elements in $\mathrm{ROP}(\mathbb{R}^3)$ whose defining conditions make
reference to their counterparts in $\mathrm{ROP}(S^3)$.

For all $r \in \mathrm{ROP}(\mathbb{R}^3)$, $\infty \in \tilde{r}$ if and only if $-r$ is bounded, and $\infty \in \tilde{r}^-$ if
and only if $r$ is unbounded (where the closure operator $^-$ refers to the topology
on $S^3$). By Lemma 2.129 then, it is harmless to employ the expression $\infty \in \tilde{x}$
in $L_C$-formulas, since we can take it as a mnemonic for $\phi_{bd}(-x)$; and similarly
for expressions such as $\infty \in \tilde{x}^-$, $\infty \in F(\tilde{x})$, etc.


LEMMA 2.131 There exists a formula $\phi_{\tilde{K}^5}(x)$ satisfied by $r \in \mathrm{ROP}(\mathbb{R}^3)$ if
and only if $K^5$ is embeddable in $F(\tilde{r})$.


Proof As for Lemma 2.125, making the obvious adjustments to accommodate 
the point at infinity. QED 


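LEMMA 2.132 There exists a formula $\phi_{\tilde{B}}(x)$ such that, for all
$r \in \mathrm{ROP}(\mathbb{R}^3)$, $\mathrm{ROP}(\mathbb{R}^3) \models \phi_{\tilde{B}}[r]$ if and only if $\tilde{r}$ is a ball in $S^3$.


Proof Let $\phi_{\tilde{B}}(x)$ be $\phi_J(x) \wedge \neg\phi_{\tilde{K}^5}(x)$. If $\tilde{r}$ is a ball in $S^3$, it is evident
that $\mathrm{ROP}(\mathbb{R}^3) \models \phi_{\tilde{B}}[r]$. Conversely, suppose $\mathrm{ROP}(\mathbb{R}^3) \models \phi_{\tilde{B}}[r]$. By
Lemmas 2.120 and 2.131, $F(\tilde{r})$ is a surface in $S^3$. Furthermore, by
Proposition 2.124, $F(\tilde{r})$ is homeomorphic to $S^2$. The result then follows by
Proposition 2.121. QED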


6.3 Characterizing triangulations in $L_C$


In Sec. 4, we showed that every tuple in $\mathrm{ROP}(\mathbb{R}^2)$ satisfies a topologically
complete $L_C$-formula, that is, an $L_C$-formula with the property that all tuples
satisfying it are similarly situated. Our proof exploited Whitney’s theorem
on 3-connected graphs in the plane to show that any $c^0$-partition in $\mathrm{ROP}(S^2)$
is determined up to similar situation by its neighbourhood graph. However,
Whitney’s theorem is not available for $S^3$, and so we must adopt an alternative
approach to analysing the expressive power of $L_C$ over $\mathrm{ROP}(\mathbb{R}^3)$. This
approach has the advantage of being, in some ways, more straightforward than
that of Sec. 4, though the topologically complete formulas it constructs are more
complicated.

Figure 2.18. The configuration of Proposition 2.133.


We assume familiarity with the basic theory of triangulations and PL-complexes:
for details, see, e.g., Moise, 1977, Ch. 7. We also require the
following “obvious” result about balls in $S^3$ (Pratt and Schoop, 2002,
Theorem 3.14).


PROPOSITION 2.133 Let $r, s \in \mathrm{RO}(S^3)$ be disjoint balls in $S^3$ such that $r + s$
is also a ball in $S^3$. Then $F(r) \cap F(s) \cap F(r + s)$ is the locus of a Jordan
curve, and $F(r) \cap F(s)$ is homeomorphic to the closed disc $D^2$.


The situation is illustrated in Fig. 2.18. 


DEFINITION 2.134 A quadruple $q = (r_1, r_2, r_3, r_4)$ of pairwise disjoint
elements of $\mathrm{ROP}(S^3)$ is a q-cell in $S^3$ if, for all non-empty $J \subseteq \{1, 2, 3, 4\}$, the
polyhedron $\sum_{j \in J} r_j$ is a ball in $S^3$.


The reference to the containing space $S^3$ is significant: in the sequel, we
introduce q-cells in $\mathbb{R}^3$. However, we sometimes speak simply of q-cells if it is clear
from context which space we are talking about (or if it makes no difference).


EXAMPLE 2.135 Consider the regular open tetrahedron $t_0$ with vertices $v_1 =
(0,0,0)$, $v_2 = (1,0,0)$, $v_3 = (0,1,0)$, $v_4 = (0,0,1)$. Let $t_1, t_2, t_3, t_4$ be the
four regular open tetrahedra (taken in some fixed order) each having three
vertices from $\{v_1, \ldots, v_4\}$ and the point $(1/4, 1/4, 1/4)$ as the fourth vertex
(Fig. 2.19). Evidently, the quadruple $q_0 = (t_1, t_2, t_3, t_4)$ is a q-cell.
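
As a sanity check on Example 2.135, the following sketch computes the volumes of $t_0$ and of the four tetrahedra obtained by joining each triangular face of $t_0$ to the point $(1/4, 1/4, 1/4)$, using exact rational arithmetic. The helper names are hypothetical. Equality of the volume sums is only a necessary condition for the four pieces to partition $t_0$; it is not a proof that the quadruple is a q-cell.

```python
from fractions import Fraction
from itertools import combinations

# Vertices v_1, ..., v_4 of t_0 and the common fourth vertex of t_1, ..., t_4.
V = [(0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1)]
APEX = (Fraction(1, 4), Fraction(1, 4), Fraction(1, 4))


def volume(p0, p1, p2, p3):
    """Volume of the tetrahedron p0 p1 p2 p3: |det[p1-p0, p2-p0, p3-p0]| / 6."""
    a, b, c = ([Fraction(x) - Fraction(y) for x, y in zip(p, p0)]
               for p in (p1, p2, p3))
    det = (a[0] * (b[1] * c[2] - b[2] * c[1])
           - a[1] * (b[0] * c[2] - b[2] * c[0])
           + a[2] * (b[0] * c[1] - b[1] * c[0]))
    return abs(det) / 6


total = volume(*V)                                             # volume of t_0
parts = [volume(*face, APEX) for face in combinations(V, 3)]   # t_1, ..., t_4
apex_inside = all(x > 0 for x in APEX) and sum(APEX) < 1
```

Here `total` and `sum(parts)` agree, and `apex_inside` confirms that the fourth vertex lies in the interior of $t_0$, which is what one expects if the four pieces fill $t_0$ without overlap.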


THEOREM 2.136 All q-cells in $S^3$ are similarly situated in $S^3$.


Proof Let $(a, b, c, d)$ be a q-cell. Since $a$, $b$, $c$, $a+b$, $b+c$, $a+c$ and $a+b+c$
are balls, by Proposition 2.133, the sets $F(a) \cap F(b)$, $F(a) \cap F(c)$, $F(b) \cap F(c)$
and $F(a + b) \cap F(c)$ are all closed discs. Letting $S = F(a + b + c)$, it is then
easy to show that the sets $F(a) \cap S$, $F(b) \cap S$ and $F(c) \cap S$ must be arranged on
$S$ as shown in Fig. 2.20a, up to similar situation. Let $e = -(a+b+c+d)$; then,
by Lemma 2.123, $\sum(B \cup \{e\})$ is a ball for any proper subset $B \subset \{a, b, c, d\}$.
Thus, all of the sets $a+b+c$, $d$, $e$, $a+b+c+d$ and $a+b+c+e$ are balls. By
Proposition 2.133 again, $F(d) \cap S$ and $F(e) \cap S$ are both closed discs, whose
common frontier in the space $S$ is the locus of some Jordan curve $\gamma$, say.


Figure 2.19. The q-cell $q_0$.


Consider how $\gamma$ might be drawn on $S$. Since $a + d$ and $a + e$ are balls, by
Proposition 2.133, $F(a) \cap F(d)$ and $F(a) \cap F(e)$ are closed discs. Similarly,
$F(b) \cap F(d)$, $F(b) \cap F(e)$, $F(c) \cap F(d)$ and $F(c) \cap F(e)$ are closed discs.
Hence $\gamma$ divides each of the three sets $F(a) \cap S$, $F(b) \cap S$ and $F(c) \cap S$ into
two residual domains. Moreover, $\gamma$ cannot pass through either of the points $X$
or $Y$ in Fig. 2.20a; for otherwise, one of the sets $F(a) \cap F(d)$, $F(b) \cap F(d)$,
$F(c) \cap F(d)$, $F(a) \cap F(e)$, $F(b) \cap F(e)$ or $F(c) \cap F(e)$ would fail to be a
closed disc. It is then easy to see that $\gamma$ and the region $F(d) \cap S$ it encloses
must lie in $S$ as shown in Fig. 2.20b or Fig. 2.20c, up to similar situation. But
these two arrangements of $a, b, c, d$ are mirror images. QED


NOTATION 2.137 If $q = (t_1, \ldots, t_4)$ is a q-cell, denote the component
polyhedron $t_i$ by $q[i]$ for all $i$ ($1 \leq i \leq 4$). Denote the polyhedron $t_1 + \cdots + t_4$
by $\hat{q}$.


In Example 2.135, $\hat{q}_0$ is the interior of the convex hull of the points $V =
\{v_1, \ldots, v_4\}$. We employ familiar terms from discussions of simplicial
complexes: a face of $q_0$ is the convex closure of any non-empty subset of $V$; a face
of $q_0$ is proper if it is not the whole of $(\hat{q}_0)^-$; a vertex of $q_0$ is an element of $V$.

Figure 2.20. Possible arrangements of $F(a) \cap S$, $F(b) \cap S$, $F(c) \cap S$ and $F(d) \cap S$, where
$S = F(a + b + c)$ (Proof of Theorem 2.136).


DEFINITION 2.138 Let $q$ be any q-cell, and $h$ a homeomorphism of $S^3$ onto
itself taking $q_0$ to $q$. A (proper) face of $q$ is a set of points $h(F)$, where $F$ is a
(proper) face of $q_0$. A vertex of $q$ is a point $h(v)$, where $v$ is a vertex of $q_0$.


We remark that, in Definition 2.138, a suitable homeomorphism $h$ can always
be found, by Theorem 2.136; moreover, since the faces of $q_0$ are expressible
as set-algebraic combinations of the polyhedra $t_1, \ldots, t_4$ and their topological
closures, the precise choice of $h$ does not matter. Thus, q-cells are simply
homeomorphic images of the q-cell $q_0$ of Example 2.135, with the notions of
face and vertex transferred in the obvious way.


DEFINITION 2.139 A q-cell partition (in $\mathrm{ROP}(S^3)$) is a sequence $\bar{q} = q_1, \ldots,
q_n$ of q-cells in $S^3$ such that (i) $\hat{q}_1, \ldots, \hat{q}_n$ is a partition in $\mathrm{ROP}(S^3)$; and (ii)
for all $i, j$ ($1 \leq i < j \leq n$), if $F$ is a face of $q_i$ and $G$ a face of $q_j$, then $F \cap G$
is either empty or a face of both $q_i$ and $q_j$. A vertex of a q-cell partition is a
vertex of one of its elements.


Thus, q-cell partitions define (finite) PL-complexes in the obvious way: each 
q-cell in the partition corresponds to a PL 3-simplex, and its proper faces to PL 
d-simplices for d < 3. 


DEFINITION 2.140 Let $\bar{q} = q_1, \ldots, q_n$ and $\bar{q}' = q_1', \ldots, q_n'$ be q-cell
partitions in $\mathrm{ROP}(S^3)$. We say that $\bar{q}$ and $\bar{q}'$ are isomorphic if there is a bijection
between the vertices of $\bar{q}$ and the vertices of $\bar{q}'$ such that, for all $i, j$ ($1 \leq i \leq n$,
$1 \leq j \leq 4$), the vertices of $q_i$ lying on the frontier of $q_i[j]$ are mapped to the
vertices of $q_i'$ lying on the frontier of $q_i'[j]$.


LEMMA 2.141 Isomorphic q-cell partitions in $\mathrm{ROP}(S^3)$ are similarly situated
in $S^3$.


Proof Isomorphic q-cell partitions define isomorphic PL-complexes. QED 


We conclude this subsection by extending the notions of q-cell and q-cell
partition to the open space $\mathbb{R}^3$.


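DEFINITION 2.142 A quadruple $q = (r_1, r_2, r_3, r_4)$ of elements of
$\mathrm{ROP}(\mathbb{R}^3)$ is a q-cell in $\mathbb{R}^3$ if $\tilde{q} = (\tilde{r}_1, \tilde{r}_2, \tilde{r}_3, \tilde{r}_4)$ is a q-cell in $S^3$. A sequence
$\bar{q} = q_1, \ldots, q_n$ of q-cells in $\mathbb{R}^3$ is a q-cell partition in $\mathrm{ROP}(\mathbb{R}^3)$ if $\tilde{q}_1, \ldots, \tilde{q}_n$
is a q-cell partition in $\mathrm{ROP}(S^3)$.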


DEFINITION 2.143 Let $\bar{q} = q_1, \ldots, q_n$ and $\bar{q}' = q_1', \ldots, q_n'$ be q-cell
partitions in $\mathrm{ROP}(\mathbb{R}^3)$. We say that $\bar{q}$ and $\bar{q}'$ are isomorphic if: (i) the corresponding
q-cell partitions $\tilde{q}_1, \ldots, \tilde{q}_n$ and $\tilde{q}_1', \ldots, \tilde{q}_n'$ in $\mathrm{ROP}(S^3)$ are isomorphic; and
(ii) for all $i, j$ ($1 \leq i \leq n$, $1 \leq j \leq 4$), $q_i[j]$ is bounded if and only if $q_i'[j]$ is
bounded.


Intuitively, knowing which $q_i[j]$ are bounded for a q-cell partition $q_1, \ldots, q_n$
in $\mathrm{ROP}(\mathbb{R}^3)$ amounts to knowing, up to homeomorphism, where the point at
infinity is in the corresponding q-cell partition in $\mathrm{ROP}(S^3)$. More precisely,
we have:


LEMMA 2.144 Let $\bar{q} = q_1, \ldots, q_n$ and $\bar{q}' = q_1', \ldots, q_n'$ be similarly situated
q-cell partitions in $\mathrm{ROP}(S^3)$. Let $p \in S^3$ be such that, for all $i, j$ ($1 \leq i \leq n$,
$1 \leq j \leq 4$), $p \in (q_i[j])^-$ if and only if $p \in (q_i'[j])^-$. Then there is a homeomorphism
$h : S^3 \to S^3$ fixing $p$ and mapping $\bar{q}$ to $\bar{q}'$.


Proof Parallel to the proof of Lemma 2.88. QED 


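THEOREM 2.145 Isomorphic q-cell partitions in $\mathrm{ROP}(\mathbb{R}^3)$ are similarly
situated in $\mathbb{R}^3$.


Proof Let $q_1, \ldots, q_n$ and $q_1', \ldots, q_n'$ be isomorphic q-cell partitions in $\mathrm{ROP}(\mathbb{R}^3)$.
Then $\tilde{q}_1, \ldots, \tilde{q}_n$ and $\tilde{q}_1', \ldots, \tilde{q}_n'$ are isomorphic q-cell partitions such that, for
all $i, j$ ($1 \leq i \leq n$, $1 \leq j \leq 4$), $\infty \in (\tilde{q}_i[j])^-$ if and only if $\infty \in (\tilde{q}_i'[j])^-$.
By Lemmas 2.141 and 2.144, there exists a homeomorphism $h$ of $S^3$ onto itself
mapping $\tilde{q}_1, \ldots, \tilde{q}_n$ to $\tilde{q}_1', \ldots, \tilde{q}_n'$, and fixing $\infty$. Thus, $h' = h \setminus \{(\infty, \infty)\}$ is
a homeomorphism of $\mathbb{R}^3$ onto itself mapping $q_1, \ldots, q_n$ to $q_1', \ldots, q_n'$. QED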





6.4 Expressive power of $L_C$ in $\mathrm{ROP}(\mathbb{R}^3)$


We are now ready to show that every tuple in $\mathrm{ROP}(\mathbb{R}^3)$ satisfies a formula
which is topologically complete in $\mathrm{ROP}(\mathbb{R}^3)$ over $\mathbb{R}^3$.





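LEMMA 2.146 For all $N > 0$, there exists a formula $\phi_N(\bar{x})$ such that, for
any $4N$-tuple $\bar{t}$ from $\mathrm{ROP}(\mathbb{R}^3)$, $\mathrm{ROP}(\mathbb{R}^3) \models \phi_N[\bar{t}]$ if and only if $\bar{t}$ is a q-cell
partition in $\mathrm{ROP}(\mathbb{R}^3)$.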





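Proof Let $\phi_{\tilde{B}}(x)$ be as defined in Lemma 2.132, and suppose $s_1, \ldots, s_4 \in
\mathrm{ROP}(\mathbb{R}^3)$. Then the quadruple $(\tilde{s}_1, \ldots, \tilde{s}_4)$ is a q-cell in $S^3$ if and only if
$\mathrm{ROP}(\mathbb{R}^3) \models \phi_q[s_1, \ldots, s_4]$, where $\phi_q(y_1, \ldots, y_4)$ is the formula

$$\bigwedge \Big\{ \phi_{\tilde{B}}\Big(\sum_{j \in J} y_j\Big) \;\Big|\; \emptyset \neq J \subseteq \{1, 2, 3, 4\} \Big\}.$$

The result then follows easily. QED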
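
The conjunction in this proof ranges over the 15 non-empty subsets of a four-element index set. The sketch below spells it out as a string. The ASCII syntax, the placeholder name `phi_B` (standing for the formula of Lemma 2.132) and the variable names are assumptions made for illustration; they are not notation from the text.

```python
from itertools import combinations


def nonempty_subsets(indices):
    """Yield every non-empty subset of `indices` as a tuple, smallest first."""
    for size in range(1, len(indices) + 1):
        yield from combinations(indices, size)


def q_cell_formula(variables=("y1", "y2", "y3", "y4"), ball="phi_B"):
    """Conjunction asserting that every non-empty sum of the variables is a ball."""
    conjuncts = [
        f"{ball}({' + '.join(variables[j] for j in J)})"
        for J in nonempty_subsets(range(len(variables)))
    ]
    return " & ".join(conjuncts)
```

Calling `q_cell_formula()` yields a conjunction of 15 atoms, beginning with `phi_B(y1)` and ending with `phi_B(y1 + y2 + y3 + y4)`.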


LEMMA 2.147 Let $\bar{t}$ be a $4N$-tuple forming an $N$-element q-cell partition in
$\mathrm{ROP}(\mathbb{R}^3)$. Then we can find a formula $\psi(\bar{x})$ such that, for any $4N$-tuple $\bar{t}'$ of
$\mathrm{ROP}(\mathbb{R}^3)$, $\mathrm{ROP}(\mathbb{R}^3) \models \psi[\bar{t}']$ if and only if $\bar{t}'$ is an $N$-element q-cell partition
isomorphic to $\bar{t}$.


Proof Almost immediate from Lemmas 2.129 and 2.146 and the discussion
of Sec. 3.2. QED


LEMMA 2.148 Every q-cell partition in $\mathrm{ROP}(\mathbb{R}^3)$ satisfies an $L_C$-formula
which is topologically complete in $\mathrm{ROP}(\mathbb{R}^3)$ over $\mathbb{R}^3$.


Proof Theorem 2.145 and Lemma 2.147. QED 


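LEMMA 2.149 Any $n$-tuple $\bar{r}$ from $\mathrm{ROP}(\mathbb{R}^3)$ can be refined to an $N$-element
q-cell partition. That is: there exists a $4N$-tuple $\bar{t}$ from $\mathrm{ROP}(\mathbb{R}^3)$ and a
$(4N \times n)$ Boolean array $A$ such that $\bar{t}$ forms a q-cell partition in $\mathrm{ROP}(\mathbb{R}^3)$
and $\bar{r} = \bar{t}A$.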


Proof By the definition of $\mathrm{ROP}(\mathbb{R}^3)$, we can certainly refine $\bar{r}$ to a
partition of convex regions of $\mathbb{R}^3$, each of which is bounded by a finite number
of planes, and thence, by triangulating these convex regions, into a partition
of polyhedra $t_1, \ldots, t_N$, such that each $\tilde{t}_i$ is a ball in $S^3$, and the
boundary of each $t_i$ ($1 \leq i \leq N$) is composed of 4 “triangles”. By subdividing
each $t_i$, we can construct a q-cell $q_i$ whose faces are exactly the triangles
bounding $t_i$, and such that $\hat{q}_i = t_i$. Then $q_1, \ldots, q_N$ is the required q-cell
partition. QED


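THEOREM 2.150 Every tuple in $\mathrm{ROP}(\mathbb{R}^3)$ satisfies some $L_C$-formula which
is topologically complete in $\mathrm{ROP}(\mathbb{R}^3)$ over $\mathbb{R}^3$.


Proof Let $\bar{r}$ be a tuple from $\mathrm{ROP}(\mathbb{R}^3)$. Let $\bar{t}$ and $A$ be as in Lemma 2.149,
and by Lemma 2.148 let $\phi_{\bar{t}}(\bar{z})$ be a formula satisfied by $\bar{t}$ which is topologically
complete in $\mathrm{ROP}(\mathbb{R}^3)$ over $\mathbb{R}^3$. Then the formula $\exists \bar{z}(\phi_{\bar{t}}(\bar{z}) \wedge \bar{x} = \bar{z}A)$, which
is also topologically complete in $\mathrm{ROP}(\mathbb{R}^3)$ over $\mathbb{R}^3$, is satisfied by $\bar{r}$. QED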
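
The role of the Boolean array in Lemma 2.149 and Theorem 2.150 can be sketched on a finite stand-in, in which the cells of the refinement are modelled as disjoint finite sets and the Boolean sum as union. This toy model and the function name are assumptions made for illustration only; the regions of the text are polyhedra, not finite sets.

```python
def apply_boolean_array(cells, A):
    """Recover each original region as the sum (here: union) of selected cells.

    `cells` is the refining tuple t; `A` has one row per cell and one column
    per original region; A[i][k] is truthy when cell i is part of region k.
    """
    n = len(A[0])
    return [
        frozenset().union(*(cell for cell, row in zip(cells, A) if row[k]))
        for k in range(n)
    ]


# Four disjoint cells refining two overlapping regions.
cells = [frozenset({0}), frozenset({1}), frozenset({2}), frozenset({3})]
A = [[1, 0],
     [1, 1],
     [0, 1],
     [0, 0]]
regions = apply_boolean_array(cells, A)
```

In this example `regions` consists of the sets {0, 1} and {1, 2}: the array records which cells make up each original region, which is all the characterizing formula needs in addition to the description of the refinement itself.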





This concludes the main business of this section: the language $L_C$ is
sufficiently expressive that every tuple of polyhedra in $\mathbb{R}^3$ can be characterized up
to the relation of similar situation in $\mathbb{R}^3$ by one of its formulas. Moreover, it is
easy to see that an analogous result must obtain for polyhedra in $S^3$. Of course,
the characterizing formulas for tuples of polyhedra obtained in this section are
much more complicated than the corresponding $L_{c,\leq}$-formulas for tuples of
polygons obtained in Sec. 4.

In Sec. 5, we exploited the high expressive power of $L_{c,\leq}$ in $\mathrm{ROP}(S^2)$ to
obtain an axiomatization of $\mathrm{Th}_{c,\leq}(\mathrm{ROP}(S^2))$, and thence, a formulation of the
conditions under which an arbitrary mereotopology over $S^2$ has the same $L_{c,\leq}$-
theory as $\mathrm{ROP}(S^2)$. The question therefore arises as to whether an analogous
approach is possible for characterizing “reasonable” spatial mereotopologies
using the results of this section. The major disincentive to such an undertaking
is the relative weakness of the requirement of finite decomposability in $S^3$. For
the plane case, the requirement of finite decomposability led very easily to the
existence of $c^0$-partition refinements, which paved the way for an axiomatic
characterization of $\mathrm{Th}_{c,\leq}(\mathrm{ROP}(S^2))$. In the spatial case, by contrast, much
stronger assumptions are needed to guarantee the existence of q-cell partitions.
Thus, while the identification of a standard theory of spatial mereotopology
is certainly conceivable, it is not obvious, at the time of writing, how best to
approach this matter.


7. Model theory 


In Sec. 2, we defined a mereotopology over a topological space $X$ to be a
Boolean sub-algebra $M$ of $\mathrm{RO}(X)$ in which, for all $p \in o \subseteq X$, with $o$ open,
there exists $r \in M$ such that $p \in r \subseteq o$. However, we also promised a purely
intrinsic characterization of such structures, one making no reference to points
or topological spaces. In this section, we fulfil that promise, and (partially)
realize the vision with which we started this chapter, of a reconstruction of
topology where the fundamental objects are not points, but regions.


7.1 Abstract models of mereotopological theories 


We begin by noting some simple facts about mereotopologies over topolog- 
ical spaces of various kinds. 


LEMMA 2.151 Let $M$ be a mereotopology over a topological space $X$, considered
as a structure interpreting the signature $\{C, +, \cdot, -, 0, 1, \leq\}$.
(i) The sentences $\Phi_{CA}$ consisting of the usual axioms of Boolean algebra
together with

$$\forall x\, \neg C(x, 0)$$
$$\forall x\, (x > 0 \to C(x, x))$$
$$\forall x \forall y\, (C(x, y) \to C(y, x))$$
$$\forall x \forall y \forall z\, (C(x, y) \wedge y \leq z \to C(x, z))$$
$$\forall x \forall y \forall z\, (C(x, y + z) \to C(x, y) \vee C(x, z))$$

are all true in $M$. (ii) If $X$ is weakly regular, then the sentence $\varphi_{ext}$ given by

$$\forall x \forall y\, (\forall z\, (C(x, z) \to C(y, z)) \to x \leq y)$$

is true in $M$. (iii) If $X$ is compact and Hausdorff, then the sentence $\varphi_{int}$ given
by

$$\forall x \forall y\, (\neg C(x, y) \to \exists z\, (\neg C(x, -z) \wedge \neg C(y, z)))$$

is true in $M$.

Proof (i) Straightforward. (ii) Lemma 2.22. (iii) Suppose $r, s \in M$ with
$r^- \cap s^- = \emptyset$. Since $X$ is regular, by Lemma 2.23, for each point $p \in r^-$,
there is $r_p \in M$ with $p \in r_p$ and $s^- \subseteq -r_p$. Since the $r_p$ cover $r^-$, and $r^-$
is compact (being a closed subset of a compact space), choose a finite subcover, and
let the sum of this subcover be $t$. Then $r^- \subseteq t$ and $s^- \subseteq -t$. QED
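
The axioms in part (i) can be checked mechanically on a small finite structure. The following is a minimal sketch, not taken from the text: it assumes a hypothetical model whose regions are all sets of four cells in a row, with contact defined by a reflexive, symmetric adjacency relation (the names `CELLS`, `adjacent` and `regions` are illustrative only). It confirms the five contact axioms by brute force, and shows that the extensionality sentence of part (ii) is a genuinely additional requirement, since it fails here.

```python
from itertools import chain, combinations

# Hypothetical finite model (an assumption of this sketch): four cells in a row.
CELLS = (0, 1, 2, 3)

def adjacent(i, j):
    """Reflexive, symmetric adjacency: a cell touches itself and its neighbours."""
    return abs(i - j) <= 1

def regions():
    """All subsets of CELLS, i.e. the elements of the Boolean algebra."""
    subsets = chain.from_iterable(
        combinations(CELLS, k) for k in range(len(CELLS) + 1))
    return [frozenset(s) for s in subsets]

def C(x, y):
    """Contact: some cell of x is adjacent to (or equal to) some cell of y."""
    return any(adjacent(i, j) for i in x for j in y)

def satisfies_contact_axioms(R):
    """Brute-force check of the five contact axioms over the regions R."""
    zero = frozenset()
    ok = all(not C(x, zero) for x in R)
    ok = ok and all(C(x, x) for x in R if x != zero)
    ok = ok and all(C(y, x) for x in R for y in R if C(x, y))
    ok = ok and all(C(x, z) for x in R for y in R for z in R
                    if C(x, y) and y <= z)
    ok = ok and all(C(x, y) or C(x, z) for x in R for y in R for z in R
                    if C(x, y | z))
    return ok

def satisfies_extensionality(R):
    """Check: if everything in contact with x is in contact with y, then x <= y."""
    return all(x <= y for x in R for y in R
               if all(C(y, z) for z in R if C(x, z)))

R = regions()
print(satisfies_contact_axioms(R))   # True
print(satisfies_extensionality(R))   # False
```

Extensionality fails in this model because everything in contact with the cell {0} is also in contact with {1}, although {0} is not part of {1}.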


The three claims in Lemma 2.151 all have converses. Specifically: 


PROPOSITION 2.152 Let $\mathfrak{A}$ be a structure interpreting the signature
$\Sigma = \{C, +, \cdot, -, 0, 1, \leq\}$. (i) If $\mathfrak{A} \models \Phi_{CA}$, then $\mathfrak{A}$ is
isomorphic (as a $\Sigma$-structure) to a mereotopology over a topological space $X$; in
fact, $X$ can always be chosen so as to be semi-regular and $T_0$. (ii) If
$\mathfrak{A} \models \Phi_{CA} \cup \{\varphi_{ext}\}$, then $X$ can be chosen so as to be weakly regular
and $T_1$. (iii) If $\mathfrak{A} \models \Phi_{CA} \cup \{\varphi_{ext}, \varphi_{int}\}$, then $X$ can be chosen
so as to be compact and Hausdorff.


These results first appeared (in equivalent form) in Dimov and Vakarelov,
2006, Düntsch and Winter, 2005 and Roeper, 1997, respectively. In the
literature, structures satisfying $\Phi_{CA}$ are sometimes referred to as contact
algebras, the sentence $\varphi_{ext}$ as the extensionality axiom, and the sentence
$\varphi_{int}$ as the interpolation axiom. Together, Lemma 2.151 and Proposition 2.152 show that
mereotopologies over certain classes of topological spaces can be characterized 
purely intrinsically, without reference to those spaces or the points that make 
them up. We note in passing that Proposition 2.152 speaks of mereotopologies 
over X (Definition 2.5), while the sources cited refer only to dense sub-algebras 
of RO(X). This slight strengthening is immediate from the relevant proofs, 
and improves the match between Lemma 2.151 and Proposition 2.152. For a 
fuller discussion, see Ch. 3. 

Furthermore, it turns out that the topological realizations in Proposition 2.152 
(iii) are, in an important sense, unique. We motivate this result with a simple 
observation. 


LEMMA 2.153 Let $M_i$ be a mereotopology over the topological space $X_i$,
for $i = 1, 2$. Suppose there is a homeomorphism $h : X_1 \to X_2$ which maps
$M_1$ onto $M_2$. Then, for any signature $\Sigma$ of topological primitives, $h$ induces a
structure isomorphism $h : M_1 \simeq_\Sigma M_2$.


Proof Immediate. QED 


The uniqueness of the topological realizations in Proposition 2.152 (iii) takes 
the form of a partial converse of Lemma 2.153: 


THEOREM 2.154 (ROEPER, 1997) Let $M_i$ be a mereotopology over a compact,
Hausdorff space $X_i$ ($i = 1, 2$). Suppose there is a structure isomorphism
$f : M_1 \simeq_C M_2$. Then there exists a homeomorphism $h : X_1 \to X_2$ which
induces $f$; that is, one such that, for all $r \in M_1$, $f(r) = h(r)$.

Thus, every model of $\Phi_{CA} \cup \{\varphi_{ext}, \varphi_{int}\}$ is isomorphic to exactly one
mereotopology over a compact, Hausdorff space (up to homeomorphism). Since this
fact is important for the development here, we present details of the proof.

We assume familiarity with the theory of ultrafilters: for details, see
Koppelberg, 1989, Ch. 1, Sec. 2. In this context, recall that, for $B$ a Boolean
algebra, a filter on $B$ is a set $F \subseteq B$ such that $a, b \in F$ implies $a \cdot b \in F$, and
$a \in F$, $a \leq b \in B$ implies $b \in F$. A filter is proper if it is not the whole of $B$,
or equivalently, if it does not contain 0. A proper filter $U$ is an ultrafilter if it is
maximal under set-inclusion, or equivalently, if $b_1 + b_2 \in U$ implies $b_1 \in U$ or
$b_2 \in U$. The following result is standard (Koppelberg, 1989, Ch. 1, 2.16).


PROPOSITION 2.155 (PRIME IDEAL THEOREM) Any proper filter on a 
Boolean algebra can be extended to an ultrafilter. 
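
For a finite Boolean algebra these notions can be enumerated directly. The sketch below is an illustration under stated assumptions, not part of the source: it takes the hypothetical algebra of all subsets of a three-element set (`ATOMS`), lists its proper filters, and checks that the maximal ones coincide with the prime ones and that every proper filter extends to an ultrafilter. Filters are additionally required to be non-empty, as is standard.

```python
from itertools import chain, combinations

# Hypothetical example: the Boolean algebra of all subsets of three atoms.
ATOMS = (0, 1, 2)

def powerset(items):
    """All subsets of `items`, as frozensets."""
    items = list(items)
    subsets = chain.from_iterable(
        combinations(items, k) for k in range(len(items) + 1))
    return [frozenset(s) for s in subsets]

B = powerset(ATOMS)

def is_filter(F):
    """Non-empty, closed under meets (intersection), and upward closed in B."""
    if not F:
        return False
    meets = all((a & b) in F for a in F for b in F)
    upward = all(b in F for a in F for b in B if a <= b)
    return meets and upward

def is_proper(F):
    """A filter is proper exactly when it omits the zero element."""
    return frozenset() not in F

def is_prime(F):
    """b1 + b2 in F implies b1 in F or b2 in F (sum is union here)."""
    return all(b1 in F or b2 in F for b1 in B for b2 in B if (b1 | b2) in F)

proper_filters = [F for F in powerset(B) if is_filter(F) and is_proper(F)]
# Ultrafilters: proper filters that are maximal under set-inclusion.
ultrafilters = [F for F in proper_filters
                if not any(F < G for G in proper_filters)]
prime_filters = [F for F in proper_filters if is_prime(F)]

print(len(proper_filters), len(ultrafilters))      # 7 3
print(set(prime_filters) == set(ultrafilters))     # True
print(all(any(F <= U for U in ultrafilters)
          for F in proper_filters))                # True
```

In the finite case every ultrafilter is principal (generated by a single atom); the force of the Prime Ideal Theorem lies in infinite algebras, where no such enumeration is available.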


In the following lemmas, let M be a mereotopology over a compact, Hausdorff 
space X. Since a compact, Hausdorff space is normal (and hence regular), 
Lemma 2.23 applies. 


LEMMA 2.156 Let $U$ be an ultrafilter on $M$. Then the set $\bigcap\{r^- \mid r \in U\}$ is a
singleton. We denote the member of this set by $p_U$ and say that $U$ converges to
$p_U$.


Proof We first show that $\bigcap\{u^- \mid u \in U\}$ contains at least one point. For
otherwise, $\bigcup\{X \setminus u^- \mid u \in U\} = X$, whence $\{-u \mid u \in U\}$ covers $X$. By
compactness of $X$, let $-u_1, \ldots, -u_n$ be a finite subcover. Then
$-u_1 + \cdots + -u_n = 1$; i.e. $u_1 \cdot \cdots \cdot u_n = 0 \in U$, contradicting the fact that $U$
is proper. Next we show that $\bigcap\{u^- \mid u \in U\}$ contains at most one point. For
suppose $p$, $q$ are distinct points of $X$. By Lemma 2.23, there exists $r \in M$ such
that $p \in r$ and $q \in -r$. Hence $p \notin (-r)^-$ and $q \notin r^-$. Since $U$ is an ultrafilter,
either $r$ or $-r$ is in $U$, so that either $p$ or $q$ is not in $\bigcap\{u^- \mid u \in U\}$. QED


LEMMA 2.157 Let $U$ be an ultrafilter on $M$, and let $r \in M$. If $p_U \in r$, then
there exists $s \in U$ such that $p_U \in s$ and $s^- \subseteq r$. Hence also, $r \in U$.


Proof Suppose $p_U \in r \in M$. Then $p_U \notin (-r)^-$, and by Lemma 2.23, there
exists $s \in M$ such that $p_U \in s$ and $s^- \subseteq r$. But since $p_U \notin (-s)^-$ we have
$-s \notin U$, and thus $s \in U$. QED


DEFINITION 2.158 If $U$ and $V$ are ultrafilters on $M$, we say $U$ and $V$ are
contacting if $r^- \cap s^- \neq \emptyset$ for all $r \in U$ and $s \in V$.


LEMMA 2.159 If $U$ and $V$ are ultrafilters on $M$, then $U$ and $V$ are contacting
if and only if $p_U = p_V$.

Proof The if-direction is trivial. For the only-if direction, suppose that
$p_U \neq p_V$. By Lemma 2.23, there exist $r, s \in M$ such that $p_U \in r$, $p_V \in s$ and
$r^- \cap s^- = \emptyset$. By Lemma 2.157, $r \in U$, $s \in V$, so that $U$ and $V$ are not
contacting. QED


LEMMA 2.160 Let $M_1$ and $M_2$ be mereotopologies over weakly regular
topological spaces, let $f : M_1 \simeq_C M_2$ be an isomorphism, and let $U$ and $V$ be
contacting ultrafilters on $M_1$. Then $f(U)$ and $f(V)$ are contacting ultrafilters
on $M_2$.


Proof Almost immediate given the definability of $\leq$ in terms of $C$
(Lemma 2.22). QED


LEMMA 2.161 Let $M_1$ and $M_2$ be mereotopologies over regular topological
spaces, such that $f : M_1 \simeq_C M_2$. Let $r \in M_1$ and let $U$ be an ultrafilter on
$M_1$ with $p_U \in r$. Then $p_{f(U)} \in f(r)$.


Proof By Lemma 2.157, there exists $s \in U$ such that $p_U \in s$ and $s^- \subseteq r$,
so that $s^- \cap (-r)^- = \emptyset$. Since $f$ is also a Boolean algebra isomorphism,
$f(s)^- \cap (-f(r))^- = \emptyset$, i.e. $f(s)^- \subseteq f(r)$. Since $f(s) \in f(U)$,
$p_{f(U)} \in f(s)^- \subseteq f(r)$. QED


Proof [Theorem 2.154] Suppose that $f : M_1 \simeq_C M_2$. Define the map $h$ by
$h(p_U) = p_{f(U)}$, for $U$ an ultrafilter on $M_1$. We show: (i) $h$ is well-defined
and 1-1, (ii) the domain of $h$ is the whole of $X_1$ and the range of $h$ is the
whole of $X_2$, (iii) for all $r \in M_1$, $f(r) = h(r)$, and for all $s \in M_2$,
$f^{-1}(s) = h^{-1}(s)$, and (iv) $h$ and $h^{-1}$ are continuous.

To prove (i), let $U$, $V$ be ultrafilters on $M_1$, both converging to $p$. By
Lemma 2.159, $U$ and $V$ are contacting. By Lemma 2.160, the isomorphism $f$ maps
contacting ultrafilters to contacting ultrafilters, so that, by Lemma 2.159 again,
$p_{f(U)} = p_{f(V)}$. Hence, $h$ is well-defined. Applying the same reasoning to
$f^{-1}$, $h$ is 1-1.

To prove (ii), let $p \in X_1$. Then $\{r \in M_1 \mid p \in r\}$ is a proper filter on $M_1$,
and by Proposition 2.155, this filter can be extended to an ultrafilter $U$ on $M_1$.
By Lemma 2.156, $U$ converges to some point $p_U$. Since $X_1$ is Hausdorff, $p = p_U$.
Thus, the domain of $h$ is the whole of $X_1$. Similarly, if $q \in X_2$, we have an
ultrafilter $V$ on $M_2$ such that $q = p_V$. Thus
$q = p_V = p_{f(f^{-1}(V))} = h(p_{f^{-1}(V)})$, so that the range of $h$ is the whole of $X_2$.

To prove (iii), let $p_V \in f(r)$ with $V$ an ultrafilter on $M_2$. By Lemma 2.161,
$p_{f^{-1}(V)} \in r$. Hence, $p_V = h(p_{f^{-1}(V)}) \in h(r)$. Conversely, let $p_V \in h(r)$.
By the definition of $h$, $p_{f^{-1}(V)} \in r$, and by Lemma 2.161, $p_V \in f(r)$. Hence
$f(r) = h(r)$. Now if $s \in M_2$, $f^{-1}(s) \in M_1$, so, applying the results just
obtained to this set, we have
$f^{-1}(s) = h^{-1}(h(f^{-1}(s))) = h^{-1}(f(f^{-1}(s))) = h^{-1}(s)$.

(iv) Let $u \subseteq X_1$ be an open set. Since $M_1$ is a mereotopology, for each point
$p \in u$, there exists $r_p \in M_1$ with $p \in r_p \subseteq u$. Thus the set
$U = \{r_p \in M_1 \mid p \in u\}$ satisfies $\bigcup U = u$. Then
$h(u) = h(\bigcup U) = \bigcup_{r \in U} h(r) = \bigcup_{r \in U} f(r)$ is a union of open sets in $X_2$
and hence is itself an open set in $X_2$. Therefore, $h^{-1}$ is continuous. By
substituting $h^{-1}$ for $h$ (and $f^{-1}$ for $f$) and repeating the argument, $h$ is
continuous. QED


7.2 Abstract models of geometrical mereotopological 
theories 


We have shown that mereotopologies over certain classes of topological 
spaces can be characterized in terms of certain first-order sentences which they 
make true. But what of specific mereotopologies of interest—for instance, 
those defined over the open or closed plane? This is the topic we now address, 
based on the results of Pratt and Lemon, 1997. 

We employ standard results on prime models: for details, see Chang and 
Keisler, 1990, Ch. 2. A structure $\mathfrak{A}$ is said to be a prime model if it is
elementarily embeddable in any model elementarily equivalent to it. Prime models
are considered the “simplest” or “smallest” models of their theories, a view
which is justified by the following proposition (Chang and Keisler, 1990,
Theorem 2.3.3). In the sequel, all signatures are silently assumed to be countable.


PROPOSITION 2.162 Elementarily equivalent prime models are isomorphic. 


The following notion is closely related to that of primeness. A formula $\varphi$ is
said to be complete with respect to a theory $T$ if, for all formulas $\theta$ having the
same free variables as $\varphi$, exactly one of $T \models \varphi \to \theta$ or $T \models \varphi \to \neg\theta$ holds. A
structure $\mathfrak{A}$ is said to be atomic if any $n$-tuple $\bar{a}$ in $A$ satisfies a formula
$\varphi(\bar{x})$ in $\mathfrak{A}$ such that $\varphi$ is complete with respect to $\mathrm{Th}(\mathfrak{A})$. We have the
following standard result (see, for example, Chang and Keisler, 1990, Theorem 2.3.4).





PROPOSITION 2.163 A structure is countable atomic if and only if it is a 
prime model. 


Recall the concepts of topologically complete formula and homogeneous mer- 
eotopology given in Definitions 2.51 and 2.90, respectively. 


LEMMA 2.164 Let $M$ be a homogeneous mereotopology over a topological
space $X$, and let $\Sigma$ be a signature of topological primitives. If $\varphi \in \mathcal{L}_\Sigma$ is
topologically complete in $M$ over $X$, then $\varphi$ is complete with respect to
$\mathrm{Th}_\Sigma(M)$.


Proof Immediate from Lemma 2.92. QED 


Theorem 2.178 below is a partial converse of this result.

For the next theorem, recall that ROQ($S^2$) is the rational polygonal
mereotopology over the closed plane, and that its $\mathcal{L}_{c,\leq}$-theory is $T_{c,\leq}$, the
standard $\mathcal{L}_{c,\leq}$-theory of closed plane mereotopology, which we axiomatized in
Sec. 5. Recall further that $\psi_N(\bar{z})$ is the $\mathcal{L}_{c,\leq}$-formula stating that $\bar{z}$ forms
a c?-partition, employed in the proof of Theorem 2.85.


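THEOREM 2.165 The mereotopology ROQ($S^2$), considered as a $\{c, \leq\}$-structure,
is a prime model of $T_{c,\leq}$. In fact, for any $N$, there exist formulas
$\gamma_1(\bar{z}), \ldots, \gamma_K(\bar{z})$ (with $K$ depending on $N$), complete with respect to
$T_{c,\leq}$, such that

$$T_{c,\leq} \models \forall \bar{z}\, (\psi_N(\bar{z}) \to (\gamma_1(\bar{z}) \vee \cdots \vee \gamma_K(\bar{z}))).$$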


Proof The first part of the theorem is immediate from Theorem 2.85 and
Lemma 2.164. For the second part, observe that, for a given $N$, there are only
finitely many neighbourhood structures on an $N$-element c?-partition, each one
giving rise to a topologically complete formula of the form

$$\nu(\bar{z}) \wedge \psi_N(\bar{z}),$$

as described in the proof of Theorem 2.85. QED


Note that, by Lemma 2.38, ROQ($S^2$) and ROQ($\mathbb{R}^2$) are the same
$\{c, \leq\}$-structure, so we could replace $S^2$ in Theorem 2.165 by $\mathbb{R}^2$.
Similarly, we have


THEOREM 2.166 The mereotopologies ROQ($\mathbb{R}^2$) and ROQ($S^2$), considered
as $\{C\}$-structures, are prime models.


Proof As for Theorem 2.165, but using Theorem 2.89 and Corollary 2.86, 
respectively. QED 


Analogues of Theorem 2.166 hold in three dimensions, of course. For example, 
we have: 


THEOREM 2.167 The mereotopology ROQ($\mathbb{R}^3$) is a prime model of the
$\mathcal{L}_C$-theory of ROP($\mathbb{R}^3$).


The proof strategy is essentially identical to the plane case, using Theorem 2.150.
Note, however, that much more care is required to show that the topologically
complete formulas identified in Theorem 2.150 are complete with respect to the
$\mathcal{L}_C$-theory of ROP($\mathbb{R}^3$). We leave the details to the interested reader.

Returning to mereotopologies over $S^2$, the question then arises as to whether
ROQ($S^2$) is strictly simplest among countable models of $T_{c,\leq}$, in that there
are countable models of that theory not isomorphic to ROQ($S^2$). The answer
is: yes and no. Recall that a theory is said to be $\omega$-categorical if it has exactly
one countable model up to isomorphism. Recall also that a type in variables
$\bar{x} = x_1, \ldots, x_n$ is a maximal consistent set of formulas whose free variables
are among the $x_1, \ldots, x_n$, and that a theory $T$ is said to have a type $\Phi(\bar{x})$ if
$\Phi(\bar{x})$ is consistent with $T$. The following result is standard (see, for example,
Chang and Keisler, 1990, Theorem 2.3.13).


PROPOSITION 2.168 Let $T$ be a complete theory. Then $T$ is $\omega$-categorical if
and only if, for each $n$, $T$ has only finitely many types in $x_1, \ldots, x_n$.


THEOREM 2.169 $T_{c,\leq}$ is not $\omega$-categorical.


Proof By Proposition 2.168, it suffices to prove that $T_{c,\leq}$ has infinitely many
types in the single variable $x$. It is easy to see that, for every positive integer
$m$, the formula $\gamma_m(x)$

$$\exists z_1 \cdots \exists z_m \Big( \bigwedge_{1 \leq i \leq m} (c(z_i) \wedge z_i \neq 0) \;\wedge \bigwedge_{1 \leq i < j \leq m} \neg c(z_i + z_j) \;\wedge\; x = \sum_{1 \leq i \leq m} z_i \Big)$$

is satisfied in ROQ($S^2$) by all and only those regions having exactly $m$
components. Hence, the $\gamma_m(x)$ are all satisfied in ROQ($S^2$); so each can be extended
to a type $\Gamma_m(x)$ of $\mathrm{Th}_{c,\leq}$(ROQ($S^2$)). But the $\gamma_m(x)$ are also pairwise mutually
exclusive in $T_{c,\leq}$; so no two of them can be extended to the same type.
Hence, $T_{c,\leq}$ has infinitely many types in $x$. QED
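
The component-counting formula in this proof asks for $m$ non-zero connected regions whose pairwise sums are disconnected and whose total sum is $x$. As a rough illustration only, the sketch below evaluates that condition by brute force in a small finite model that is an assumption of this example, not the polygonal mereotopology of the text: regions are sets of five cells in a row, sum is union, and connectedness is graph connectedness (`CELLS`, `adjacent`, `components` and `gamma` are hypothetical names). In this toy model the condition holds of a region exactly when it has $m$ components.

```python
from itertools import chain, combinations

# Hypothetical toy model (an assumption of this sketch): five cells in a row.
CELLS = (0, 1, 2, 3, 4)

def adjacent(i, j):
    """Two distinct cells touch when they are neighbours in the row."""
    return abs(i - j) == 1

def nonempty_subsets(x):
    """All non-zero regions contained in x."""
    items = sorted(x)
    subsets = chain.from_iterable(
        combinations(items, k) for k in range(1, len(items) + 1))
    return [frozenset(s) for s in subsets]

def components(x):
    """Connected components of a set of cells under `adjacent`."""
    remaining, comps = set(x), []
    while remaining:
        stack = [remaining.pop()]
        comp = set(stack)
        while stack:
            i = stack.pop()
            for j in [j for j in remaining if adjacent(i, j)]:
                remaining.remove(j)
                comp.add(j)
                stack.append(j)
        comps.append(frozenset(comp))
    return comps

def c(x):
    """Connectedness predicate; the empty region counts as connected."""
    return len(components(x)) <= 1

def gamma(m, x):
    """Brute-force evaluation of the component-counting condition at x."""
    parts = [z for z in nonempty_subsets(x) if c(z)]
    for zs in combinations(parts, m):
        pairwise_disconnected = all(
            not c(a | b) for a, b in combinations(zs, 2))
        if pairwise_disconnected and frozenset().union(*zs) == x:
            return True
    return False

regions = [frozenset()] + nonempty_subsets(CELLS)
agree = all(gamma(m, x) == (len(components(x)) == m)
            for x in regions for m in (1, 2, 3))
print(agree)  # True
```

Only witnesses contained in $x$ and pairwise distinct are searched, which loses nothing: the sum must equal $x$, and two equal connected witnesses would have a connected sum.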


On the other hand, it turns out that $T_{c,\leq}$ is almost countably categorical, in
the following sense. Note that, since any model of $T_{c,\leq}$ is a Boolean algebra
interpreting the predicate $c$, we may employ the terminology introduced at the
start of Sec. 4.1.


THEOREM 2.170 All countable finitely decomposable models of $T_{c,\leq}$ are
isomorphic.


Proof Let $\mathfrak{A} \models T_{c,\leq}$ be countable and finitely decomposable. By Claims 2.54
and 2.63, every tuple from $A$ can be refined to a c?-partition. Theorem 2.165 then
implies that $\mathfrak{A}$ is atomic and hence, being countable, prime (Proposition 2.163).
The result follows by Proposition 2.162. QED





The above results show that, while specific mereotopologies such as ROS($S^2$)
cannot be characterized in terms of the first-order sentences which they make
true, they almost can. Specifically, we have the following abstract
characterization of the mereotopology ROQ($S^2$).


COROLLARY 2.171 If $\mathfrak{A}$ is a countable, finitely decomposable model of
Axioms 1–8 in Sec. 5.1, then $\mathfrak{A}$ is isomorphic (as a $\{c, \leq\}$-structure) to the
mereotopology ROQ($S^2$).

Proof Theorem 2.170 and the fact that, by Theorem 2.101, any finitely
decomposable model $\mathfrak{A}$ of Axioms 1–8 is elementarily equivalent to ROQ($S^2$). QED


7.3 Loose ends 


We end this section with some matters touched on earlier in this chapter. We 
continue to assume all signatures to be countable. The following proposition is 
a special case of the Löwenheim-Skolem Theorem (see, for example, Hodges, 
1993, p. 90). 


PROPOSITION 2.172 Let $\mathfrak{A}$ be a $\Sigma$-structure and $Z$ a countable subset of $A$.
Then $\mathfrak{A}$ has a countable elementary submodel whose domain includes $Z$.


Recall that a topological space X is said to be second countable if its topology 
has a countable basis. 


LEMMA 2.173 Let $M$ be a mereotopology over a compact, second-countable,
Hausdorff space $X$, and let $P \subseteq M$ be countable. Then there is a countable
mereotopology $Q$ over $X$ such that $P \subseteq Q$ and $Q \preceq M$.


Proof We construct a countable subset $P' \subseteq M$ such that, for all $p \in o \subseteq X$
with $o$ open, there exists $r \in P'$ such that $p \in r \subseteq o$. The lemma then follows
from Proposition 2.172 by putting $\mathfrak{A} = M$ and $Z = P \cup P'$. Let $B$ be a
countable basis for the topology on $X$. For any $b, c \in B$ with $b^- \subseteq c$, take
a cover of $b^-$ by elements $s \in M$ such that $s \subseteq c$ (possible because $M$ is
a mereotopology), choose a finite subcover (possible because $X$ is compact),
and let $r_{b,c}$ be the sum, in $M$, of the elements of this finite subcover. Certainly,
$b \subseteq r_{b,c} \subseteq c$. Let $P' = \{r_{b,c} \mid b, c \in B,\ b^- \subseteq c\}$. Since $X$ is normal, for all
$p \in o \subseteq X$ with $o$ open, we can find $b, c \in B$ with $p \in b$, $b^- \subseteq c$ and $c \subseteq o$.
But then $p \in r_{b,c} \subseteq o$ as required. QED


Note that Lemma 2.173 holds for all (countable) signatures. 
We may now derive the promised strengthening of Corollary 2.118. 


COROLLARY 2.174 All splittable, finitely decomposable mereotopologies
over $S^2$ with curve-selection have the same $\mathcal{L}_\Sigma$-theory for any topological
signature $\Sigma$.


Proof Let $M_1$, $M_2$ be two such mereotopologies. Extend the signature $\Sigma$ if
necessary so that it contains the predicates $C$, $c$ and $\leq$, and expand $M_1$ and
$M_2$ by interpreting these predicates in the normal way. By Lemma 2.173, let
$Q_i$ be a countable mereotopology over $S^2$ such that $Q_i \preceq_\Sigma M_i$, for $i = 1, 2$.
Thus, $Q_1$ and $Q_2$ are splittable, finitely decomposable mereotopologies over $S^2$
having curve-selection. By Corollary 2.118, $Q_1 \equiv_{c,\leq} Q_2$. By Theorem 2.170,
$Q_1 \simeq_{c,\leq} Q_2$. By Lemma 2.49, $Q_1 \simeq_C Q_2$. By Theorem 2.154, there is a
homeomorphism mapping $Q_1$ onto $Q_2$. Finally, by Lemma 2.153, $Q_1 \simeq_\Sigma Q_2$,
whence $M_1 \equiv_\Sigma M_2$. QED


Recall from Definition 2.97 that, if $\Sigma$ is a signature of topological primitives,
$T_\Sigma$ denotes $\mathrm{Th}_\Sigma$(ROS($S^2$)). By Corollary 2.174, $T_\Sigma$ is the $\mathcal{L}_\Sigma$-theory of
any splittable, finitely decomposable mereotopology over $S^2$ having
curve-selection. This justifies our decision to call it the standard $\mathcal{L}_\Sigma$-theory of
closed plane mereotopology.

Theorem 2.170 now has the following corollaries. 


COROLLARY 2.175 Let $M$ be a countable, finitely decomposable mereotopology
over a locally connected, compact, Hausdorff space $X$, such that
$\mathrm{Th}_C(M) = T_C$. Then there is a homeomorphism $h : X \to S^2$ taking $M$ to
ROQ($S^2$).


Proof By Lemmas 2.22 and 2.27, $\mathrm{Th}_{C,c,\leq}(M) = T_{C,c,\leq}$. By Theorem 2.170,
$M \simeq_{c,\leq}$ ROQ($S^2$). But, by Lemma 2.49, $T_{C,c,\leq}$ contains a formula defining
$C$ explicitly in terms of $c$ and $\leq$. Hence $M \simeq_C$ ROQ($S^2$). Now apply
Theorem 2.154. QED


COROLLARY 2.176 Let $M$ be a finitely decomposable mereotopology over a
locally connected, second countable, compact, Hausdorff space $X$, such that
$\mathrm{Th}_C(M) = T_C$. Then $X$ is homeomorphic to $S^2$.


Proof Apply Lemma 2.173 to obtain a countable mereotopology $Q$ over $X$
with $Q \preceq M$ and proceed as for Corollary 2.175. QED


We remark that there is no prospect of removing the compactness condition
from the above corollaries. For example, let $p_\pi$ be, say, the point of $S^2$ with
coordinates $(0, \pi)$, and consider the topological space $X = S^2 \setminus \{p_\pi\}$ and the
mereotopology $M$ over $X$ given by $M = \{r \setminus \{p_\pi\} \mid r \in$ ROQ($S^2$)$\}$. Then
ROQ($S^2$) $\simeq_{C,c,\leq} M$; but $S^2$ and $X$ are not homeomorphic.

A further consequence of Theorem 2.154 is the promised partial converse of 
Lemma 2.164. We require the following fact about prime models. 


LEMMA 2.177 Let $\mathfrak{A}$ be a countable, atomic model and let $\bar{a}$, $\bar{b}$ be tuples from
$A$ which satisfy the same formulas in $\mathfrak{A}$. Then there is an automorphism of $\mathfrak{A}$
taking $\bar{a}$ to $\bar{b}$.


Proof Almost immediate from Proposition 2.162, by adding a tuple of
individual constants to stand alternatively for $\bar{a}$ and $\bar{b}$. QED


THEOREM 2.178 Let $M$ be a mereotopology over a compact, second-countable
Hausdorff space $X$, and let $\Sigma$ be a signature of topological primitives such that
$C$ (contact) is first-order definable over $M$. If every tuple from $M$ satisfies an
$\mathcal{L}_\Sigma$-formula which is complete with respect to $\mathrm{Th}_\Sigma(M)$, then that
$\mathcal{L}_\Sigma$-formula is topologically complete in $M$ over $X$.


Proof Let $\varphi$ be complete with respect to $\mathrm{Th}_\Sigma(M)$, and suppose that
$M \models \varphi[\bar{r}]$, $M \models \varphi[\bar{s}]$. We must show that $\bar{r}$ and $\bar{s}$ are similarly situated
in $X$. By Lemma 2.173, let $M'$ be a countable mereotopology over $X$ containing the
tuples $\bar{r}$ and $\bar{s}$, such that $M' \preceq M$. Thus, $M'$ is countable and atomic, and
$\varphi$ is a complete formula with respect to $\mathrm{Th}_\Sigma(M')$ satisfied by both $\bar{r}$ and $\bar{s}$.
By Lemma 2.177, there exists an automorphism $f : M' \simeq_\Sigma M'$ such that
$f(\bar{r}) = \bar{s}$. Then, by Theorem 2.154, there is a homeomorphism $h : X \to X$
taking $\bar{r}$ to $\bar{s}$. QED





Lemma 2.164 and Theorem 2.178 establish the close connection between the 
notions of topological completeness with respect to a topological space and 
completeness with respect to a mereotopological theory. 


8. Philosophical considerations 


The earliest modern work on region-based theories of space is that of 
Whitehead and de Laguna (Whitehead, 1919; Whitehead, 1920; Whitehead, 
1929; de Laguna, 1922a; de Laguna, 1922b; de Laguna, 1922c). Both authors 
propose a system of postulates governing a small collection of primitive spatial 
relations, together with reconstructions of familiar spatial concepts in terms of 
those relations. The postulates serve implicitly to define the primitive relations 
they constrain (and perhaps the domain of entities over which they quantify), 
while the reconstructions of familiar spatial concepts connect the whole system 
to the data of spatial experience. To be sure, both Whitehead and de Laguna 
motivate their postulates by providing informal interpretations for their respec- 
tive spatial primitives. Thus, for example, Whitehead illustrates his relation of 
extensive connection (as he calls it) using diagrams suggesting that two regions 
are extensively connected just in case their topological closures share a point 
in common (this is the interpretation given to the binary predicate C in this 
chapter). However, such explanations are intended only as a heuristic guide. 
Officially, spatial primitives acquire their content solely from the entire system
of postulates in which they participate. Primitives, by definition, are not explicitly
definable. 

The inspiration for such systems was presumably the axiomatic treatment of 
geometry found in Euclid (and latterly Hilbert); and the motivation for carrying 
out the procedure on a purely region-based footing seems, for both authors, to 
have been a certain disquiet about the empirical distance between the concept 
of a point as a primitive geometrical entity and the character of everyday spatial 
experience. The great difficulty of this approach, of course, is the problem of 
evaluating the system of postulates and conceptual reconstructions proposed.
Whitehead’s system has thirty-one postulates (or assumptions, in Whitehead’s
terminology) and a similar number of definitions. De Laguna’s system, though 
far tidier, is also hardly self-evident. The only obvious sources of justification 
for such systems are their ability to chime with our pre-theoretic intuition and 
their eventual integration into a larger, empirically successful, physical theory. 
Neither source is very satisfactory. On the one hand, as we have seen in this 
chapter, almost any collection of spatial primitives enables us to write down 
propositions on which pre-theoretic intuition cannot be expected to return a 
reliable verdict. On the other hand, although empirical confirmation of a general 
physical theory must provide some support for the account of space it contains, 
the size of the undertaking and the difficulty of assigning credit when theories 
perform well (or blame when they perform badly) means that there is little 
practical prospect of any such justification for such systems of postulates and 
conceptual reconstructions. 

An alternative approach to developing a region-based theory of space is 
illustrated by Tarski’s Geometry of Solids (Tarski, 1956). Tarski too develops 
a geometry in which regions, not points, are the primitive objects; however, 
in contrast to Whitehead and de Laguna, he does not build his theory by
writing a collection of plausible-looking, but unprovable, axioms. Rather,
beginning with the familiar model of space as $\mathbb{R}^3$, he considers a formal language
whose variables range over the set of spheres in $\mathbb{R}^3$ (defined in the standard
way), and whose sole non-logical constant is the part-whole relation (again,
way), and whose sole non-logical constant is the part-whole relation (again, 
defined in the standard way). Because the “primitives” in Tarski’s Geometry 
of Solids are well-defined mathematical objects and relations, the question of 
what postulates they satisfy is a well-defined mathematical problem, not a mat- 
ter for intuition or experiment. And because many familiar spatial concepts 
have rational reconstructions in terms of the standard model, the question of 
how, if at all, these concepts can be expressed using formulas of Tarski’s lan- 
guage is again a purely mathematical affair. Having thus specified the structure 
under consideration and the language used to describe it, Tarski then goes on 
to examine the kinds of logical issues that should by now be familiar to us. In 
fact, Tarski obtains a system of axioms (in higher-order logic) for which the 
standard Euclidean interpretation is, up to isomorphism, the only model. 

This alternative approach is, in contrast to the “postulationist” strategy of 
Whitehead and de Laguna, conservative and rationalist: conservative, because 
no attempt is made to build systems of axioms and definitions from the ground 
up; rationalist, because the appropriateness of the resulting region-based theo- 
ries is secured by means of their logical relations to point-based models whose 
usefulness as representations of the space we inhabit—at least approximately 
and for mesoscopic objects—is anyway beyond doubt. It is this approach that 
we have taken in this chapter. Latterly, region-based theories of space have 
increased in popularity, following the seminal work of Clarke, 1981, Clarke,
1985, Biacino and Gerla, 1991, Randell et al., 1992, Gotts et al., 1996 and Renz
and Nebel, 1997. One reason for this resurgence of interest, particularly within 
the AI community, is the requirement to quantify over spatial regions without 
leaving the realm of first-order logic. The technology of theorem-proving for 
first-order logic is more highly developed than for higher-order logics; and, 
more generally, formalisms with limited expressive power enjoy a premium in 
AI if they give rise to entailment and satisfiability problems which have
(theoretically or practically) efficient algorithmic solutions. Insofar as the study of
region-based theories of space is motivated by computational considerations, 
the best approach to developing and analysing such theories is surely that of 
Tarski, not that of Whitehead. 

These matters notwithstanding, the principal outcome of the investigation 
undertaken here is just how much information it gives us about the possibilities 
for developing a truly region-based theory of space, along the lines apparently 
envisaged by Whitehead and de Laguna. Consider, for example, the issue of the 
“correct” set of postulates. True, Examples 2.17 and 2.18 show that different 
mereotopologies defined over the spaces RO($\mathbb{R}^n$) indeed have different
first-order theories. Nevertheless, the discussion of Sec. 5 shows that the choices
on offer are much more limited than these examples might initially lead one
to suppose. In particular, all finitely decomposable, splittable mereotopologies
over $S^2$ having curve-selection have identical $\mathcal{L}_\Sigma$-theories, for any signature
$\Sigma$ of topological primitives. We proposed that this common $\mathcal{L}_\Sigma$-theory should
therefore be regarded as standard.

Or take again the issue of reconstructing familiar spatial concepts in terms 
of a chosen collection of primitives. We have seen that first-order topological 
languages interpreted over well-behaved mereotopologies have surprising— 
but not unlimited—expressive power. In particular, we provided formulas 
expressing a variety of familiar spatial relationships (as defined by their usual 
point-based definitions, of course) over a wide range of mereotopologies. In 
addition, we showed that the first-order language $\mathcal{L}_{c,\leq}$ is sufficiently expressive
that every tuple of polygons in $S^2$ can be characterized up to similar situation by
one of its formulas, and that the first-order language $\mathcal{L}_C$ is sufficiently expressive
that every tuple of polygons in $\mathbb{R}^2$ and every tuple of polyhedra in $\mathbb{R}^3$ can be
characterized up to similar situation by one of its formulas.

Most striking of all, however, is what the foregoing analysis tells us about 
the view of space to which any first-order mereotopological theory commits us. 
While almost all interesting mereotopologies have first-order theories which 
are not categorical in any infinite cardinal, we nevertheless showed that the 
plane mereotopology ROQ($S^2$) and the spatial mereotopology ROQ($S^3$) are
prime models of their first-order theories over standard signatures of
topological primitives. We further showed that ROQ($S^2$) is, up to isomorphism,
the only countable, finitely decomposable model of its $\mathcal{L}_{c,\leq}$-theory; and we
remarked that a corresponding observation (albeit with a more complex version
of the finite decomposability condition) must apply in three dimensions
as well. Finally, we showed that mereotopologies over compact, Hausdorff
spaces, regarded as structures interpreting suitably rich topological signatures, 
determine their underlying spaces up to homeomorphism. In conclusion, the 
logical possibilities for region-based topological theories of space are more 
constrained than their earliest proponents might perhaps have thought.


1. Introduction


This work explores the interconnections between a number of different per- 
spectives on the formalisation of space. We begin with an informal discussion 
of the intuitions that motivate these formal representations. 


1.1 Axioms vs algebras


Axiomatic theories provide a very general means for specifying the logical 
properties of formal concepts. From the axiomatic point of view, it is symbolic 
formulae and the logical relations between them—especially the entailment 
relation—that form the primary subject of interest. The vocabulary of concepts 
of any theory can be interpreted in terms of a domain of entities, which exem- 
plify properties, relations and functional mappings corresponding to the formal 
symbols of the theory. Moreover, by interpreting logical operations as functions 
of these semantic denotations, such an interpretation enables us to evaluate the 
truth of any logical formula built from these symbols. An interpretation is said 
to satisfy, or be a model of a theory, if all the axioms of the theory are true 
according to this evaluation.

In general, an axiomatic theory can have many different models exhibiting
diverse structural properties. However, in formulating a logical theory, we 
will normally be interested in characterising a particular domain and a number 
of particular properties, relations and/or functions that describe the structure 
of that domain; or, more generally, we may wish to characterise a family of 
domains that exhibit common structural features, and which can be described 
by the same conceptual vocabulary. 

From the algebraic perspective, it is the domain of objects and its structure 
that form the primary subject of investigation. Here again, we may be inter- 
ested in a specific set of objects and its structure, or a family of object sets 
exemplifying shared structural features. And the nature of the structure will 
be described in terms of properties, relations and functions of the objects. To 
specify a particular structure or family of structures, one will normally give an 
axiomatic theory formulated in terms of this vocabulary, such that the algebraic 
structures under investigation may be identified with the models of the theory. 

Hence, axiom systems and algebras are intimately related and complemen- 
tary views of a conceptual system. The axiomatic viewpoint characterises the 
meanings of concepts in terms of propositions involving those concepts, whereas 
the algebraic viewpoint exemplifies these meanings in terms of a set of objects 
and mappings among them. Moreover, the models of axiomatic theories can 
be regarded as algebras, and conversely algebras may be characterised by ax- 
iomatic theories. 

Having said this, the two perspectives lead to different emphasis in the way a 
conceptual system is articulated. If one starts from axiomatic propositions, one 
tends to focus on relational concepts (formalised as predicates), whereas, if one 
starts from objects and structures, the focus tends to be on functional concepts 
corresponding to mappings between the objects. Indeed, the term ‘algebra’ is 
sometimes reserved for structures that may be characterised without employing 
any relational concept apart from the logical equality relation. And the most 
typical algebras are those specified purely by means of universally quantified 
equations holding between functional terms. 


1.2 Representing space 


1.2.1 Classical approaches. Our modern appreciation of space is very 
much conditioned by mathematical representations. In particular, the insights 
into spatial structure given to us by Euclid and Descartes are deeply ingrained 
in our understanding. 

Euclid described space in terms of several distinct categories of geometrical 
object. These include points, lines and surfaces as well as angles and plane 
figures. These entities may be said to satisfy a number of basic properties (e.g. 
lines may be straight and surfaces may be planar) and relationships (e.g. a 
point may be incident in a line or surface, two lines may meet at a point or 
be inclined at an angle). The nature of space was then characterised by postu- 
lates involving these basic concepts, which were originally stated in ordinary 
language. Euclid proceeded to define many further concepts (such as different 
types of geometrical figure) in terms of the basic vocabulary. 

From Descartes came a numerical interpretation of space, with points in 
an n-dimensional space being associated with n-tuples of numerical values. 
According to this Cartesian model, the basic elements of space are points. Their 
structure and properties can be axiomatised in terms of the metrical relation of 
equidistance (see e.g. Tarski, 1959; Tarski and Givant, 1999), and interpreted 
in terms of numerical coordinates. 

If points are taken as the primary constituents of the universe, lines and 
regions have a derivative status. Two distinct points determine a line, and 
polygonal figures can be represented by a sequence of their vertex points. To 
get a more general notion of ‘region’ we need to refer to more or less arbitrary 
collections of points. This is the representational perspective of classical point- 
set topology. 


1.2.2 Region-based approaches. Although the point-based analysis 
has become the dominant approach to spatial representation, there are a number 
of motivations for taking an alternative view, in which extended regions are 
considered as the primary spatial entities. 

An early exponent of this approach was Alfred North Whitehead, who shared 
with Bertrand Russell the view that an adequate theory of nature should be 
founded on an analysis of sense data, and that elements of perception can 
be the only referents of truly primitive terms. On this basis, Whitehead in 
his book Concept of Nature (Whitehead, 1920) argued that extended regions 
are more fundamental than points: whereas regions may be perceived as the 
spatial correlates of colour patches in the visual field, points cannot be perceived 
directly but are only constructed by cognitive abstraction. 

This motivation shares some common ground with that of Stanislaw 
Lesniewski, who also wanted to bring the theoretical analysis of the world more 
closely in line with phenomenological conceptions of reality, and believed that 
perceiving the integrity of extended objects is basic to our interpretation of the 
world. Mereology, a formal theory of the part-whole relation, was originally 
presented by Lesniewski, 1931 in his own logical calculus, which he called 
Ontology. 

Whitehead identified the relation of connection between two spatial or spatio- 
temporal regions as of particular importance to the phenomenological 
description of reality. In further work he attempted to use this concept as 
the fundamental primitive in a logical theory of space and time. A formal the- 
ory of this relation was presented in Whitehead, 1929. (This was subsequently 
found to be inconsistent and was posthumously corrected in a second edition, 
Whitehead, 1978.) 

The earliest completely rigorous and fully formalised theory of space where 
regions are the basic entity is the axiomatisation of a Geometry of Solids 
that was given by Tarski, 1929. Subsequently, a number of other formali- 
sations have been developed. These include the Calculus of Individuals pro- 
posed by Leonard and Goodman, 1940 and Goodman, 1951, which is close to 
Lesniewski’s mereology, and the spatial theories of Clarke, 1981 and Clarke, 
1985, which are based on Whitehead’s connection relation. 

More recently, region-based theories have attracted attention from 
researchers working on Knowledge Representation for Artificial Intelligence 
(AI) systems. The so-called Region Connection Calculus (Randell et al., 1992b) 
is a 1st-order formalism based on the connection relation and is a modification 
of Clarke’s theory. AI researchers are motivated to study such representations 
by a belief that they may be useful as a vehicle for automating certain human- 
like spatial reasoning capabilities. From this point of view it has been argued 
that the region-based approach is closer to the natural human conceptualisation 
of space. In the context of describing and reasoning about spatial situations 
in natural language, it is common to refer directly to regions and the relations 
between them, rather than referring to points and sets of points. Therefore, 
treating regions as basic entities in a formal language can in many cases allow 
simpler representation of high-level human-like spatial descriptions. 


1.2.3 Interdefinability of regions and points. Despite the difference 
in perspective, several formal results show that region and point-based concep- 
tualisations are in fact interdefinable, given a sufficiently rich formal apparatus. 
Whitehead himself had noted that by considering classes of regions, points 
can be defined as infinite sets of nested regions which converge to a point. 
Pratt and Lemon, 1997 and Pratt-Hartmann, 2001 showed that any sufficiently 
strong axiomatisation of the polygonal regions of a plane can be interpreted in 
terms of the classical point-based model of the Euclidean plane. So in some 
sense the region-based theory is not ‘ontologically simpler’ in its existential 
commitments. 

Later in this paper we shall adopt a similar approach, and by identifying a 
point with the set of all regions to which it belongs, we shall show that 
even much weaker and more general region-based theories can be interpreted 
in terms of point sets. Hence, the point and region based approaches should not 
be regarded as mutually exclusive, but rather as complementary perspectives. 
Nevertheless, we believe that region-based theories deserve more attention 
than has traditionally been paid them, and that for certain purposes they have 
clear advantages. 

There is an argument that regions are actually more powerful and flexible 
than points as a starting point for spatial theories. Pratt-Hartmann, 2001 shows 
that if we have a domain of regions plus a spatial language with sufficient (in fact 
rather low) expressive power, we implicitly determine a corresponding domain 
of points. Roughly speaking, this is done as follows: within the region-based 
theory we can specify pairs of regions that have a unique ‘point’ of contact. 
Thus, relations among points can be recast in the guise of formulae which refer 
to these region pairs. In this way points are implicitly definable from regions 
by 1st-order means; whereas, if we start with points as the basic entities, the 
definition of regions requires higher order axioms (unless we arbitrarily restrict 
the geometrical complexity of regions). 


1.3 Alternative logical formalisms 


We have seen that the representation of space allows alternative views that 
invert the perspective of the orthodox picture. The same is also true for the mode 
of application of formal representations themselves. 1st-order logic provides 
a standard alignment of syntactic and conceptual categories. Specifically, the 
basic nominal symbols of the formal language refer to what are considered to be 
the primitive entities of the ‘domain’ of a theory, while formal predicates corre- 
spond to properties and relations that hold among those entities. This alignment 
is widely held to be ‘natural’, in that it seems to accord in some respects with 
the syntactic expression of semantics found in natural languages. However, 
this intuition is difficult to establish conclusively. Moreover, by altering the 
correspondence between syntactic and conceptual categories, one may obtain 
alternative calculi that also have a meaningful interpretation and proof theory. 
For instance, one could interpret syntactically basic symbols as denoting ‘prop- 
erties’, and represent ‘individuals’ formally as predicates (the extension of the 
predicate being the set of properties satisfied by the corresponding individual). 

However, since relations between objects are not in general reducible to 
properties of individual objects, a much more powerful abstraction is obtained 
by taking relations as the basic entities of a formal system. This approach was 
first formalised in an algebraic framework by Tarski, 1941 and relation algebras 
are now a well-established alternative to standard 1st-order formalisms (Tarski 
and Givant, 1987; Andréka et al., 2001). 

From the point of view of logic and computation, there are significant 
advantages in treating relations as basic entities. In particular this mode of 
representation allows quantifier-free formalisation of many properties and 
inference patterns, which would otherwise require quantification. This is one of 
the main themes of algebraic logic, as it is elaborated in Andréka et al., 2001; 
Németi, 1991 and Ahmed, 2004. 

As we will be concerned with spatial representation based on the ‘contact’ 
relation, relation algebras are a natural system within which to formulate theo- 
ries of this kind. 


1.4 Structure of the chapter 


The organisation of this chapter is as follows. In Sec. 2 we shall present 
some basic formal structures and notations that will be used to develop the 
theory. These include Boolean algebras, relation algebras, topological spaces 
and proximity spaces. Sec. 3 introduces the spatial contact relation, which is 
the primary focus of our investigation. We consider the fundamental axioms 
satisfied by a contact relation and give standard interpretations of contact in 
terms of point-set topology. We then see how the basic properties of this relation 
can be described in both 1st-order and relation algebraic calculi. 

In Sec. 4 we introduce Boolean Contact Algebras, which are Boolean alge- 
bras supplemented with a contact relation satisfying appropriate general axioms. 
Additional axioms are also considered, which characterise further properties of 
contact that are exhibited under typical spatial interpretations. We give repre- 
sentation theorems for the general class of Boolean Contact Algebras in terms 
of both topological spaces and proximity spaces, and give more specific rep- 
resentation systems for algebras satisfying additional axioms. These theorems 
make concrete the correspondence between the relational approaches which 
focus on axiomatic properties of the contact relation, and the more well-known 
models of space in terms of point-set topology. 

Sec. 5 looks at some other well known approaches to formalising topological 
relationships, in particular the Region Connection Calculus (Randell et al., 
1992b) and the 9-intersection model (Egenhofer and Franzosa, 1991). Sec. 6 
considers the problem of reasoning with topological relations. The methods 
presented are: compositional reasoning, equational reasoning, encoding into 
modal logic and a relation algebraic proof theory. Sec. 7 concludes the chapter 
with a consideration of the correspondences that have been established between 
different modes of formalising topological information, and of ongoing and 
future developments in this area. 


2. Preliminary definitions and notation 


In this section we give definitions and key properties of the basic formal 
structures that will underpin our analysis. We start with Boolean algebras 
with operators, which provide an extremely general framework for studying 
structured domains of objects. Two important special cases of these algebras 
are considered: modal algebras, and relation algebras. 


2.1 Boolean algebras 


We assume that the reader has some familiarity with Boolean algebras (BAs), 
and here only revise the basic details and notation. Our standard reference for 
BAs is Koppelberg, 1989. Our signature for a BA will be $\langle B, \cdot, +, -, 0, 1 \rangle$. 
We will usually refer to an algebra by its base set (in this case B). 


DEFINITION 3.1 Boolean algebra concepts and notations: 

i) For all $a, b \in B$, $a \leq b$ holds iff $a + b = b$. 

ii) If $A \subseteq B$, then $\sum_B A$ denotes the least upper bound of $A$ relative to 
the $\leq$ ordering of $B$. If $A$ is infinite this does not necessarily exist. 
Where the relevant algebra is clear, we may write simply $\sum A$. 

iii) If $A$ is a subalgebra of $B$, we denote this by $A \leq B$. 

iv) The set of non-zero elements of $B$ is denoted by $B^{+}$. 

v) If $M$ is a subset of $B^{+}$, then $M$ is dense in $B$, iff 
$(\forall b \in B^{+})(\exists a \in M)\, a \leq b$. 

vi) An atom of $B$ is an element $a \in B^{+}$ such that 
$(\forall c)[c \leq a \rightarrow (c = 0 \vee c = a)]$. 

vii) The set of atoms of $B$ will be written as $At(B)$. 

viii) $B$ is atomic, iff $At(B)$ is dense in $B^{+}$. 

ix) If $f : B \to B$ is a mapping, then its dual is the mapping $f^{\partial} : B \to B$ 
defined by $f^{\partial}(x) = -f(-x)$. 
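The notions of Definition 3.1 can be made concrete on a small powerset algebra. The following is a minimal sketch, assuming the BA of all subsets of a hypothetical three-element set `U`, with + read as union and − as set complement; the function names are illustrative and do not come from the text.

```python
from itertools import combinations

def powerset(universe):
    """All subsets of `universe` as frozensets (the elements of the BA)."""
    items = list(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

U = frozenset({0, 1, 2})      # hypothetical base set
B = powerset(U)               # elements of the powerset BA
ZERO = frozenset()

def leq(a, b):
    """Item i): a <= b iff a + b = b, with + read as union."""
    return a | b == b

def complement(a):
    return U - a

B_PLUS = [a for a in B if a != ZERO]   # item iv): non-zero elements

def is_dense(M):
    """Item v): every non-zero b has some element of M below it."""
    return all(any(leq(a, b) for a in M) for b in B_PLUS)

def atoms():
    """Item vi): non-zero a whose only lower bounds are 0 and a itself."""
    return [a for a in B_PLUS
            if all(c == ZERO or c == a for c in B if leq(c, a))]

def dual(f):
    """Item ix): the dual of f maps x to -f(-x)."""
    return lambda x: complement(f(complement(x)))
```

In this finite algebra the atoms are the singletons and they form a dense set, so the algebra is atomic in the sense of item viii).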


In general, a BA will contain elements corresponding to the meet and join of 
any finite subset of its domain. A BA is called complete, if arbitrary joins and 
meets exist. The completion of B is the smallest complete BA A which contains 
B as a dense subalgebra. It is well known that each B has a completion which 
is unique up to isomorphism. 

DEFINITION 3.2 An ultrafilter is a subset $F$ of $B$ such that: 

i) If $x \in F$, $y \in B$ and $x \leq y$, then $y \in F$. 

ii) If $x, y \in F$, then $x \cdot y \in F$. 

iii) $x \in F$ if and only if $-x \notin F$. 

The set of all ultrafilters of $B$ will be denoted by $Ult(B)$. 
Ultrafilters are often employed as a means to represent ‘point-like’ entities 
that are implicit in the structure of a BA. From a purely algebraic point of view, 
the elements of a BA are abstract entities with no sub-structure. However, the 
elements may be, and often are, intended to correspond to composite objects 
(e.g. sets or spatial regions). Thus, as we shall see later, the elements of a BA 
are often interpreted as point sets in some space (e.g. a topological space). In 
such a context, an ultrafilter can usually be thought of as the set of all those elements 
of a BA that contain some particular point in the space over which the algebra 
is interpreted. 

Perhaps the simplest example is the BA $X^{*}$ whose elements are (interpreted 
as) all subsets of the set $X$ (with the Boolean operations having their standard 
set-theoretic interpretation). In this case, for each $x \in X$ the set 
$\{Y \mid Y \in X^{*} \wedge x \in Y\}$ is an ultrafilter of $X^{*}$. 
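The powerset example can be checked mechanically. The sketch below assumes a hypothetical three-element set `X` and tests the three conditions of Definition 3.2 directly; `is_ultrafilter`, `principal` and `all_ultrafilters` are illustrative names, not notation from the text.

```python
from itertools import combinations

def powerset(universe):
    """All subsets of `universe` as frozensets."""
    items = list(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def is_ultrafilter(F, universe):
    """Check conditions i)-iii) of Definition 3.2 in the powerset BA."""
    B = powerset(universe)
    F = set(F)
    upward = all(y in F for x in F for y in B if x <= y)          # i)
    meets = all((x & y) in F for x in F for y in F)               # ii)
    prime = all((x in F) != ((universe - x) in F) for x in B)     # iii)
    return upward and meets and prime

def principal(x, universe):
    """The family of all subsets of `universe` that contain the point x."""
    return {Y for Y in powerset(universe) if x in Y}

def all_ultrafilters(universe):
    """Brute-force search over every family of subsets (small sets only)."""
    return [set(F) for F in powerset(powerset(universe))
            if is_ultrafilter(F, universe)]

X = frozenset("abc")   # hypothetical base set
```

For a finite set the brute-force search finds exactly one ultrafilter per point, which matches the reading of ultrafilters as ‘point-like’ entities. For infinite sets this enumeration is of course not available.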


DEFINITION 3.3 A canonical extension of $B$ is an algebra $B^{\sigma}$, which is a 
complete and atomic BA containing an isomorphic copy of $B$ as a subalgebra, 
and which satisfies the following properties: 

i) Every atom of $B^{\sigma}$ is the meet of elements of $B$. 

ii) If $A \subseteq B$ is such that $\sum_{B^{\sigma}} A = 1$, 
then there is a finite set $A' \subseteq A$ such that $\sum_{B^{\sigma}} A' = 1$. 


It is well known that each BA has a canonical extension which is unique 
up to isomorphism. One such construction is given by Stone’s representation 
theorem for Boolean algebras: Let $B^{\sigma}$ be the powerset algebra of the set 
$X$ of ultrafilters of $B$, and embed $B$ into $B^{\sigma}$ by $b \mapsto \{U \in X : b \in U\}$. If $A \leq B$, 
then $A$ is called a regular subalgebra of $B$, if $B$ is a canonical extension of $A$. 
The notion of canonical extension is equivalent to that of ‘perfect’ extension 
introduced in Jónsson and Tarski, 1951. Our notion of regular sub-algebra is 
also equivalent to that used in Jónsson and Tarski, 1951, which is different to 
that given in Koppelberg, 1989. 

For more details and discussions we refer the reader to Jónsson, 1993, 
Jónsson, 1994, Jónsson, 1995, Jónsson and Tarski, 1951. 





2.2 Boolean algebras with operators 


The structure of a Boolean algebra may be further elaborated by the intro- 
duction of additional operators. These Boolean algebras with operators arose 
from the investigation of relation algebras, and were first studied in detail by 
Jónsson and Tarski, 1951; a survey can be found in Jónsson, 1993. Many useful 
structures have the form of such algebras. 


DEFINITION 3.4 Some useful concepts for describing Boolean algebras with 
operators are defined as follows: 
i) A function $f : B^{n} \to B$ on a BA is called additive in its $i$-th argument if 
$f(x_0, \ldots, x_i, \ldots, x_n) + f(x_0, \ldots, x_i', \ldots, x_n) = f(x_0, \ldots, (x_i + x_i'), \ldots, x_n)$, 
for all $x_0, \ldots, x_n, x_i' \in B$. 

ii) A function $f : B^{n} \to B$ on a BA is called an operator, if it is additive 
in each of its arguments. 

iii) $f : B^{n} \to B$ is called normal, if it is additive and its value is 0 if any 
of its arguments is 0. 

iv) A structure $\langle B, (f_i)_{i \in I} \rangle$ is called a Boolean Algebra with Operators 
(BAO), if $B$ is a BA, and all $f_i$ are operators. 

v) If all $f_i$ are furthermore normal, then we speak of a normal BAO. 

vi) A collection of algebras defined by a given signature and a set of 
universally quantified equations is called an equational class (or variety). 


Examples of normal BAOs are modal algebras, relation algebras (both of which 
will be discussed below), and cylindric algebras, which provided an algebraisation 
of 1st-order logic. We invite the reader to consult the classic monographs 
by Henkin et al., 1971, Henkin et al., 1985 or the recent exposition by Andréka 
et al., 2001, which provides a comprehensive (and comprehensible) introduc- 
tion to Tarski’s algebraic logic. 

The concept of canonical extensions of Definition 3.3 can be extended to 
BAOs: 


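DEFINITION 3.5 Suppose that $B$ is a BA, and $f$ an $n$-ary normal operator on 
$B$. The canonical extension $f^{\sigma}$ of $f$ is defined by 

(3.1) $f^{\sigma}(x) = \sum \{ \prod \{ f(y) : p \leq y \in B^{n} \} : p \in At(B^{\sigma})^{n} \text{ and } p \leq x \}$ 

for all $x \in (B^{\sigma})^{n}$. If $\langle B, (f_i)_{i \in I} \rangle$ is a normal BAO, we call 
$\langle B^{\sigma}, (f_i^{\sigma})_{i \in I} \rangle$ the canonical extension of $\langle B, (f_i)_{i \in I} \rangle$. 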


PROPOSITION 3.6 (JÓNSSON AND TARSKI, 1951) The canonical extension 
of a normal BAO $\langle B, (f_i)_{i \in I} \rangle$ is a complete and atomic normal BAO 
containing $\langle B, (f_i)_{i \in I} \rangle$ as a subalgebra. 


This is not the place to dwell on the preservation properties of canonical 
extensions of normal BAOs, and we refer the reader to Jónsson, 1993 and de Rijke 
and Venema, 1995 for details. 

As the connection of unary normal operators to operators of modal logics 
(which will be examined in detail later) is somewhat special, we make the 
following convention: 
DEFINITION 3.7 If f is a unary normal operator on the BA B, we call it 
a modal operator or possibility operator, and the structure (B, f) a modal 
algebra. 


Hence, modal algebras form an equational class (or variety) of algebras, namely 
the class of BAOs with one operator $f$ that satisfy, in addition to the identities 
of Boolean algebra, the equations: 


(3.2) $f(x + y) = f(x) + f(y)$, 
(3.3) $f(0) = 0$. 


A special case of modal algebras (hence, of BAOs) are closure algebras: 


DEFINITION 3.8 A possibility operator $f$ on $B$ which also satisfies, for all 
$a \in B$, 


(3.4) $a + f(a) = f(a)$, 
(3.5) $f(f(a)) = f(a)$, 

is called a closure operator, and, in this case, $\langle B, f \rangle$ is a closure algebra. 


Incidentally, one-dimensional cylindric algebras are a special case of closure 
algebras (Henkin et al., 1971). 


DEFINITION 3.9 Functions whose duals are possibility operators are called 
necessity operators. Thus a necessity operator on $B$ is a function $g : B \to B$, 
for which 


(3.6) $g(1) = 1$, (Dually normal) 
(3.7) $g(a \cdot b) = g(a) \cdot g(b)$ for all $a, b \in B$. (Multiplicative) 


DEFINITION 3.10 A necessity operator $g$ is called an interior operator if for 
all $a \in B$ it satisfies 

(3.8) $g(a) + a = a$, 

(3.9) $g(a) = g(g(a))$. 


If $g$ is an interior operator on $B$, then the structure $\langle B, g \rangle$ is called an interior 
algebra. 


Modal algebras can be viewed as an algebraic counterpart to the relational 
structures known as (Kripke) frames: 


DEFINITION 3.11 A frame is a pair $F = \langle U, R \rangle$, where $R$ is a binary relation 
on $U$, called an accessibility relation. 
Every frame F has a corresponding algebra, called the complex algebra of F. 


DEFINITION 3.12 If $F = \langle U, R \rangle$ is a frame, then the complex algebra of $F$ 
is the structure $F^{*} = \langle 2^{U}, \Diamond_R \rangle$, where $\Diamond_R : 2^{U} \to 2^{U}$ is defined by 


(3.10) $\Diamond_R(X) = \{ y \in U : (\exists x \in X)\, xRy \}$. 
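As an illustration of Definition 3.12, the operator of (3.10) can be computed directly on a finite frame. This is a minimal sketch, assuming a hypothetical three-element frame; `diamond` is an illustrative name for the operator defined in (3.10), which sends a set X to the set of points reachable from some member of X.

```python
from itertools import combinations

def powerset(universe):
    """All subsets of `universe` as frozensets."""
    items = list(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def diamond(R, X):
    """Equation (3.10): all y such that xRy for some x in X."""
    return frozenset(y for (x, y) in R if x in X)

# Hypothetical frame <U, R>, chosen only for illustration.
U = frozenset({0, 1, 2})
R = {(0, 1), (1, 2), (2, 2)}
SUBSETS = powerset(U)

# Normality (3.3): the empty set is mapped to the empty set.
is_normal = diamond(R, frozenset()) == frozenset()

# Additivity (3.2): the operator distributes over unions.
is_additive = all(diamond(R, X | Y) == diamond(R, X) | diamond(R, Y)
                  for X in SUBSETS for Y in SUBSETS)

# The operator is determined by its values on atoms (singletons).
determined_by_atoms = all(
    diamond(R, X) == frozenset().union(*[diamond(R, frozenset({x})) for x in X])
    for X in SUBSETS)
```

The last check reflects the fact that the complex algebra is atomic: once the operator is known on singletons, additivity fixes it everywhere.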


It is not hard to see that F* is a complete and atomic modal algebra. 
Conversely, we can construct a frame from a modal algebra: 


DEFINITION 3.13 If $\langle B, f \rangle$ is a modal algebra, let $R_f \in Rel(At(B))$ be 
defined by 


(3.11) $a R_f b \Longleftrightarrow a \leq f(b)$. 


The structure $\langle At(B), R_f \rangle$ is called the canonical frame of $\langle B, f \rangle$, and $R_f$ its 
canonical relation. 


(We use the symbol $\Longleftrightarrow$ to express meta-level equivalences and semantic 
definitions.) 
We now have the following representation theorem: 


PROPOSITION 3.14 (JÓNSSON AND TARSKI, 1951; JÓNSSON, 1993; 
JÓNSSON, 1994) Let $\langle B, f \rangle$ be a complete and atomic modal algebra. Then, 
$\langle B, f \rangle$ is a regular subalgebra of the complex algebra of its canonical frame. 
Furthermore, if $\langle B, f \rangle$ is isomorphic to a regular subalgebra of a complex 
algebra of some frame $\langle U, R \rangle$, then $\langle U, R \rangle \cong \langle At(B), R_f \rangle$. 


It is worth mentioning that all normal BAOs, not only the ones with unary 
operators, are representable as regular subalgebras of complex algebras of 
frames. Normality is essential, since non-normal BAOs do not admit such 
a representation (Madarász, 1998). 

Correspondence theory investigates which relational properties of R can be 
expressed by its canonical modal operator and its dual (see e.g. van Benthem, 
1984). We have, for example: 


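(3.12) $R$ is reflexive $\Longleftrightarrow (\forall X)[X \subseteq \Diamond_R(X)]$, 
(3.13) $R$ is symmetric $\Longleftrightarrow (\forall X)[\Diamond_R(-\Diamond_R(-X)) \subseteq X]$, 
(3.14) $R$ is transitive $\Longleftrightarrow (\forall X)[\Diamond_R \Diamond_R(X) \subseteq \Diamond_R(X)]$. 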


These correspondences, as well as the following result, have appeared already 
in Jónsson and Tarski, 1951: 


PROPOSITION 3.15 A modal algebra is a closure algebra if and only if its 
canonical relation is reflexive and transitive. 
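On a small finite set the correspondences (3.12) to (3.14) can be confirmed by exhaustive search. The sketch below assumes a hypothetical three-element universe and enumerates all 512 binary relations on it; it also checks that the operator of a complex algebra satisfies the closure axioms (3.4) and (3.5) exactly when the relation is reflexive and transitive. All names are illustrative.

```python
from itertools import combinations, product

U = (0, 1, 2)                                   # hypothetical universe
FULL = frozenset(U)
PAIRS = list(product(U, repeat=2))
SUBSETS = [frozenset(c) for r in range(len(U) + 1)
           for c in combinations(U, r)]

def diamond(R, X):
    """Operator of (3.10): points reachable from X in one R-step."""
    return frozenset(y for (x, y) in R if x in X)

def box(R, X):
    """Dual operator: the complement of diamond applied to the complement."""
    return FULL - diamond(R, FULL - X)

def reflexive(R):
    return all((x, x) in R for x in U)

def symmetric(R):
    return all((y, x) in R for (x, y) in R)

def transitive(R):
    return all((x, z) in R for (x, y) in R for (w, z) in R if y == w)

def all_relations():
    for r in range(len(PAIRS) + 1):
        for c in combinations(PAIRS, r):
            yield frozenset(c)

def check_correspondences():
    for R in all_relations():
        c312 = all(X <= diamond(R, X) for X in SUBSETS)
        c313 = all(diamond(R, box(R, X)) <= X for X in SUBSETS)
        c314 = all(diamond(R, diamond(R, X)) <= diamond(R, X) for X in SUBSETS)
        closure = all(X | diamond(R, X) == diamond(R, X)
                      and diamond(R, diamond(R, X)) == diamond(R, X)
                      for X in SUBSETS)
        if (c312, c313, c314) != (reflexive(R), symmetric(R), transitive(R)):
            return False
        if closure != (reflexive(R) and transitive(R)):
            return False
    return True
```

An exhaustive check of this kind is evidence for one small universe only; the general statements are the cited results.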


2.3 Binary relations and relation algebras 


A binary relation $R$ on a set $U$ is a subset of $U \times U$, i.e. a set of ordered pairs 
$\langle x, y \rangle$ where $x, y \in U$. Instead of $\langle x, y \rangle \in R$, we shall often write $xRy$. The 
smallest binary relation on $U$ is the empty relation $\emptyset$, and the largest relation is 
the universal relation $U \times U$, which we will normally abbreviate as $U^{2}$. The 
identity relation $\{\langle x, x \rangle : x \in U\}$ will be denoted by $1'$, and its complement, the 
diversity relation, by $0'$. Domain and range of $R$ are defined by 


(3.15) $dom(R) = \{x \in U : (\exists y \in U)\, xRy\}$, 
(3.16) $ran(R) = \{x \in U : (\exists y \in U)\, yRx\}$. 

Furthermore, we let $R(x) = \{y \in U : xRy\}$. 


The set of all binary relations on $U$ will be denoted by $Rel(U)$. Clearly, 
$Rel(U)$ is a Boolean algebra under the usual set operations: 


(3.17) $-R = \{\langle x, y \rangle : \neg(xRy)\}$, 
(3.18) $R \cup S = \{\langle x, y \rangle : xRy \text{ or } xSy\}$, 
(3.19) $R \cap S = \{\langle x, y \rangle : xRy \text{ and } xSy\}$. 

If $R, S \in Rel(U)$, the composition of $R$ and $S$ is defined as 

(3.20) $R ; S = \{\langle x, y \rangle : (\exists z)[xRz \text{ and } zSy]\}$. 

The converse of $R$, written as $R^{\smile}$, is the set 

(3.21) $R^{\smile} = \{\langle y, x \rangle : xRy\}$. 


A detailed analysis of relation algebras can be found in Henkin et al., 1971, 
and an overview in Jónsson, 1991. The following lemma sets out some decisive 
properties of composition and converse. 


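LEMMA 3.16 


i) $;$ is associative and distributes over arbitrary joins. 

ii) $1' ; R = R ; 1' = R$. 

iii) ${}^{\smile}$ is bijective, of order two, i.e. $R^{\smile\smile} = R$, and distributes over 
arbitrary joins. 

iv) $(R ; S)^{\smile} = S^{\smile} ; R^{\smile}$. 

v) $(R ; S) \cap T = \emptyset \Longleftrightarrow (R^{\smile} ; T) \cap S = \emptyset \Longleftrightarrow (T ; S^{\smile}) \cap R = \emptyset$. 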
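Composition (3.20) and converse (3.21) are straightforward to implement for finite relations, and the properties listed in Lemma 3.16 can then be spot-checked on random inputs. This is a minimal sketch, assuming a hypothetical four-element universe; random testing only gives evidence, not a proof, and the helper names are illustrative.

```python
import random
from itertools import product

U = range(4)                                     # hypothetical universe
ALL = frozenset(product(U, repeat=2))            # universal relation
ID = frozenset((x, x) for x in U)                # identity relation 1'

def compose(R, S):
    """Equation (3.20): pairs <x, y> with xRz and zSy for some z."""
    return frozenset((x, y) for (x, z) in R for (w, y) in S if z == w)

def converse(R):
    """Equation (3.21): pairs <y, x> with xRy."""
    return frozenset((y, x) for (x, y) in R)

def random_relation(rng):
    """A random relation; the density varies so sparse cases also occur."""
    density = rng.choice([0.1, 0.3, 0.6])
    return frozenset(p for p in sorted(ALL) if rng.random() < density)

def lemma_holds(R, S, T):
    assoc = compose(R, compose(S, T)) == compose(compose(R, S), T)
    distrib = compose(R, S | T) == compose(R, S) | compose(R, T)
    unit = compose(ID, R) == R == compose(R, ID)
    involution = converse(converse(R)) == R
    anti = converse(compose(R, S)) == compose(converse(S), converse(R))
    # Item v): the three emptiness conditions stand or fall together.
    a = not (compose(R, S) & T)
    b = not (compose(converse(R), T) & S)
    c = not (compose(T, converse(S)) & R)
    return assoc and distrib and unit and involution and anti and (a == b == c)
```

Only binary joins are tested here; on a finite universe distribution over arbitrary joins follows from the binary case together with the behaviour on the empty relation.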


Note that any equation and any inequality between relations can be written as 
$T = U^{2}$ for some $T$. To do this, it is convenient first to define the operation 
$R \oplus S$, which gives the symmetric difference of $R$ and $S$: 


(3.22) $R \oplus S = (R \cap -S) \cup (S \cap -R)$. 
We then have the following equivalences: 


(3.23) $R = S \Longleftrightarrow -(R \oplus S) = U^{2}$, 
(3.24) $R \neq S \Longleftrightarrow (U^{2} ; ((R \oplus S) ; U^{2})) = U^{2}$. 


Implicitly, we use here the concept of discriminator algebras which are a pow- 
erful instrument of algebraic logic (see Heinrich, 1978 and also Jónsson et al., 
1991). 
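The two equivalences (3.23) and (3.24) can be verified exhaustively on a very small universe. The sketch below assumes a hypothetical two-element set, so that all 16 relations and all 256 pairs of relations can be enumerated; `equality_term` and `inequality_term` are illustrative names for the left-hand terms of the two equations.

```python
from itertools import combinations, product

U = (0, 1)                                       # hypothetical universe
ALL = frozenset(product(U, repeat=2))            # universal relation

def compose(R, S):
    """Relational composition as in (3.20)."""
    return frozenset((x, y) for (x, z) in R for (w, y) in S if z == w)

def sym_diff(R, S):
    """Symmetric difference as in (3.22)."""
    return (R - S) | (S - R)

def equality_term(R, S):
    """The term of (3.23); equal to the universal relation iff R = S."""
    return ALL - sym_diff(R, S)

def inequality_term(R, S):
    """The term of (3.24); equal to the universal relation iff R != S."""
    return compose(ALL, compose(sym_diff(R, S), ALL))

def all_relations():
    pairs = sorted(ALL)
    return [frozenset(c) for r in range(len(pairs) + 1)
            for c in combinations(pairs, r)]

def check_discriminator_terms():
    rels = all_relations()
    return all((equality_term(R, S) == ALL) == (R == S)
               and (inequality_term(R, S) == ALL) == (R != S)
               for R in rels for S in rels)
```

The second term works because composing a non-empty relation with the universal relation on both sides yields the universal relation, while the empty relation stays empty.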


DEFINITION 3.17 The full algebra of binary relations on $U$ is the structure 
$\langle Rel(U), \cap, \cup, -, \emptyset, U^{2}, ;, {}^{\smile}, 1' \rangle$. 

A Boolean subalgebra of $Rel(U)$ which is closed under $;$ and ${}^{\smile}$ and contains 
$1'$ will be called an algebra of binary relations (BRA). 


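Many properties of relations can be expressed by equations (or inclusions) 
among relations, for example, 


(3.25) $R$ is reflexive $\Longleftrightarrow (\forall x)\, xRx$ 
$\Longleftrightarrow 1' \subseteq R$. 

(3.26) $R$ is symmetric $\Longleftrightarrow (\forall x, y)[xRy \rightarrow yRx]$ 
$\Longleftrightarrow R^{\smile} = R$. 

(3.27) $R$ is transitive $\Longleftrightarrow (\forall x, y, z)[xRy \wedge yRz \rightarrow xRz]$ 
$\Longleftrightarrow R ; R \subseteq R$. 

(3.28) $R$ is dense $\Longleftrightarrow (\forall x)\, x(-R)x \wedge (\forall x, y)[xRy \rightarrow (\exists z)\, xRzRy]$ 
$\Longleftrightarrow R \cap 1' = \emptyset \wedge R \subseteq R ; R$ 
$\Longleftrightarrow R \cap (1' \cup -(R ; R)) = \emptyset$. 

(3.29) $R$ is extensional $\Longleftrightarrow (\forall x, y)[R(x) = R(y) \rightarrow x = y]$ 
$\Longleftrightarrow [-(R ; -R^{\smile}) \cap -(-R ; R^{\smile})] \subseteq 1'$. 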


Observe that all formulae above contain at most three variables. This is no 
accident, as the following result shows: 


PROPOSITION 3.18 (TARSKI AND GIVANT, 1987; GIVANT, 2006) 


i) The 1st-order properties of binary relations on a set $U$ that can be 
expressed by equations using the operators $\cap, \cup, -, ;, {}^{\smile}$, and constants 
$\emptyset, U^{2}, 1'$ are exactly those which can be expressed with at most three 
distinct variables. 


ii) If $\mathcal{R}$ is a collection of binary relations on $U$, then the closure of $\mathcal{R} \cup \{1'\}$ 
under the operations $\cap, \cup, -, ;, {}^{\smile}$ is the set of all binary relations on $U$ 
which are definable in the (language of the) relational structure $\langle U, \mathcal{R} \rangle$ 
by 1st-order formulae using at most three variables, two of which are 
free. 
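The equational characterisations (3.25) to (3.29) can be compared with their pointwise definitions by brute force. The sketch below assumes a hypothetical three-element universe and enumerates every relation on it. For extensionality it reads the equational form as: the intersection of −(R ; −R˘) and −(−R ; R˘) is included in 1', which is the form obtained by unfolding R(x) = R(y) with R(x) = {y : xRy}; this reading is an assumption of the sketch. All names are illustrative.

```python
from itertools import combinations, product

U = (0, 1, 2)                                    # hypothetical universe
ALL = frozenset(product(U, repeat=2))
ID = frozenset((x, x) for x in U)

def compose(R, S):
    return frozenset((x, y) for (x, z) in R for (w, y) in S if z == w)

def converse(R):
    return frozenset((y, x) for (x, y) in R)

def neg(R):
    return ALL - R

def image(R, x):
    """R(x), the set of all y with xRy."""
    return frozenset(y for (a, y) in R if a == x)

POINTWISE = {
    "reflexive": lambda R: all((x, x) in R for x in U),
    "symmetric": lambda R: all((y, x) in R for (x, y) in R),
    "transitive": lambda R: all((x, y) in R
                                for (x, z) in R for (w, y) in R if z == w),
    "dense": lambda R: all((x, x) not in R for x in U) and all(
        any((x, z) in R and (z, y) in R for z in U) for (x, y) in R),
    "extensional": lambda R: all(x == y for x in U for y in U
                                 if image(R, x) == image(R, y)),
}

EQUATIONAL = {
    "reflexive": lambda R: ID <= R,
    "symmetric": lambda R: converse(R) == R,
    "transitive": lambda R: compose(R, R) <= R,
    "dense": lambda R: not (R & (ID | neg(compose(R, R)))),
    "extensional": lambda R: (neg(compose(R, neg(converse(R))))
                              & neg(compose(neg(R), converse(R)))) <= ID,
}

def all_relations():
    pairs = sorted(ALL)
    for r in range(len(pairs) + 1):
        for c in combinations(pairs, r):
            yield frozenset(c)

def disagreements():
    """Names of properties whose two formulations differ on some relation."""
    return {name for R in all_relations() for name in POINTWISE
            if POINTWISE[name](R) != EQUATIONAL[name](R)}
```

An empty result from `disagreements()` means the two formulations agree on every relation over the chosen universe.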


If A is a complete and atomic BRA, in particular if A is finite, then the actions 
of the Boolean operators are uniquely determined by the atoms. To determine 
the structure of A, it is therefore enough to specify the composition and the 
converse operation. 

When dealing with an atomic BRA, it is often convenient to specify the 
composition operation by means of a composition table (CT), which, for any two 
atomic relations $R_i, R_j$, specifies the relation $R_i ; R_j$ in terms of its constituent 
atomic relations. Formally, a composition table is a mapping 
$CT : At(A) \times At(A) \to 2^{At(A)}$ such that 


(3.30) $T \in CT(R, S) \Longleftrightarrow T \subseteq (R ; S)$. 

Since $A$ is atomic, we have 

(3.31) $R ; S = \bigcup CT(R, S)$. 


CT can be conveniently written as a quadratic array (the composition table of 
A), where rows and columns are labelled with the atoms of A, and the cells 
contain CT(R, S). 
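A composition table can be computed directly from (3.30) once the atoms are given as concrete relations. The following sketch uses an example that is not taken from the text: the five points of a pentagon, with atoms 1' (identity), `E` (adjacent points) and `N` (distinct non-adjacent points). The atom names and the helper functions are assumptions made for illustration.

```python
from itertools import product

U = range(5)                                     # vertices of a 5-cycle
PAIRS = list(product(U, repeat=2))

# Three atomic relations partitioning U x U (hypothetical example).
ATOMS = {
    "1'": frozenset((x, y) for x, y in PAIRS if x == y),
    "E": frozenset((x, y) for x, y in PAIRS if (x - y) % 5 in (1, 4)),
    "N": frozenset((x, y) for x, y in PAIRS if (x - y) % 5 in (2, 3)),
}

def compose(R, S):
    """Relational composition as in (3.20)."""
    return frozenset((x, y) for (x, z) in R for (w, y) in S if z == w)

def composition_table(atoms):
    """Equation (3.30): T is in CT(R, S) iff T is included in R ; S."""
    return {(r, s): {t for t, T in atoms.items() if T <= compose(R, S)}
            for r, R in atoms.items() for s, S in atoms.items()}

def table_is_exact(atoms, table):
    """Equation (3.31): R ; S is the union of the atoms listed in CT(R, S)."""
    return all(
        compose(atoms[r], atoms[s])
        == frozenset().union(*[atoms[t] for t in table[(r, s)]])
        for (r, s) in table)

CT = composition_table(ATOMS)

def print_table(table, names=("1'", "E", "N")):
    """Print the table as a square array, rows and columns labelled by atoms."""
    for r in names:
        print(r, [sorted(table[(r, s)]) for s in names])
```

That `table_is_exact` succeeds shows that the unions of these three atoms are closed under composition, so they generate a BRA in the sense of Definition 3.17.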

BRAs are one instance of the class of relation algebras, which may be seen 
as an abstraction of algebras of binary relations (Tarski, 1941): 


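DEFINITION 3.19 A relation algebra (RA) 
$\langle A, +, \cdot, -, 0, 1, ;, {}^{\smile}, 1' \rangle$ 


is a structure of type $\langle 2, 2, 1, 0, 0, 2, 1, 0 \rangle$ which satisfies: 


(RA0) $\langle A, +, \cdot, -, 0, 1 \rangle$ is a Boolean algebra. 
(RA1) $x ; (y ; z) = (x ; y) ; z$. 

(RA2) $(x + y) ; z = (x ; z) + (y ; z)$. 
(RA3) $x ; 1' = x$. 

(RA4) $x^{\smile\smile} = x$. 

(RA5) $(x + y)^{\smile} = x^{\smile} + y^{\smile}$. 

(RA6) $(x ; y)^{\smile} = y^{\smile} ; x^{\smile}$. 

(RA7) $(x^{\smile} ; -(x ; y)) \leq -y$. 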


Observe that BRAs and RAs are BAOs: An RA is a BAO where the additional 
operators $\langle ;, {}^{\smile}, 1' \rangle$ form an involuted monoid, and the connection between this 
monoid and the Boolean operations is given by RA7. The somewhat cryptic 
character of RA7 can be made clearer by observing that, in the presence of the 
other axioms, it is equivalent to the cycle law 


(3.32) $(x ; y) \cdot z = 0 \Longleftrightarrow (x^{\smile} ; z) \cdot y = 0 \Longleftrightarrow (z ; y^{\smile}) \cdot x = 0$. 
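To see the axioms at work on an abstract algebra rather than on concrete relations, one can take a finite Boolean algebra of sets of atom names and define composition from a table. The sketch below assumes a hypothetical three-atom table (often called the pentagon table) in which every atom is its own converse; neither the table nor the function names come from the text. It checks RA1 to RA7 and the cycle law (3.32) over all triples of elements.

```python
from itertools import combinations, product

ATOMS = ("1'", "E", "N")

# Hypothetical composition table on atoms; each atom is its own converse.
TABLE = {
    ("1'", "1'"): {"1'"}, ("1'", "E"): {"E"}, ("1'", "N"): {"N"},
    ("E", "1'"): {"E"}, ("E", "E"): {"1'", "N"}, ("E", "N"): {"E", "N"},
    ("N", "1'"): {"N"}, ("N", "E"): {"E", "N"}, ("N", "N"): {"1'", "E"},
}

ZERO = frozenset()
ONE = frozenset(ATOMS)
IDENT = frozenset({"1'"})
ELEMENTS = [frozenset(c) for r in range(len(ATOMS) + 1)
            for c in combinations(ATOMS, r)]

def comp(x, y):
    """Composition extended additively from the atom table."""
    out = set()
    for a in x:
        for b in y:
            out |= TABLE[(a, b)]
    return frozenset(out)

def conv(x):
    """Converse; the identity map here because all atoms are symmetric."""
    return x

def neg(x):
    return ONE - x

def axioms_hold(x, y, z):
    return (comp(x, comp(y, z)) == comp(comp(x, y), z)            # RA1
            and comp(x | y, z) == comp(x, z) | comp(y, z)         # RA2
            and comp(x, IDENT) == x                               # RA3
            and conv(conv(x)) == x                                # RA4
            and conv(x | y) == conv(x) | conv(y)                  # RA5
            and conv(comp(x, y)) == comp(conv(y), conv(x))        # RA6
            and comp(conv(x), neg(comp(x, y))) <= neg(y))         # RA7

def cycle_law_holds(x, y, z):
    a = (comp(x, y) & z) == ZERO
    b = (comp(conv(x), z) & y) == ZERO
    c = (comp(z, conv(y)) & x) == ZERO
    return a == b == c

def is_relation_algebra():
    return all(axioms_hold(x, y, z) and cycle_law_holds(x, y, z)
               for x, y, z in product(ELEMENTS, repeat=3))
```

RA0 holds automatically because the elements are all subsets of a finite set under union, intersection and complement.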


Tarski announced in the late 1940s that set theory and number theory could 
be formulated in the calculus of relation algebras: 


‘It has even been shown that every statement from a given set of axioms can be 
reduced to the problem of whether an equation is identically satisfied in every 
relation algebra. One could thus say that, in principle, the whole of mathematical 
research can be carried out by studying identities in the arithmetic of relation 
algebras’. (Chin and Tarski, 1951) 


We invite the reader to consult Tarski and Givant, 1987, and, for an overview, 
Ahmed, 2004 or Givant, 2006. Another excellent reference for the theory of 
RAs is the book by Hirsch and Hodkinson, 2002. 


2.4 Topological spaces 


We assume familiarity with the basic ideas of topological spaces and only 
briefly recap some notational details. A topological space is a structure $\langle X, \tau \rangle$, 
where $X$ is the base set, and $\tau$ the collection of open subsets of $X$. $\tau$ is closed 
under arbitrary unions and finite intersections. If $\tau$ is understood, we may use 
$X$ to refer to the topological space. The elements of $X$ will be denoted by lower 
case Greek letters (except $\tau$), and its subsets by lower case Roman letters. 

If $x \subseteq X$, its interior is denoted by $int(x)$, and its closure by $cl(x)$. Observe 
that int and cl are respectively an interior operator in the sense of (3.8)-(3.9), 
and a closure operator in the sense of (3.4)-(3.5). A subspace $y$ of $X$ is dense 
in $X$, if $cl(y) = X$. 

The boundary $\partial(x)$ of $x \subseteq X$ is the set $cl(x) \setminus int(x)$. If $\alpha \in X$, and 
$\alpha \in x \in \tau$, then $x$ is called an open neighbourhood of $\alpha$. $X$ is called connected 
if it is not the union of two disjoint non-empty open sets. 
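For a finite space the operators int and cl can be computed from the list of open sets, and the operator axioms (3.4) to (3.9) can then be checked exhaustively. This is a minimal sketch, assuming a hypothetical four-point space; the particular topology `TAU` and all function names are illustrative choices.

```python
from itertools import combinations

X = frozenset({0, 1, 2, 3})                       # hypothetical base set
TAU = {frozenset(), frozenset({0}), frozenset({0, 1}),
       frozenset({2, 3}), frozenset({0, 2, 3}), X}

def subsets(universe):
    items = list(universe)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def is_topology(universe, tau):
    """For a finite family, pairwise unions and intersections suffice."""
    return (frozenset() in tau and universe in tau
            and all(u | v in tau and u & v in tau for u in tau for v in tau))

def interior(x, tau):
    """Union of all open sets included in x."""
    return frozenset().union(*[u for u in tau if u <= x])

def closure(x, universe, tau):
    """Complement of the interior of the complement."""
    return universe - interior(universe - x, tau)

def boundary(x, universe, tau):
    return closure(x, universe, tau) - interior(x, tau)

def is_closure_operator(cl, universe):
    subs = subsets(universe)
    return (cl(frozenset()) == frozenset()                              # (3.3)
            and all(cl(a | b) == cl(a) | cl(b) for a in subs for b in subs)  # (3.2)
            and all(a | cl(a) == cl(a) for a in subs)                   # (3.4)
            and all(cl(cl(a)) == cl(a) for a in subs))                  # (3.5)

def is_interior_operator(g, universe):
    subs = subsets(universe)
    return (g(universe) == universe                                     # (3.6)
            and all(g(a & b) == g(a) & g(b) for a in subs for b in subs)     # (3.7)
            and all(g(a) | a == a for a in subs)                        # (3.8)
            and all(g(a) == g(g(a)) for a in subs))                     # (3.9)
```

Here closure is defined as the dual of interior, in line with item ix) of Definition 3.1, so the two operator checks are two views of the same structure.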


2.4.1 Separation conditions. The general framework of topological 
spaces includes structures of many different kinds. In particular the open sets 
may be more or less densely distributed within the space. Significant, fun- 
damental properties of this distribution can often be described in terms of the 
existence of certain disjoint sets separating arbitrary points and/or subsets of 
the space. Such properties are known as separation conditions. 

Later, in Sec. 4, we will show how axiomatic properties of spaces described 
in terms of the contact relation correspond to separation conditions of their 
topological interpretations. To this end, the following conditions are especially 
relevant: 
$T_1$. A topological space $X$ is called a $T_1$ space, if for any two distinct points 
$\alpha, \beta$, there are $x, y \in \tau$ such that $\alpha \in x$, $\beta \notin x$ and $\beta \in y$, $\alpha \notin y$. This is 
equivalent to the fact that each singleton set is closed. 


T> (Hausdorff). X is called a To or Hausdorff space, if any two distinct 
points have disjoint open neighbourhoods. It is well known that each Ty space 
is a T space, and that each regular Tı space is a Th space. 


Regular. A space X is regular if every point α and every closed set not 
containing α are respectively included in disjoint open sets. 

It is well known (see e.g. Engelking, 1977) that X is regular, if and only 
if for each non-empty u ∈ τ and each α ∈ u there is some v ∈ τ such that 
α ∈ v ⊆ cl(v) ⊆ u. 


Semi-Regular. A space is semi-regular if it has a basis of regular open 
sets—i.e. every open set is a union of regular open sets. 
Regularity implies semi-regularity, but not vice versa. 


Weakly Regular. We call X weakly regular if it is semi-regular and for 
each non-empty u ∈ τ there is some non-empty v ∈ τ such that cl(v) ⊆ u. 
Weak regularity may be called a “pointless version” of regularity, and each 
regular space is weakly regular. 


Completely Regular. X is called completely regular, if for every closed x 
and every point α ∉ x there is a continuous function f : X → [0,1] such that 
f(β) = 0 for all β ∈ x, and f(α) = 1. 


Normal. X is called normal, if any two disjoint closed sets can be separated 
by disjoint open sets. 


Weakly Normal. X is called weakly normal, if any two disjoint regular 
closed sets can be separated by disjoint open sets. Weak normality was 
introduced as ‘κ-normality’ by Shchepin, 1972. 


For any T1 space X, the following entailments hold: 
X is normal 
⇒ X is weakly normal 
⇒ X is completely regular 
⇒ X is regular 
⇒ X is weakly regular 
⇒ X is semi-regular. 

None of these implications can be reversed (see Düntsch and Winter, 2005 
for examples). 

A space which is T1 and regular is called a T3 space and a space which is T1 
and normal is a T4 space. The various conditions Ti are successively stricter 
as i increases. Thus, T4 ⇒ T3 ⇒ T2 ⇒ T1. 








2.4.2 Regular sets and their algebras. A set x ⊆ X is called regular 
open if x = int(cl(x)), and regular closed if x = cl(int(x)). Clearly, 
the set complement of a regular open set is regular closed and vice versa. 
The collection of regular open sets (regular closed sets) will be denoted by 
RegOp(X) (RegCl(X)). It is well known (Koppelberg, 1989) that RegOp(X) 
and RegCl(X) can be made into (isomorphic) complete Boolean algebras by 
the operations 


RegOp(X):                     RegCl(X): 
x + y = int(cl(x ∪ y)),       x + y = x ∪ y, 
x · y = x ∩ y,                x · y = cl(int(x ∩ y)), 
−x = X \ cl(x),               −x = X \ int(x), 
0 = ∅,                        0 = ∅, 
1 = X,                        1 = X. 
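To see these operations at work, one may compute the regular open and regular closed sets of a small finite space. The sketch below assumes a hypothetical three-point space; all names (X, OPENS, ro_sum, rc_prod) are ours and not part of the cited sources.

```python
from itertools import chain, combinations

# Hypothetical three-point space (an assumption made for illustration).
X = frozenset({0, 1, 2})
OPENS = [frozenset(s) for s in ([], [0], [2], [0, 2], [0, 1, 2])]

def interior(x):
    """Union of all open sets contained in x."""
    return frozenset(chain.from_iterable(u for u in OPENS if u <= x))

def closure(x):
    """Complement of the interior of the complement of x."""
    return X - interior(X - x)

def subsets(s):
    """All subsets of s as frozensets."""
    items = sorted(s)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

REG_OPEN = [x for x in subsets(X) if x == interior(closure(x))]
REG_CLOSED = [x for x in subsets(X) if x == closure(interior(x))]

def ro_sum(x, y):
    """x + y in RegOp(X): int(cl(x ∪ y))."""
    return interior(closure(x | y))

def rc_prod(x, y):
    """x · y in RegCl(X): cl(int(x ∩ y))."""
    return closure(interior(x & y))
```

Here {0} and {2} are regular open, but their union {0, 2} is not, so the sum in RegOp(X) is the whole space. Dually, the regular closed sets {0, 1} and {1, 2} share the point 1 although their product in RegCl(X) is empty.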


RegOp(X) does not fully determine the topology on X: 


PROPOSITION 3.20 (VAKARELOV ET AL., 2002) If y is a dense subspace 
of X, then RegOp(X) ≅ RegOp(y). 


If we only want to consider the regular closed sets (or regular open sets), it 
suffices to look at semi-regular spaces. Let us call the topology r(τ) on X 
which is generated by RegOp(τ) the semi-regularisation of (X, τ). Then the 
following useful fact holds: 


PROPOSITION 3.21 (DÜNTSCH AND WINTER, 2005) Suppose that (X, τ) 
is a topological space. Then, RegOp(τ) = RegOp(r(τ)). 


Proof (See also Engelking, 1977, p. 84.) 
Let a ⊆ X. Then, 


cl_{r(τ)}(a) = −⋃{m ∈ RegOp(τ) : m ∩ cl_τ(a) = ∅} 
             ⊇ −⋃{m ∈ τ : m ∩ cl_τ(a) = ∅} = cl_τ(a). 

Let a ∈ RegOp(τ). Then, 


int_{r(τ)} cl_{r(τ)}(a) = int_{r(τ)}(−⋃{m ∈ RegOp(τ) : m ∩ cl_τ(a) = ∅}) 
    = ⋃{t ∈ RegOp(τ) : t ∩ m = ∅ for all m ∈ RegOp(τ) 
         with m ∩ cl_τ(a) = ∅} 
    = a, 


since a and t are regular open, and thus, t ⊆ cl_τ(a) implies t ⊆ a. 
Conversely, let a ∈ RegOp(r(τ)). Then, 


a = int_{r(τ)} cl_{r(τ)}(a) = ⋃{t ∈ RegOp(τ) : t ⊆ cl_{r(τ)}(a)}. 
Now, int_τ cl_τ(a) ∈ RegOp(τ), and thus, 
a ⊆ int_τ cl_τ(a) ⊆ int_{r(τ)} cl_{r(τ)}(a) = a. 


If a ∈ RegOp(τ), then, by the preceding consideration, −_τ a = −_{r(τ)} a, and 
thus, cl_τ(a) = cl_{r(τ)}(a). This implies the claim. QED 


2.4.3 Closure algebras and topologies. The study of topologies via 
the closure or interior operator is sometimes called pointless topology (see, for 
example, Johnstone, 1983). McKinsey and Tarski, 1944 had already shown that 
the closure algebras (as specified by Definition 3.8) give rise to the collection 
of closed sets of a topological space, by proving the following representation 
theorem: 


PROPOSITION 3.22 (MCKINSEY AND TARSKI, 1944) 


i) If (X, τ) is a topological space, then (2^X, cl) is a closure algebra, 
called the closure algebra over (X, τ). 


ii) If (B, f) is a closure algebra, then there is some T1 space (X, τ) such 
that (B, f) is a subalgebra of (2^X, cl). 


Dual statements hold for interior algebras and the topological int operator: 

PROPOSITION 3.23 

i) If (X, τ) is a topological space, then (2^X, int) is an interior algebra, 
called the interior algebra over (X, τ). 


ii) If (B, g) is an interior algebra, then there is some T1 space (X, τ) 
such that (B, g) is a subalgebra of (2^X, int). 


2.4.4 Heyting algebras and topologies. Another way of looking at 
these algebras is via a certain class of lattices: An algebra (A, +, ·, ⇒, 0, 1) 
of type (2, 2, 2, 0, 0) is called a Heyting algebra if (A, +, ·, 0, 1) is a bounded 
lattice, and ⇒ is the operation of relative complementation, such that if a, b ∈ A, 
then 


(3.33) a ⇒ b is the largest x ∈ A for which a · x ≤ b. 
In other words, 
(3.34) a · x ≤ b if and only if x ≤ a ⇒ b. 


In Rasiowa and Sikorski, 1963 such algebras are called pseudo-Boolean 
algebras. If a ∈ A, then its pseudo-complement a* is the element a ⇒ 0, i.e. a* 
is the largest x ∈ A for which a · x = 0. Heyting algebras form an equational 
class, i.e. a collection of algebras defined by a set of universally quantified 
equations (for details see Rasiowa and Sikorski, 1963 or Balbes and Dwinger, 
1974). Furthermore, if (B, g) is an interior algebra, then the collection O(B) 
of its open sets forms a Heyting algebra with ⇒ defined as 


(3.35) a ⇒ b = g(−a + b). 
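As a small sanity check of (3.34) and (3.35), one can take the interior algebra (2^X, int) of a finite space and verify the residuation property on its open sets by brute force. The sketch below assumes a hypothetical three-point space; the names X, OPENS, implies and pseudo_complement are ours.

```python
from itertools import chain

# Hypothetical three-point space (an assumption made for illustration).
X = frozenset({0, 1, 2})
OPENS = [frozenset(s) for s in ([], [0], [2], [0, 2], [0, 1, 2])]

def interior(x):
    """Union of all open sets contained in x."""
    return frozenset(chain.from_iterable(u for u in OPENS if u <= x))

def implies(a, b):
    """(3.35): a => b = int(-a + b), with -a the complement in the powerset."""
    return interior((X - a) | b)

def residuation_holds():
    """(3.34): a . x <= b iff x <= (a => b), for all open a, b, x."""
    return all(((a & x) <= b) == (x <= implies(a, b))
               for a in OPENS for b in OPENS for x in OPENS)

def pseudo_complement(a):
    """a* = (a => 0), the largest open set disjoint from a."""
    return implies(a, frozenset())
```

In this example the pseudo-complement of {0} is {2}, and {0} ∪ {2} is not the whole space, so the Heyting algebra of open sets is not Boolean.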


In view of Proposition 3.23 we now have the following representation theorem 
(McKinsey and Tarski, 1944): 


PROPOSITION 3.24 For each Heyting algebra A, there exists a T1 space X 
such that A is isomorphic to a subalgebra of the Heyting algebra of open sets 
of X. 


2.5 Proximity spaces 


Proximities were introduced by Efremovič, 1952. The intuitive meaning of 
a proximity Δ is that xΔy holds for some x, y ⊆ X, when x is close to y in 
some sense. Their axiomatisation is very similar to that of the Boolean contact 
algebras to be discussed in Sec. 4. The main source on proximity spaces is the 
monograph by Naimpally and Warrack, 1970. 

From the point of view of this investigation proximity spaces play a very 
useful role. On the one hand, the proximity approach is close to that of point set 
topology, and mappings between proximity spaces and corresponding 
topological spaces are well established. On the other hand the formulation of 
proximity spaces is based on a binary relation between point sets, whose meaning 
can be correlated with the contact relation that is taken as a primitive in many 
axiomatic and algebraic approaches to representing topological relationships 
between regions (which will be considered further in Sec. 3 below). Hence, 
proximity spaces provide a link between these axiomatic or algebraic formulations 
and point-set topological models of space. 

Formally, a binary relation Δ on the powerset of a set X is called a proximity, 
if it satisfies the following axioms for x, y, z ⊆ X: 


(P1) If x ∩ y ≠ ∅ then xΔy. 

(P2) If xΔy then x, y ≠ ∅. 

(P3) Δ is symmetric. 

(P4) xΔ(y ∪ z) if and only if xΔy or xΔz. 

(P5) If x(−Δ)y then x(−Δ)z and y(−Δ)(X \ z) for some z ⊆ X. 


DEFINITION 3.25 The pair (X, Δ) is called a proximity space. 


Sometimes the term proximity space has been used to include structures that 
do not satisfy axiom P5. Those satisfying P5 are sometimes called Efremovič 
proximity spaces (Efremovič, 1952). 


DEFINITION 3.26 A proximity is called separated if it satisfies 
(Psep) {α}Δ{β} implies α = β. 
Thus, in a separated proximity space, no two distinct singleton sets are related 
by the proximity relation. 


2.5.1 The topology associated with a proximity space. Each proxim- 
ity space determines a topology on X in the following way: we take the closure 
of any set x as the set of all points α, such that {α} is proximal to x: 


(3.36) cl(x) = {α ∈ X : {α}Δx}. 


PROPOSITION 3.27 (NAIMPALLY AND WARRACK, 1970) 
i) The operation of (3.36) defines the closure operator of a topology 
τ(Δ) on X (which is not necessarily T1). 
ii) (X, τ(Δ)) is a completely regular space. 
iii) If Δ is separated, then (X, τ(Δ)) is a T1 space. 
iv) xΔy if and only if cl(x)Δcl(y). 
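The proximity axioms and the induced closure (3.36) can be checked by brute force on a small example. The sketch below assumes a hypothetical proximity on a four-point set, in which two sets are near whenever they meet a common block of a fixed partition; this example and all names in it (BLOCKS, near, is_proximity) are ours.

```python
from itertools import combinations

# Hypothetical example: a four-point set split into two blocks.
X = frozenset(range(4))
BLOCKS = [frozenset({0, 1}), frozenset({2, 3})]
SUBSETS = [frozenset(c) for r in range(5) for c in combinations(sorted(X), r)]

def near(x, y):
    """The proximity relation: some block meets both x and y."""
    return any(b & x and b & y for b in BLOCKS)

def is_proximity():
    """Brute-force check of axioms P1-P5 over all subsets of X."""
    p1 = all(near(x, y) for x in SUBSETS for y in SUBSETS if x & y)
    p2 = all(x and y for x in SUBSETS for y in SUBSETS if near(x, y))
    p3 = all(near(x, y) == near(y, x) for x in SUBSETS for y in SUBSETS)
    p4 = all(near(x, y | z) == (near(x, y) or near(x, z))
             for x in SUBSETS for y in SUBSETS for z in SUBSETS)
    p5 = all(any(not near(x, z) and not near(y, X - z) for z in SUBSETS)
             for x in SUBSETS for y in SUBSETS if not near(x, y))
    return p1 and p2 and p3 and p4 and p5

def closure(x):
    """(3.36): all points whose singleton is near x."""
    return frozenset(p for p in X if near(frozenset({p}), x))

def is_separated():
    """(Psep): near singletons are equal."""
    return all(p == q for p in X for q in X
               if near(frozenset({p}), frozenset({q})))
```

In this example the closure of {0} is the block {0, 1}, so the induced topology is not T1, which agrees with the fact that the proximity is not separated.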


A proximity which is relevant to our investigation is the standard proximity 
on a normal T1 space X (Naimpally and Warrack, 1970). For x, y ⊆ X, let 


(3.37) xΔy ⟺ cl(x) ∩ cl(y) ≠ ∅. 

Observe that Δ is separated, since X is a T1 space and thus singletons are 
closed. 


3. Contact relations 


The relation of ‘contact’ is fundamental to the spatial description of con- 
figurations of objects or regions. Contact relations have been studied in the 
context of qualitative approaches to geometry going back as far as the work of 
de Laguna, 1922, Nicod, 1924, Whitehead, 1978 and subsequently of Clarke, 
1981. More recently the contact relation has been employed as a funda- 
mental primitive in the field of Qualitative Spatial Reasoning (Randell et al., 
1992b; Borgo et al., 1996; Cohn et al., 1997; Pratt and Schoop, 1998; Pratt and 
Schoop, 2000; Stell, 2000; Düntsch et al., 1999; Düntsch et al., 2001a; and see 
also Sec. 5 below). This has emerged as a significant sub-field of Knowledge 
Representation, which is itself a major strand of research in Artificial Intelli- 
gence. (In AI and Qualitative Spatial Reasoning, the contact relation is often 
called ‘connection’. In the present work we use contact to avoid confusion with 
the slightly different notion of ‘connection’ employed in topology.) 

The contact relation can be seen as a weaker and more fine-grained cousin of 
the ‘overlap relation’, which is straightforwardly defined in terms of the ‘part 
of’ relation, thus: xOy =def ∃z[zPx ∧ zPy]. The properties of the parthood 
relation were first formalised by Leśniewski, 1931, as the basic relation of his 
Mereology (see also Leśniewski, 1983; Leśniewski, 1992). 





DEFINITION 3.28 A contact relation C is a relation satisfying the following 
axioms: 


(C1) ∀x[xCx] (Reflexivity), 
(C2) ∀xy[xCy → yCx] (Symmetry), 
(C3) ∀xy[∀z[zCx ↔ zCy] → x = y] (Extensionality). 


These axioms correspond to axioms A0.1 and A0.2 given by Clarke, 1981 for 
the mereological part of his calculus of individuals. In theories whose domain 
includes an empty/null region, C1 is normally weakened to ∀x[x = 0 ∨ xCx]. 

Our main interest will be contact relations which are defined on open or 
closed sets of a topological space. Primary examples are collections of 
non-empty regular closed (or regular open) sets of some topological space X. 

If we identify regions with elements of RegCl(X), it is natural to define C 
as the relation that holds just in case two regions share at least one point: 


(3.38) xCy ⟺ x ∩ y ≠ ∅, 

whereas, if our domain of regions is RegOp(X), it is usual to define C as 
holding whenever the closures of two regions share a point: 


(3.39) xCy ⟺ cl(x) ∩ cl(y) ≠ ∅. 


It is easy to see that these interpretations fulfil the contact relation axioms 
C1-3. In the sequel, they will be called the standard contact relations on 
RegCl(X) and RegOp(X) respectively. 

It is often useful to consider contact relations over other, more specific 
domains. Take, for example, the set D of all closed discs in the Euclidean plane, 
and define C by (3.38). Then, C obviously is a contact relation on D. 

When describing properties of the C relation, it is often convenient to refer 
to the set of all regions connected to a given region. Thus, we define 


(3.40) C(x) = {y | xCy}. 
In terms of this notation, the extensionality axiom can be stated as: 


(3.41) ∀xy[(C(x) = C(y)) ↔ x = y]. 


Many other useful relations can be defined in terms of contact (see Clarke, 
1981; Randell et al., 1992b and Sec. 5.1 below). A particularly important definable 
relation is that which is normally interpreted as the part relation: 


(3.42) xPy =def ∀z[zCx → zCy]. 


This definition (by itself) ensures that P is reflexive and transitive—i.e. it 
is a pre-order. And if we assume the extensionality of C (i.e. C3) it can be 
proved that P is antisymmetric, so that it must be a partial order. 
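The role of extensionality here can be seen in a deliberately coarse example. The sketch below assumes a hypothetical domain of closed intervals [i, j] with integer endpoints 0 ≤ i < j ≤ 3, with contact meaning that two intervals share a point; the domain and the names REGIONS, contact and part are ours.

```python
# Hypothetical domain: closed intervals [i, j] with 0 <= i < j <= 3.
REGIONS = [(i, j) for i in range(4) for j in range(i + 1, 4)]

def contact(x, y):
    """Two closed intervals are in contact if they share a point."""
    return max(x[0], y[0]) <= min(x[1], y[1])

def part(x, y):
    """(3.42): every region in contact with x is also in contact with y."""
    return all(contact(z, y) for z in REGIONS if contact(z, x))

def is_preorder():
    """P is reflexive and transitive on REGIONS."""
    reflexive = all(part(x, x) for x in REGIONS)
    transitive = all(part(x, z)
                     for x in REGIONS for y in REGIONS for z in REGIONS
                     if part(x, y) and part(y, z))
    return reflexive and transitive

def antisymmetry_failures():
    """Pairs of distinct regions that are parts of each other."""
    return [(x, y) for x in REGIONS for y in REGIONS
            if x < y and part(x, y) and part(y, x)]
```

In this domain the intervals [0, 2] and [0, 3] are in contact with exactly the same regions, so C3 fails and P is only a pre-order; a richer domain is needed before P behaves like parthood.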

The C relation is a very expressive primitive for defining topological rela- 
tionships between regions. In terms of C the following useful relations can be 
defined. These definitions have been used to define the relational vocabulary of 
the well-known Region Connection Calculus, which will be discussed further 
in Sec. 5 below. 





(3.43) xPPy =def xPy ∧ ¬yPx                    x is a Proper Part of y 
(3.44) xOy =def ∃z[zPx ∧ zPy]                  x Overlaps y 
(3.45) xDRy =def ¬xOy                          x is DiscRete from y 
(3.46) xDCy =def ¬xCy                          x is DisConnected from y 
(3.47) xECy =def xCy ∧ ¬xOy                    x is Externally Connected to y 
(3.48) xPOy =def xOy ∧ ¬xPy ∧ ¬yPx             x Partially Overlaps y 
(3.49) xEQy =def xPy ∧ yPx                     x is Equal to y 
(3.50) xTPPy =def xPPy ∧ ∃z[zECx ∧ zECy]       x is a Tangential Proper Part of y 
(3.51) xNTPPy =def xPPy ∧ ¬∃z[zECx ∧ zECy]     x is a Non-Tangential Proper Part of y 
(3.52) xTPPIy =def yTPPx                       x is an Inverse Tangential Proper Part of y 
(3.53) xNTPPIy =def yNTPPx                     x is an Inverse Non-Tangential Proper Part of y 


In the presence of the extensionality axiom, (3.49) is equivalent to simply x = y. 
It should be noted that for the defined relations to have their intuitive meaning, 
one should not include in the domain a ‘null’ region that is not connected to 
any other region. If such a null region is present, it would be part of every 
other region. Consequently xOy would hold for all x and y, and other relations 
defined in terms of O would also have counter-intuitive interpretations. 

Under typical interpretations of the C relation (not including a null region 
in the domain), the relations defined by (3.46)-(3.53) form a jointly exhaustive 
and pairwise-disjoint partition of possible relations between any two spatial 
regions (i.e. every two regions satisfy exactly one of the relations). This set of 
eight relations introduced in Randell et al., 1992b is often known as RCC-8, 
and is widely referred to in the AI literature on Qualitative Spatial Reasoning 
(see also Sec. 5.1 below). 
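For the domain of closed discs with contact defined by (3.38), the eight relations can be read off from the centres and radii. The following sketch is our own illustration under that assumption: discs are hypothetical triples (x, y, r) with integer coordinates and positive integer radius, so that all comparisons are exact, and the function name rcc8 is ours.

```python
def rcc8(d1, d2):
    """Classify two closed discs (x, y, r), r > 0, into one RCC-8 relation."""
    (x1, y1, r1), (x2, y2, r2) = d1, d2
    dist2 = (x1 - x2) ** 2 + (y1 - y2) ** 2   # squared distance of the centres
    if dist2 > (r1 + r2) ** 2:
        return "DC"                           # no common point
    if dist2 == (r1 + r2) ** 2:
        return "EC"                           # exactly one common boundary point
    if d1 == d2:
        return "EQ"
    if dist2 > (r1 - r2) ** 2:
        return "PO"                           # interiors meet, neither contains the other
    # Here one disc lies inside the other (centre distance <= |r1 - r2|).
    tangential = dist2 == (r1 - r2) ** 2      # the boundaries touch
    if r1 < r2:
        return "TPP" if tangential else "NTPP"
    return "TPPI" if tangential else "NTPPI"
```

Since the function returns exactly one label for every pair of discs, it reflects the jointly exhaustive and pairwise disjoint character of the eight relations on this domain.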


3.1 Contact relation algebras 


If C is taken to be a relation in a relation algebra, the properties C1-3 of the 
contact relation correspond to the following relation algebraic conditions: 


(CRA1) 1' ≤ C (Reflexivity), 
(CRA2) C = C˘ (Symmetry), 
(CRA3) [−(C ; −C) ∩ −(C ; −C)˘] ≤ 1' (Extensionality). 





DEFINITION 3.29 A relation algebra generated from a single relation C sat- 
isfying conditions CRA1-3 will be called a contact relation algebra (CRA). 


Contact Relation Algebras (CRAs) were introduced and studied in Düntsch 
et al., 1999, where many fundamental properties are demonstrated. CRAs 
provide a rich language within which many other useful topological relations 
can be defined. In the relation algebra setting, the part relation has the following 
definition: 


(3.54) P =def −(C ; −C) 

Many other relations are relationally definable from C. Indeed all the 
relations that were defined above using 1st-order logic can also be defined using 
the algebraic operators of relation algebra: 


(3.55) PP =def P ∩ −1'                 proper part of 
(3.56) O =def P˘ ; P                   overlap 
(3.57) DR =def −O                      discrete 
(3.58) DC =def −C                      disconnected 
(3.59) EC =def C ∩ −O                  external contact 
(3.60) PO =def O ∩ −(P ∪ P˘)           partial overlap 
(3.61) EQ =def P ∩ P˘ (= 1')           equality 
(3.62) TPP =def PP ∩ (EC ; EC)         tangential proper part 
(3.63) NTPP =def PP ∩ −TPP             non-tangential proper part 
(3.64) TPPI =def TPP˘                  tangential proper part inv. 
(3.65) NTPPI =def NTPP˘                non-tangential proper part inv. 

In view of Proposition 3.18, this comes as no surprise, since RAs capture 
exactly those 1st-order properties of C that can be expressed with up to three 
variables, and this is sufficient for all the definitions given above. 
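The relation-algebraic definitions can be evaluated directly when relations are represented as finite sets of pairs. The sketch below assumes a hypothetical domain of short arcs on a cycle of five points, each arc given by the set of points it covers, with contact meaning a shared point; the domain and all Python names (comp, conv, neg, and so on) are ours.

```python
from itertools import product

# Hypothetical domain: arcs covering 2 or 3 consecutive points of a 5-cycle.
N = 5
REGIONS = [frozenset((i + k) % N for k in range(l + 1))
           for i in range(N) for l in (1, 2)]
FULL = set(product(REGIONS, repeat=2))
IDENT = {(x, x) for x in REGIONS}

def comp(r, s):
    """Relational composition r ; s."""
    return {(x, z) for (x, y) in r for (y2, z) in s if y == y2}

def conv(r):
    """Converse of r."""
    return {(y, x) for (x, y) in r}

def neg(r):
    """Complement of r relative to the full relation."""
    return FULL - r

C = {(x, y) for (x, y) in FULL if x & y}
P = neg(comp(C, neg(C)))          # (3.54)
PP = P - IDENT                    # (3.55)
O = comp(conv(P), P)              # (3.56)
DC = neg(C)                       # (3.58)
EC = C - O                        # (3.59)
PO = O - (P | conv(P))            # (3.60)
TPP = PP & comp(EC, EC)           # (3.62)
NTPP = PP - TPP                   # (3.63)
ATOMS = [IDENT, DC, EC, PO, TPP, conv(TPP), NTPP, conv(NTPP)]
```

On this domain C satisfies CRA1-3, P coincides with inclusion of point sets, and the eight relations in ATOMS partition the full relation. NTPP happens to be empty here, since every proper part reaches the boundary of the larger arc.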

Depending on the base set, some of these relations might be empty or 
coincide. If, for example, B is a BA, and xCy ⟺ x · y ≠ 0, then C coincides 
with the overlap relation, and EC = ∅. A picture of some of these relations 
over the domain D of (non-empty) closed discs is given in Fig. 3.1. 

It turns out that the relations 


(3.66) 1', DC, PO, EC, TPP, TPP˘, NTPP, NTPP˘ 


are the atoms of the relation algebra D_c generated by C over D, henceforth 
called the (closed) disc relations. (The ‘composition table’ for the RCC-8 
relations over the domain D will be given in Table 6.2 below.) 


4. Boolean contact algebras 


While the contact relations of Sec. 3 did not assume a particular algebraic 
structure on the base set, we will often be interested in cases where the set of 
regions has further structure; and, in particular, we will often want to consider 
the set of regions as having the structure of a Boolean algebra. 

A 1st-order theory intended to model topological properties of regions, the 
Region Connection Calculus (RCC), was introduced by Randell et al., 
1992b, and has since gained popularity in the spatial reasoning 
community; we will examine the RCC more closely in Sec. 5. First, we will consider 
a more general class of structures: 

Figure 3.1. Topological relations on the domain of closed discs. 


DEFINITION 3.30 A Boolean contact algebra is a pair (B, C), such that B is 
a non-trivial (i.e. 0 ≠ 1) Boolean algebra, and C is a binary relation on B, 
called a contact relation, with the following properties: 


(BCA0) aCb implies a, b ≠ 0 
(BCA1) a ≠ 0 implies aCa 
(BCA2) C is symmetric. 
(BCA3) (Compatibility) aCb and b ≤ c implies aCc 
(BCA4) aC(b + c) implies aCb or aCc 

While axioms BCA0-4 characterise the properties of Boolean contact 
algebras in general, we shall often be interested in BCAs that satisfy additional 
axioms. In particular, we shall be interested in the following axioms: 


(BCA5) (Extensionality) C(a) ⊆ C(b) implies a ≤ b 
(BCA6) (Interpolation) If a(−C)b there is some c such that a(−C)c and −c(−C)b 
(BCA7) (Connection) a ∉ {0,1} implies aC−a 

Figure 4.2. Illustration of the interpolation axiom. 


A BCA which satisfies BCA5 and BCA7 will be called an RCC algebra, 
since these axioms are satisfied by the 1st-order Region Connection Calculus 
theory proposed by Randell et al., 1992b (which will be considered in further 
detail in Sec. 5 below). 
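On a finite Boolean algebra the axioms can be tested exhaustively. The sketch below assumes the powerset of a three-element set as B and compares two hypothetical candidates for C: plain overlap, and a relation in which neighbouring atoms also touch. All names are ours, and BCA6 is not checked.

```python
from itertools import combinations

ATOMS = (0, 1, 2)
TOP = frozenset(ATOMS)
B = [frozenset(c) for r in range(4) for c in combinations(ATOMS, r)]

def overlap(a, b):
    """a C b iff a and b have a non-zero product."""
    return bool(a & b)

def path_contact(a, b):
    """Hypothetical contact: atoms i, j touch when |i - j| <= 1."""
    return any(abs(i - j) <= 1 for i in a for j in b)

def check(C):
    """Numbers of the axioms among BCA0-5 and BCA7 that C satisfies on B."""
    def cset(a):
        return {z for z in B if C(a, z)}
    ax = {
        0: all(a and b for a in B for b in B if C(a, b)),
        1: all(C(a, a) for a in B if a),
        2: all(C(a, b) == C(b, a) for a in B for b in B),
        3: all(C(a, c) for a in B for b in B for c in B
               if C(a, b) and b <= c),
        4: all(C(a, b) or C(a, c) for a in B for b in B for c in B
               if C(a, b | c)),
        5: all(a <= b for a in B for b in B if cset(a) <= cset(b)),
        7: all(C(a, TOP - a) for a in B if a and a != TOP),
    }
    return {k for k, ok in ax.items() if ok}
```

Both candidates satisfy BCA0-4. Overlap additionally satisfies BCA5 but not BCA7, whereas the second relation satisfies BCA7 but not BCA5, so neither yields an RCC algebra on this finite B.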

Clearly, C is a contact relation in the sense of Sec. 3, and therefore, all 
relations specified by the definitional formulae (3.42)-(3.51) are at our disposal. 
It is easy to see that 


(3.67) BCA5 ⟺ P is the Boolean order, 
(3.68) BCA6 ⟺ (∀x,y)[xNTPPy → (∃z)[xNTPPz ∧ zNTPPy]], 
(3.69) xOy ⟺ x · y ≠ 0. 


Simple structural properties include 
PROPOSITION 3.31 Let (B, C) be a BCA: 
i) (DÜNTSCH AND WINTER, 2004) O is the smallest contact relation 
on B. 


ii) (DÜNTSCH AND WINTER, 2004) B is a finite-cofinite algebra if 
and only if O is the only contact relation on B that satisfies BCA5. 

iii) (DÜNTSCH ET AL., 2001B) If C satisfies BCA5 and BCA7, then 
B is atomless. 


4.1 Interpretations of BCAs 


As intended, the regions and relations of the BCA theory can be interpreted in 
terms of classical point-set topology. In fact, there are two dual interpretations 
that are equally reasonable. 

Closed Interpretation: 

• A region is identified with a regular closed set of points. 
• Regions are connected if they share at least one point. 
• Regions overlap if their interiors share at least one point. 

Open Interpretation: 

• A region is identified with a regular open set of points. 
• Regions are connected if their closures share at least one point. 
• Regions overlap if they share at least one point. 

The axioms for C translate into topological properties as follows: 


PROPOSITION 3.32 (DÜNTSCH AND WINTER, 2005) Suppose that (X, τ) 
is a topological space, and that C_τ is the standard contact relation on RegCl(X). 


i) C_τ satisfies BCA0-4. 
ii) C_τ satisfies BCA5 if and only if X is weakly regular. 
iii) C_τ satisfies BCA6 if and only if X is weakly normal. 
iv) C_τ satisfies BCA7 if and only if X is connected. 
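Clauses i) and iv) can be observed on small finite spaces. The sketch below is our own illustration: it assumes two hypothetical spaces, a connected three-point space and the discrete two-point space, and checks BCA0-4 and BCA7 for the standard contact on their regular closed sets. The helper names are ours.

```python
from itertools import chain, combinations

def make_space(points, opens):
    """Return (X, closure, RegCl(X), connected?) for a finite space."""
    X = frozenset(points)
    opens = [frozenset(u) for u in opens]

    def interior(x):
        return frozenset(chain.from_iterable(u for u in opens if u <= x))

    def closure(x):
        return X - interior(X - x)

    subsets = [frozenset(c) for r in range(len(X) + 1)
               for c in combinations(sorted(X), r)]
    reg_closed = [x for x in subsets if x == closure(interior(x))]
    connected = not any(u and v and not (u & v) and (u | v) == X
                        for u in opens for v in opens)
    return X, closure, reg_closed, connected

def satisfies_bca0_4(R):
    """Standard contact 'share a point' on the regular closed sets R."""
    def C(a, b):
        return bool(a & b)
    return (all(a and b for a in R for b in R if C(a, b))
            and all(C(a, a) for a in R if a)
            and all(C(a, b) == C(b, a) for a in R for b in R)
            and all(C(a, c) for a in R for b in R for c in R
                    if C(a, b) and b <= c)
            and all(C(a, b) or C(a, c) for a in R for b in R for c in R
                    if C(a, b | c)))

def satisfies_bca7(X, closure, R):
    """Every a other than 0, 1 shares a point with its complement cl(X minus a)."""
    return all(bool(a & closure(X - a)) for a in R if a and a != X)

SPACE_A = make_space([0, 1, 2], [[], [0], [2], [0, 2], [0, 1, 2]])
SPACE_B = make_space([0, 1], [[], [0], [1], [0, 1]])
```

In the first space, BCA7 holds and the space is connected; in the discrete space both fail, in line with clause iv).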


In fact the BCA axioms are also satisfied by dense subalgebras of RegCl(X). 
Hence, Proposition 3.32 can be generalised: 

PROPOSITION 3.33 Suppose that (X, τ) is a topological space, and C_τ is the 
standard contact relation on some dense subalgebra of RegCl(X); then each 
of the clauses i-iv of Proposition 3.32 holds for C_τ. 


The preceding propositions give us many examples of BCAs. We would 
like to mention a countable example of a BCA which is, in some sense, one 
dimensional; in particular, this algebra is not complete (i.e. does not contain 
infinite sums of its elements). Suppose that L is the ordered set of non-negative 
rational numbers enhanced by a greatest element ∞. Let B be the collection 
of all finite unions of left-closed, right-open intervals of L, together with the 
empty set. It is well known (Koppelberg, 1989) that B is a Boolean subalgebra 
of 2^L, called the interval algebra of L, and that each a ∈ B⁺ has a unique 
representation as 


(3.70) a = [x0, y0) ∪ ... ∪ [xn, yn), 


where x0 < y0 < x1 < y1 < ... < xn < yn. The set {xi : i ≤ n} ∪ 
{yi : i ≤ n} is called the set of relevant points of a, denoted by rel(a). If we 
define C on B⁺ by 


(3.71) aCb ⟺ (a ∩ b) ∪ (rel(a) ∩ rel(b)) ≠ ∅, 


then (B, C) is a BCA which satisfies BCA6 and BCA7 (Düntsch and Winter, 
2004). Other constructions of countable BCAs can be found in Li et al., 2005. 
In Sections 5 and 5.1 we will present BCAs arising from spatial theories. 
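The contact relation (3.71) is easy to compute once elements are brought into the unique form (3.70). The sketch below assumes that an element is given as a list of pairs (x, y) standing for [x, y), with `math.inf` playing the role of ∞; this representation and the function names are ours.

```python
from fractions import Fraction
from math import inf

def normalise(intervals):
    """Merge half-open intervals [x, y) into the form x0 < y0 < x1 < ... < yn."""
    out = []
    for x, y in sorted(i for i in intervals if i[0] < i[1]):
        if out and x <= out[-1][1]:            # overlapping or adjacent: merge
            out[-1] = (out[-1][0], max(out[-1][1], y))
        else:
            out.append((x, y))
    return out

def rel(a):
    """Relevant points: all endpoints of the normalised representation."""
    return {p for interval in normalise(a) for p in interval}

def intersects(a, b):
    """True if a and b have a common point."""
    return any(max(x1, x2) < min(y1, y2)
               for x1, y1 in normalise(a) for x2, y2 in normalise(b))

def contact(a, b):
    """(3.71): a and b intersect or share a relevant point."""
    return intersects(a, b) or bool(rel(a) & rel(b))
```

For example, [0, 1/2) and [1/2, 1) are disjoint but in contact through the relevant point 1/2. Note that normalisation matters: [0, 1) ∪ [1, 2) equals [0, 2), whose relevant points are 0 and 2 only.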

We now exhibit some constructions that allow us to obtain new BCAs from 
old (these were described in Düntsch and Winter, 2004): 


PROPOSITION 3.34 (ADDING AN ultra-contact) Given any atomless 
BCA (B, C) it is possible to augment the connection relation by picking any 
two ultrafilters F and G of the algebra and stipulating that C(f, g) for any two 
regions f and g, where f ∈ F and g ∈ G. In formal terms this means that 
(B, C') is a BCA where 


(3.72) C' = C ∪ (F × G) ∪ (G × F). 


More generally, for a contact relation C, let R_C = {(F, G) : F × G ⊆ C}, 
and, for a reflexive and symmetric relation R on Ult(B), set C_R = ⋃{F × G : 
(F, G) ∈ R}. 


PROPOSITION 3.35 


i) (DÜNTSCH AND VAKARELOV, 2006) C_R satisfies BCA0-4. 


ii) (DÜNTSCH AND WINTER, 2006) If R is a reflexive and symmetric 
relation on Ult(B) which is closed in the product topology of Ult(B) × 
Ult(B), then C_R satisfies BCA0-4. 


iii) (DÜNTSCH AND WINTER, 2006) The collection of all relations on 
B that satisfy BCA0-4 can be made into an atomistic complete 
co-Heyting algebra in which join is set union. 


iv) (DÜNTSCH AND WINTER, 2006) The collection of all relations on 
B that satisfy BCA0-4 and BCA6 is isomorphic to the lattice of closed 
equivalence relations on Ult(B). 

PROPOSITION 3.36 (RESTRICTION AND EXTENSION) If A is a dense 
subalgebra of B, then the restriction of C to A is a contact relation on A which 
satisfies BCA7 if B does. 

If B is a dense subalgebra of A, then the relation C' defined on A by 


aC'b ⟺ (∀s, t ∈ B)[a ≤ s and b ≤ t ⇒ sCt] 


is a contact relation on A, and, if C satisfies BCA7, so does C'. Furthermore, 
C' is the largest contact relation on A whose restriction to B is C. 


4.2 Representation theorems for BCAs 


Theorems that characterise the class of models of a given axiomatic theory 
are known as representation theorems. In most cases, such theorems are sought 
after for one (or both) of the following reasons: 


a) to find an axiomatisation for a given class of structures, 


b) to show that a given axiom system is complete for an intended class of 
models. 


Famous representation results include Cayley’s theorem that every group is 
isomorphic to a group of permutations, and Stone’s theorem which shows that 
each Boolean algebra is isomorphic to an algebra of sets. If an axiom system has 
models outside an intended class of models, the existence of such non-standard 
models shows that the system is incomplete with respect to that intended class. 
In the sequel, we will exhibit both positive and negative representation results 
for contact relations in topological spaces. 

Apart from the earlier topological representation results of Roeper, 1997 and 
Mormann, 1998, which do not result in the standard topological contact, the first 
‘standard’ representation result for a class of contact algebras was discovered by 
Vakarelov et al., 2001. It utilises the theory of proximity spaces which have been 
briefly described in Sec. 2. Subsequently, making use of similar techniques, 
topological representation results were obtained for BCAs (Düntsch and Winter, 
2005). 


4.2.1 Constructing a topology to represent a BCA. The proof of the 
representation result takes a form similar to that of Stone’s theorem. The plan 
is to devise a way to use the elements of a BCA to construct entities that can be 
correlated with points in a topological or proximity space. However, instead of 
taking ultrafilters as the base set for the topology (as is done in Stone’s theorem), 
a somewhat different construction is required to generate suitable sets of regions 
that can be identified with ‘points’ in a proximity space or topological model. 
We begin with the following definition: 


DEFINITION 3.37 A non-empty subset Γ of B is called a clan if, for all x, y ∈ 
B, we have: 


CL1) If x, y ∈ Γ then xCy. 
CL2) If x + y ∈ Γ then x ∈ Γ or y ∈ Γ. 
CL3) If x ∈ Γ and x ≤ y, then y ∈ Γ. 


A clan can be regarded as a set of regions which share at least one point of 
mutual contact. The difference from a Boolean filter arises because regions may 
share a point of contact even though their intersection is empty. Moreover, as 
is illustrated in Fig. 4.3, even where regions do have a non-empty intersection, 
the regions may have a point of contact that is not in this intersection. 

Figure 4.3. Illustration of why clans are not closed under intersection. 


DEFINITION 3.38 A clan Γ that is maximal (i.e. there is no clan Γ' such that 
Γ ⊂ Γ') will be called a cluster. The set of all clusters in B will be denoted by 
Clust(B). Clearly, every clan is contained in some cluster. 


Since clusters will represent points in a topological space, each region will 
be associated with a set of clusters. Hence, to construct the topological repre- 
sentation of a BCA, we need to find a suitable mapping from the elements of 
the BCA to sets of clusters. Again the construction is similar to that used in the 
Stone theorem. 

We define a mapping h : B → 2^Clust(B) by 


(3.73) h(a) = {Γ ∈ Clust(B) : a ∈ Γ}. 


In Düntsch and Winter, 2005 it was shown that for any BCA with domain 
B, we can specify a topology (Clust(B), τ_B), determined by h. This is done 
by taking {h(x) : x ∈ B} as a basis for the closed sets of (Clust(B), τ_B). In 
other words the open sets τ_B are arbitrary unions of sets whose complements 
are in the range of h: 


τ_B = { ⋃{Clust(B) \ h(x) : x ∈ S} : S ⊆ B }. 


LEMMA 3.39 (DÜNTSCH AND WINTER, 2005) The following properties 
of (Clust(B), τ_B) hold. 


i) The range of h(x) for x ∈ B is a dense subalgebra A_B of the regular 
closed algebra over (Clust(B), τ_B). 


ii) h preserves the Boolean structure of B in A_B (i.e. h is a Boolean 
homomorphism from B to A_B). 


iii) For all a, b ∈ B, aCb if and only if h(a) ∩ h(b) ≠ ∅. 


iv) (Clust(B), τ_B) is a weakly regular T1 topology (which is not 
necessarily T2). 
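For a finite algebra, clans and clusters can be enumerated by brute force, which makes clause iii) of the lemma tangible. The sketch below assumes the powerset of a three-element set as B, with a hypothetical contact relation in which neighbouring atoms touch (it satisfies BCA0-4 but not BCA5); all names are ours.

```python
from itertools import combinations

ATOMS = (0, 1, 2)
B = [frozenset(c) for r in range(4) for c in combinations(ATOMS, r)]

def contact(a, b):
    """Hypothetical contact: atoms i, j touch when |i - j| <= 1."""
    return any(abs(i - j) <= 1 for i in a for j in b)

def is_clan(g):
    """Check CL1-CL3 for a family g of elements of B."""
    cl1 = all(contact(x, y) for x in g for y in g)
    cl2 = all(x in g or y in g for x in B for y in B if (x | y) in g)
    cl3 = all(y in g for x in g for y in B if x <= y)
    return bool(g) and cl1 and cl2 and cl3

# All families of elements of B (2^8 = 256 candidates).
FAMILIES = [frozenset(c) for r in range(len(B) + 1) for c in combinations(B, r)]
CLANS = [g for g in FAMILIES if is_clan(g)]
CLUSTERS = [g for g in CLANS if not any(g < other for other in CLANS)]

def h(a):
    """(3.73): the set of clusters that contain a."""
    return {g for g in CLUSTERS if a in g}
```

This example has five clans and two clusters, the latter consisting of all regions containing atom 0 or 1, and all regions containing atom 1 or 2. The atoms {0} and {2} are not in contact, and accordingly h({0}) and h({2}) are disjoint.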


Together, these properties give us the following representation theorem: 


PROPOSITION 3.40 Each BCA (B, C) is isomorphic to a dense substructure 
of some regular closed algebra (RegCl(X), C_τ), where τ is a weakly regular 
T1 topology, and C is the restriction of C_τ to B. 


Moreover, from Propositions 3.32 and 3.33, we immediately have the 
following result which tells us that the correspondence is bijective: 


PROPOSITION 3.41 If (X, τ) is a weakly regular T1 space, and B is a dense 
subalgebra of RegCl(X) with C being the restriction of the standard contact 
on RegCl(X), then (B, C) is a BCA. 


As a consequence of this result we obtain 


PROPOSITION 3.42 The axioms of BCAs are complete with respect to the 
class of dense substructures of regular closed algebras of weakly regular T1 
spaces with standard contact. 


4.2.2 The extensionality axiom. The theorems stated in the last section 
concern BCAs satisfying the axioms BCA0-5, i.e. the general BCA theory 
together with the extensionality axiom. For certain purposes, in particular the 
modelling of discrete space, one may wish to remove the extensionality 
condition (Galton, 1999). The resulting very general BCAs will not be considered 
further here; however, representation results for general BCAs have been 
obtained by Düntsch and Vakarelov, 2006 and Dimov and Vakarelov, 2006. 

4.2.3 The connection axiom. The connection axiom, BCA7, states 
that every region, except 0 and 1, is connected to its own complement: 


a ∉ {0,1} implies aC−a. 


Suppose BCA7 is false for a BCA with domain B; then there are regions 
a, b ∈ B such that a, b ≠ 0, a + b = 1 and a(−C)b. Because the mapping h 
preserves Boolean identities and the contact relation, we must have regular closed 
regions h(a) and h(b) in (Clust(B), τ_B) such that h(a) + h(b) = Clust(B) and 
h(a) ∩ h(b) = ∅. Therefore, (Clust(B), τ_B) must be a disconnected space. 

Conversely, it can be shown that if (Clust(B), τ_B) is a connected topological 
space, then the BCA B must satisfy the axiom BCA7. This is a bit more difficult 
to demonstrate, but is proved in Düntsch and Winter, 2005. (Since elements of 
the BCA form only a dense subalgebra of RegCl((Clust(B), τ_B)), we cannot 
necessarily associate an arbitrary regular closed subset of Clust(B) with an 
element of the BCA from which the topology was constructed. This means that 
mapping topological constraints to BCA axioms often requires detailed analysis 
of the cluster construction.) Thus we have the following representation theorem 
for RCC algebras, i.e. BCAs satisfying axioms BCA0-5 and BCA7: 


PROPOSITION 3.43 The axioms for RCC algebras are complete with respect 
to the class of dense substructures of regular closed algebras of connected 
weakly regular T1 spaces with standard contact. 


4.2.4 Saturated clusters and the interpolation axiom. We now 
consider the effect of the interpolation axiom, BCA6. Recall that this stipulates 
that 

∀xy[x(−C)y → (∃z)[x(−C)z ∧ −z(−C)y]]. 


This is a separation condition ensuring that for any two disconnected regions in 
the algebra, we can find a third region disconnected from the first and including 
the second as a non-tangential part. 

We shall later see that we can establish a correspondence between BCAs 
satisfying BCA6 and proximity spaces. In order to do this we show that in 
the presence of this condition, the clusters derived from the algebra exhibit 
a property called saturation, which leads to a natural ‘well-behaved’ 
structure of the set of clusters. 





DEFINITION 3.44 A clan Γ is called saturated iff it satisfies the following
condition:
(P) If xCy for every y ∈ Γ, then x ∈ Γ.


If a clan Γ over B is saturated then for any x ∈ B such that x ∉ Γ there is
some y ∈ Γ such that ¬(xCy). Therefore, Γ ∪ {x} is not a clan. So Γ must be a
maximal clan. Thus we have the following lemma (Düntsch and Winter, 2005):


LEMMA 3.45 Every saturated clan is a cluster.


In formulating the proximity representation theorem for BCAs, clusters
corresponding to saturated clans will be taken as the points of a proximity
space. Hence, we use the following terminology:


DEFINITION 3.46 A cluster that is a saturated clan will be called a proximity 
cluster, or more briefly a p-cluster. 


Intuitively, each p-cluster can be interpreted as the set of all regions in the 
BCA that contain a particular point in a corresponding proximity space. How- 
ever, for BCAs in general, not every cluster need be a p-cluster. The following 
example of a BCA which includes clusters that are not p-clusters is given in 
Düntsch and Winter, 2005.

Suppose that B is the interval algebra whose elements are finite unions of
left closed, right open intervals [x, y) on the rational unit interval [0, 1). Let
C(i, j) hold between elements just in case their closures share a point. Now,
let a, b be points such that 0 < a < b < 1. If F_a is the ultrafilter of B of all
sets containing a, and F_b is the ultrafilter of B of all sets containing b, then, by
Proposition 3.34, the relation C′ = C ∪ (F_a × F_b) ∪ (F_b × F_a) is a contact
relation over B, and it can be shown that Γ = F_a ∪ F_b is a cluster. However, if
s < a < t < b, and x = [s, a) ∪ [t, b), then {x} × Γ ⊆ C′ (i.e. x is connected
to every member of Γ). But neither [s, a) nor [t, b) is in Γ, so (because clans
must satisfy CL2) we must have x ∉ Γ.

Let us see how this anomaly arose. By adding the ultra-contact between
points a and b, we stipulated that every region containing point a is in the
C′ contact relation with every region containing point b. But, in this algebra,
contact also holds between regions that do not share a point, but whose closures
share a point. However, the relation C′ does not necessarily hold between
intervals i, j such that the closure of i includes a and the closure of j includes b.
This mismatch leads to a kind of discontinuity in the contact relation C′ relative
to the underlying topology of the interval algebra.


LEMMA 3.47 (DÜNTSCH AND WINTER, 2005) If (B, C) satisfies BCA6,
then each cluster is a p-cluster.


In order to see why BCA6 ensures that all clusters are saturated we first give 
another useful lemma: 


LEMMA 3.48 For every region r and cluster Γ, r ∈ Γ if and only if
for any set of regions S = {r_1, ..., r_n} such that r ≤ r_1 + ... + r_n, there is a
region r_i ∈ S such that (∀x ∈ Γ)[r_i C x].


Proof (sketch) Since clusters are maximal clans then, for any cluster Γ, if
Γ ∪ {r, ...} satisfies CL1-3 then r ∈ Γ. Moreover, to show that r ∈ Γ it
suffices to show that Γ ∪ {r} satisfies CL1-2, since then Γ ∪ {x : x ≥ r}
clearly satisfies CL1-3. It can be shown that Γ ∪ {r} satisfies CL1-2 just in
case for every sum (r_1 + ... + (r_n + r_{n+1})) = r there is some r_i such that
(∀x ∈ Γ)[xCr_i], and this implies the lemma. QED


Using this, we can prove Lemma 3.47 as follows: 


Proof Let (B, C) be a BCA satisfying BCA6. Let Γ be a cluster derived from
this algebra and r a region such that ∀x ∈ Γ[xCr]. Suppose, in contradiction
to Lemma 3.47, that r ∉ Γ. Then, by Lemma 3.48, there are r_1, ..., r_n, with
r ≤ r_1 + ... + r_n, such that for each r_i there is some x_i ∈ Γ with r_i(−C)x_i.
Then by BCA6 there are regions s_1, ..., s_n such that s_i(−C)x_i and r_i(−C)−s_i
(so each s_i contains r_i and separates it from x_i). Let s = s_1 + ... + s_n. Thus
r(−C)−s. Now pick any region y ∈ Γ. Clearly y = y_1 + ... + y_n + z, where
y_i = y · s_i and z = y · −s. Because of CL2 we must have either z ∈ Γ or some
y_i ∈ Γ. But since y_i = y · s_i and s_i(−C)x_i we have y_i(−C)x_i; so y_i ∉ Γ
(because of CL1). Thus we must have z ∈ Γ. However, since z = y · −s, we
have z ≤ −s and because r(−C)−s we have r(−C)z. But this contradicts
the premiss that ∀x ∈ Γ[xCr]. Hence the supposition that r ∉ Γ is impossible,
so we have proved Lemma 3.47. QED


The converse of Lemma 3.47 is not true (Düntsch and Winter, 2006).


4.2.5 Representation in proximity spaces. As noted above, in Sec. 2, 
proximity spaces form a useful intermediary between topological spaces and 
axiomatic theories based on a contact relation, which has analogous properties 
to the proximity relation. Indeed contact can be regarded as a limiting case of 
proximity. 

The theory of proximity spaces and their relation to topological spaces has 
been developed in detail in the seminal work of Naimpally and Warrack, 1970. 
This analysis makes heavy use of a notion of cluster, which is very similar 
to (and was the inspiration for) the cluster construct for BCAs given above. 
Because proximity spaces satisfy axiom P5, the clusters employed by Naimpally 
and Warrack are saturated. Hence, in the case of BCAs satisfying BCA6, many 
of the results of Naimpally and Warrack, 1970 can be used to demonstrate 
correspondences between BCAs, proximity spaces and topologies. 

We first consider how we can derive a BCA from a proximity space:


PROPOSITION 3.49 Let (X, Δ) be a proximity space with associated topology
τ(Δ), and RegCl(X) be the regular closed subsets of X according to the
topology τ(Δ). Then the algebra (RegCl(X), Δ) is a BCA called the proximity
connection algebra over (X, Δ).


DEFINITION 3.50 (RegCl(X), Δ) is called a standard proximity connection
algebra, if
xΔy iff x ∩ y ≠ ∅, for all x, y ∈ RegCl(X).


For our purposes, it suffices to consider only standard proximity connection
algebras. This is because of the following proposition:


PROPOSITION 3.51 (VAKARELOV ET AL., 2001) Each proximity connec- 
tion algebra is isomorphic to a standard proximity connection algebra. 


It follows immediately from the proximity axioms and the BCA axioms, that 
each standard proximity connection algebra is a BCA that satisfies the inter- 
polation axiom BCA6 (corresponding to the proximity axiom P5). We will 
demonstrate in the remainder of this section that, conversely, each BCA which 
satisfies BCA6 can be embedded into a standard proximity connection algebra. 

Given a BCA, (B,C), satisfying BCA6, our aim is to define a proximity 
on Clust(B). As with the representation in a topological space, the proximity 
space construction will again make use of clusters to represent points in the 
proximity space. Hence, each subset of the space will correspond to a set of 
clusters. 

Since a cluster is interpreted as the set of regions containing a given point, 
the intersection of two clusters is the set of regions containing two points. More 
generally, given a set X of clusters representing a set of points, the common 
intersection ⋂X will be the set of all regions that contain all those points.
Using this idea, we can for any BCA define a proximity relation between pairs 
of cluster sets, which corresponds to the contact relation of the BCA: 


DEFINITION 3.52 For any BCA (B, C) that satisfies BCA6, we define a
proximity relation over Clust(B) in the following way:
for each X, Y ⊆ Clust(B)


(Δ_B)  X Δ_B Y iff (∀x, y ∈ B)[x ∈ ⋂X and y ∈ ⋂Y imply xCy].


Using this construction, the following lemma was proved by Vakarelov et al., 
2002, based on an earlier result of Naimpally and Warrack, 1970: 


LEMMA 3.53 (Clust(B), Δ_B) is a separated proximity space.


Thus, the construction of clusters together with the definition of a proximity
relation on sets of clusters enables us to derive a proximity space from any BCA
satisfying BCA6. The structure (Clust(B), Δ_B) can be regarded as a canonical
representation of the BCA B in terms of a (separated) proximity space.

As with the topological representation, the correspondence between the
regions of the original BCA and subsets of the derived proximity space can be
specified by a function h : B → 2^Clust(B), defined by h(a) = {Γ ∈ Clust(B) :
a ∈ Γ}. This mapping both preserves the Boolean structure of the BCA and
also associates the contact relation of the BCA with the proximity relation of
the proximity space.

We have now shown that each BCA (B, C) that satisfies BCA6 is isomorphic
to a standard proximity algebra over the proximity space (Clust(B), Δ_B).
In Sec. 2 we saw that each proximity space is associated with a corresponding
topology and the properties of this topology were characterised by Proposition
3.36. This means that we can use the proximity space derived from a
BCA to define a corresponding topological space. This gives us the following
topological representation theorem for BCAs satisfying the interpolation
axiom:


PROPOSITION 3.54 (VAKARELOV ET AL., 2001) Each BCA which satisfies
BCA6 is isomorphic to a dense substructure of the regular closed algebra
of a completely regular T₁ space X with standard contact as defined by (3.38).
Furthermore, X is connected if and only if C satisfies BCA7.


It should be noted that not every completely regular T₁ space is the
representation space of a BCA which satisfies BCA6, since these spaces must be weakly
normal (see Propositions 3.32-3), and there are spaces that are completely
regular T₁, but not weakly normal (Shchepin, 1972). We have, however:


LEMMA 3.55 The BCA axioms BCA0-6 are complete with respect to the class
of dense substructures of regular closed algebras of weakly normal T₁ spaces
with standard contact.


LEMMA 3.56 The BCA axioms BCA0-7 are complete with respect to the class
of dense substructures of regular closed algebras of weakly normal connected
T₁ spaces with standard contact.


5. Other theories of topological relations 
5.1 The Region Connection Calculus 


The Region Connection Calculus (RCC) of Randell et al., 1992b is an 
axiomatisation of certain spatial concepts and relations in classical 1st-order 
predicate calculus. It has become widely known in the field of Qualitative Spa- 
tial Reasoning, a research area within the Knowledge Representation field of 
Artificial Intelligence. There is some variation in the full set of axioms used 
for the RCC theory. The formal apparatus of the original theory is complicated 
by the use of the many-sorted logic LLAMA (Cohn, 1987) and the use of a 
non-standard definite description operator (ιx[φ(x)]). This makes it difficult
to make a direct comparison with the algebraically based theories presented in
the current paper.

The RCC theory is based on a primitive relation C, which is in this context
normally called the connection relation. This is axiomatised to be reflexive (C1)
and symmetric (C2). The extensionality axiom (C3) is not given in the original
RCC theory (Randell et al., 1992b) and does not strictly follow from the other
axioms (see Bennett, 1997; Stell, 2000). However, the theory does contain
definition (3.49) for the EQ relation; and, if (as seems to have been assumed in
some subsequent development of RCC) this is taken as coinciding with logical
equality, then C3 also holds. With this assumption, we have a contact relation
in the sense defined in Sec. 3.

The RCC theory introduces further relations by means of the definitions
(3.42)-(3.53) given above (Sec. 3), which include of course the RCC-8 relation
set. The following axiom is given stipulating that every region has a
non-tangential proper part:


(RCC1) ∀x∃y[y NTPP x].


However, as shown in Düntsch et al., 2001b, this follows from the other axioms,
if we assume the extensionality axiom C3. 

RCC also incorporates a constant denoting the universal region, a sum func- 
tion and partial functions giving the product of any two overlapping regions and 
the complement of every region except the universe. With slight modification 
to the original to replace the partial product and complement functions with 
relations, these are defined as follows: 


(RCCD1) x = U ≡def ∀y[xCy],
(RCCD2) x = y + z ≡def ∀w[wCx ↔ [wCy ∨ wCz]],
(RCCD3) Prod(x, y, z) ≡def ∀u[uCz ↔ ∃v[vPx ∧ vPy ∧ uCv]],
(RCCD4) Compl(x, y) ≡def ∀z[(zCy ↔ ¬zNTPPx) ∧ (zOy ↔ ¬zPx)].





It should be noted that within the original RCC theory there is no such thing 
as a null (or empty) region. Thus there is no product of discrete regions or 
complement of the universal region. This means we do not have a full Boolean 
algebra of regions; but, in order that appropriate regions exist to fulfil the 
requirements of the quasi-Boolean structure suggested by the above definitions,
the basic RCC theory should be supplemented with the following existential 
axioms: 





(RCC2) ∀x∀y[xOy → ∃z[Prod(x, y, z)]],
(RCC3) ∀x[¬(x = U) → ∃y[Compl(x, y)]].





The many-sorted formalisation of RCC and the choice to exclude the 'null
region' from the domain of regions was motivated partly by a desire to accord
with 'commonsense' notions of spatial reality (influenced by e.g. Hayes, 1985)
and partly by wanting to improve the effectiveness of automated reasoning using
the calculus. However, from the point of view of establishing properties of the
formal system, it has been found that the lack of a null region is problematic
since it considerably complicates the comparison with standard mathematical
structures such as Boolean algebras. Hence, subsequent investigations (e.g.
Stell, 2000; Düntsch et al., 1999) have often modified the original theory by
introducing a null region so that the theory can be built upon a domain that has
the basic Boolean algebra structure.

Once the null region has been added, it is clear that the models of the revised 
RCC theory will be BCAs (as defined by Definition 3.30). Moreover, given 
the RCC axioms, it can be proved that every region is connected to its own 
complement: 


(3.74) ∀x∀y[Compl(x, y) → xCy].


This corresponds to the BCA property BCA7. In fact any model of the RCC
axioms modified to include the null region corresponds to a BCA satisfying this
property:


LEMMA 3.57 An RCC model is an RCC algebra, i.e. a BCA (B, C) which
satisfies BCA7.


This correspondence enables us to use connected BCAs as an algebraic
counterpart to the 1st-order RCC axioms. Another algebraic analysis of the RCC
theory, employing a somewhat weaker axiomatisation, is given in Stell, 2000.


5.2 The 4- and 9-intersection representations 


The 4- and 9-intersection representations were originally described by
Egenhofer and his students (Egenhofer and Franzosa, 1991; Egenhofer and Herring,
1991) as a means of representing relationships between geographic regions.
The approach is based on the idea of interpreting regions as point sets and char- 
acterising binary spatial relations in terms of topological constraints on these 
sets. The originators suggest that the representation should be applied to Jor- 
dan curve bounded regions in the plane (i.e. regions that are homeomorphic to 
closed discs); however, there is no reason why it could not be applied more 
generally to regular closed subsets of a topological space. 

In the 4-intersection representation the idea is to consider the intersection of
the boundary and interior of one region with the boundary and interior of another.
Thus, for regions A and B, we consider ∂(A) ∩ ∂(B), ∂(A) ∩ int(B), int(A) ∩
∂(B), int(A) ∩ int(B), and we determine whether or not these intersections
are empty (denoted ∅) or non-empty (denoted ¬∅). The determined values are
naturally represented by a 2x2 matrix, as shown in Table 5.1.

Relation   ∂(A) ∩ ∂(B)   ∂(A) ∩ int(B)   int(A) ∩ ∂(B)   int(A) ∩ int(B)
DC         ∅             ∅               ∅               ∅
EC         ¬∅            ∅               ∅               ∅
PO         ¬∅            ¬∅              ¬∅              ¬∅
EQ         ¬∅            ∅               ∅               ¬∅
TPP        ¬∅            ¬∅              ∅               ¬∅
NTPP       ∅             ¬∅              ∅               ¬∅
TPP˘       ¬∅            ∅               ¬∅              ¬∅
NTPP˘      ∅             ∅               ¬∅              ¬∅

Table 5.1. Topological relations definable using the 4-intersection representation.


By reference to Fig. 3.1 (in Sec. 3 above), it is easy to see that the base 
relations of the closed circle algebra can be described by this 4-intersection 
model (Egenhofer and Franzosa, 1991). 
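
For closed discs the four intersections can be decided from the radii and the distance between the centres, so the correspondence with the eight base relations can be tried out directly. The following is a minimal sketch, assuming discs are given as (cx, cy, r) triples with r > 0; the names `four_intersection`, `classify` and `PATTERNS` are hypothetical, and the suffix `i` is used for the converse relations. Exact comparisons on floating-point distances are used, so borderline inputs may be misclassified by rounding.

```python
import math

# Key: non-emptiness of (∂A∩∂B, ∂A∩int(B), int(A)∩∂B, int(A)∩int(B)).
PATTERNS = {
    (False, False, False, False): "DC",
    (True,  False, False, False): "EC",
    (True,  True,  True,  True):  "PO",
    (True,  False, False, True):  "EQ",
    (True,  True,  False, True):  "TPP",
    (False, True,  False, True):  "NTPP",
    (True,  False, True,  True):  "TPPi",
    (False, False, True,  True):  "NTPPi",
}

def four_intersection(a, b):
    """Return the four emptiness flags for closed discs a, b = (cx, cy, r)."""
    (ax, ay, ra), (bx, by, rb) = a, b
    d = math.hypot(ax - bx, ay - by)          # distance between centres
    bb = abs(ra - rb) <= d <= ra + rb         # the two circles meet
    bi = abs(d - ra) < rb                     # some point of circle A lies inside B
    ib = abs(d - rb) < ra                     # some point of circle B lies inside A
    ii = d < ra + rb                          # the open discs overlap
    return (bb, bi, ib, ii)

def classify(a, b):
    """Name of the base relation holding between discs a and b."""
    return PATTERNS[four_intersection(a, b)]
```

For example, `classify((0, 0, 1), (1, 0, 2))` yields "TPP": the smaller disc lies inside the larger one and the two circles touch at one point.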

An approach which extends the 4-intersection model also takes into account 
the complement of the sets in question, and can be described by the following 
matrix: 

int(x) ∩ int(y)   int(x) ∩ ∂(y)   int(x) ∩ −y
∂(x) ∩ int(y)     ∂(x) ∩ ∂(y)     ∂(x) ∩ −y
−x ∩ int(y)       −x ∩ ∂(y)       −x ∩ −y


While the 4-intersection model describes the topologically invariant relations
among regions bounded by closed Jordan curves, the 9-intersection model is able
to describe such relations for sets with arbitrarily shaped interiors, including
lines and points. Details can be found in Egenhofer and Herring, 1991.

Thus we see that the 9-intersection representation, based on a point-set inter- 
pretation of regions characterises exactly the same set of basic binary relations 
as the axiomatic RCC theory. This of course is not surprising given the corre- 
spondences between axiomatic algebras and topological spaces characterised 
by the representation theorems given in Sec. 4. 


6. Reasoning about topological relations 


The foregoing sections have defined a rich array of formal frameworks for
representing topological relationships between regions. We now look at ways
in which these representations can be employed to make inferences about
topological configurations of regions.


6.1 Compositional reasoning


Compositional inference may be described in general terms as a deduction,
from two relational facts of the forms aRb and bSc, of a relational fact of the
form aTc, involving only a and c. Such inferences may be useful in their own
right or may be employed as part of a larger inference mechanism, such as
a consistency checking procedure for sets of relational facts. In either case,
one will normally want to deduce the strongest relation aTc that is entailed by
aRb ∧ bSc and which is expressible in whatever formalism is being employed.

In 1st-order logic we can express the strongest fact derivable from aRb ∧ bSc
by the formula a(R ; S)c, where the ; operator is defined by:





(3.75) x(R ; S)y ≡def ∃z[xRz ∧ zSy].


Hence, the meaning of ';' coincides with that of composition in Binary Relation
Algebras (as defined in Sec. 2). This may be called the strong composition
operator. It is also often called the extensional composition, because, if we
know that x and y stand in a relation equivalent to R ; S, we can infer the
existence of an entity z, such that xRz and zSy.

As a means of practical reasoning, inferring strong compositions in an expressive
language such as 1st-order logic may not be very effective as the formulae
generated will in general be more complex than the initial formulae and no more 
informative. However, if it is found that for a certain set of relations, every for- 
mula derived by compositional inference is equivalent to some relatively simple 
formula (preferably a single relation of the language or perhaps a disjunction 
of relations) then compositional inference may be a very powerful tool. 

In the case of the Allen calculus based on 13 basic temporal interval relations 
(Allen, 1983), it turns out that (under a very natural interpretation in terms of 
intervals on the rational line) the extensional compositions of any pair of base 
relations correspond to some disjunction of the basic 13 relations. Hence, 
composition can be applied without generating more complex relations. 

It has been found that in cases where the strong, extensional composition 
cannot be simply expressed, it is useful to generalise the notion of composition 
to allow weaker inferences. In particular the following notion is often used: 


DEFINITION 3.58 Given a theory Θ whose vocabulary includes a set Rels
of relations, the weak composition, WComp(R, S), where R, S ∈ Rels, is
defined to be the disjunction of all relations T_i ∈ Rels, such that there exist
individual constants a, b, c, where the formula R(a, b) ∧ S(b, c) ∧ T_i(a, c) is
consistent with Θ.


This means that if WComp(R, S) = T_1 ∪ ... ∪ T_n then


(3.76) Θ ⊢ ∀x∀y∀z[xRy ∧ ySz → (xT_1z ∨ ... ∨ xT_nz)],

and, furthermore, T_1 ∪ ... ∪ T_n is the smallest subset of Rels for which such
a formula is provable. Moreover, it is easy to show that R ; S is always a
sub-relation of (or equivalent to) WComp(R, S).
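
The gap between strong and weak composition can be seen on a toy example. The following is a minimal sketch, assuming a single finite model (the integers 0 to 3 with the three base relations <, = and >) in place of a theory Θ, so that "consistent with Θ" is approximated by "realised in this model"; the names `BASE`, `strong_comp`, `weak_comp` and `union` are hypothetical.

```python
from itertools import product

DOMAIN = range(4)

# Base relations of the toy model, stored extensionally as sets of pairs.
BASE = {
    "<": {(x, y) for x, y in product(DOMAIN, repeat=2) if x < y},
    "=": {(x, y) for x, y in product(DOMAIN, repeat=2) if x == y},
    ">": {(x, y) for x, y in product(DOMAIN, repeat=2) if x > y},
}

def strong_comp(r, s):
    """R ; S = {(x, y) : there is some z with x R z and z S y}."""
    return {(x, y) for (x, z) in r for (z2, y) in s if z == z2}

def weak_comp(r_name, s_name):
    """Names of the base relations that share at least one pair with R ; S."""
    rs = strong_comp(BASE[r_name], BASE[s_name])
    return {name for name, t in BASE.items() if t & rs}

def union(names):
    """Union of the named base relations."""
    return set().union(*(BASE[n] for n in names))
```

In this finite model the weak composition of < with itself is just <, but the strong composition is a proper sub-relation of it: the pair (0, 1) is in < although no element lies strictly between 0 and 1.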

Given this definition, inferences of the following form will always be valid: 


           R(a, b) ∧ S(b, c) ∧ T(a, c)
(3.77)    -----------------------------   [WComp]
            (WComp(R, S) ∩ T)(a, c)


For a finite set of relations, WComp(R, S) can be pre-computed for every
pair of relations and stored in a matrix known as a composition table. This
provides a simple mechanism for computing compositional inferences by looking
up compositions in the table. The typical mode by which this kind of
compositional reasoning is executed is to repeatedly draw compositional inferences
using table look-up until either an inconsistency is detected or no new
inferences can be made. Since their introduction by Allen, 1983, composition tables
have received considerable attention from researchers in AI and related
disciplines (Vilain and Kautz, 1986; Egenhofer, 1991; Freksa, 1992; Randell et al.,
1992a; Röhrig, 1994; Cohn et al., 1994; Düntsch et al., 1999; Düntsch et al.,
2001b). (In fact Allen called his table a 'transitivity table', but 'composition
table' is arguably more appropriate and it seems that this is becoming the
standard term.)
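
The table look-up procedure just described can be sketched in a few lines. The following is a minimal illustration, assuming the three-relation point calculus (<, =, >) as a stand-in for a larger table such as that of RCC-8; the names `TABLE`, `CONVERSE`, `compose` and `close` are hypothetical. Each pair of nodes carries a set of still-possible base relations, which is repeatedly intersected with the composition of the relations along every two-step path.

```python
from itertools import product

BASE = frozenset("<=>")
CONVERSE = {"<": ">", "=": "=", ">": "<"}
# Composition table of the point calculus (an assumed stand-in example).
TABLE = {
    ("<", "<"): "<",   ("<", "="): "<", ("<", ">"): "<=>",
    ("=", "<"): "<",   ("=", "="): "=", ("=", ">"): ">",
    (">", "<"): "<=>", (">", "="): ">", (">", ">"): ">",
}

def compose(rs, ss):
    """Composition of two disjunctions of base relations, by table look-up."""
    out = set()
    for r, s in product(rs, ss):
        out |= set(TABLE[(r, s)])
    return frozenset(out)

def close(nodes, facts):
    """Refine a network until nothing changes; return None on inconsistency."""
    net = {(a, b): (frozenset("=") if a == b else BASE)
           for a in nodes for b in nodes}
    for (a, b), rels in facts.items():
        net[(a, b)] &= frozenset(rels)
        net[(b, a)] &= frozenset(CONVERSE[r] for r in rels)
    changed = True
    while changed:
        changed = False
        for a, b, c in product(nodes, repeat=3):
            refined = net[(a, c)] & compose(net[(a, b)], net[(b, c)])
            if not refined:
                return None               # empty relation: inconsistency found
            if refined != net[(a, c)]:
                net[(a, c)] = refined
                changed = True
    return net
```

Given a < b and b < c the procedure infers a < c; adding c < a makes it return `None`. For calculi in which the table gives only weak compositions, reaching a fixed point without an empty relation does not in general guarantee consistency.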

Table 6.2 is usually called the composition table for the RCC-8 relations.
(The identity relation is omitted from the table since it is clear how
composition with 1′ works.) In general, the table gives the weak composition of
the RCC-8 relations. This is because over many domains (e.g. over regular 
closed sets of an arbitrary topological space) the algebra is not atomic, so that 
compositional combinations of the RCC-8 relations generate an infinite set of 
different relations, many of which are not expressible as disjunctions of the 
RCC-8 generating set. 

Nevertheless, there are some more restricted interpretations under which the 
RCC-8 relations are indeed the atoms of an RA (so that every relation in the 
algebra is a disjunction of these relations). One simple example is the case 
where the domain of regions is the set De of closed circles in the plane (with the 
usual interpretation of the relations) as depicted in Fig. 3.1. In the case where 
the domain is taken as the Jordan curve bounded regions of the plane, the table 
also corresponds to extensional composition (Li and Ying, 2003b; Li and Ying, 
2003a). 

It should be mentioned that the RCC-8 relations are never the atoms of an
RA generated by C over an RCC algebra (Düntsch, 2005); thus the composition
table is not extensional for such algebras. For instance the composition table
shows that WComp(EC, EC) = 1′ ∪ DC ∪ EC ∪ PO ∪ TPP ∪ TPP˘. If
this were extensional, it would mean that for any regions a, b such that aECb
we could find a third region c such that aECc and cECb. However, suppose
a = −b; then there can be no region in the relation EC to both a and b.
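;      | DC          | EC                     | PO          | TPP                | NTPP        | TPP˘                   | NTPP˘
DC     | 1           | DR, PO, PP             | DR, PO, PP  | DR, PO, PP         | DR, PO, PP  | DC                     | DC
EC     | DR, PO, PP˘ | DR, PO, TPP, TPP˘, 1′  | DR, PO, PP  | EC, PO, PP         | PO, PP      | DR                     | DC
PO     | DR, PO, PP˘ | DR, PO, PP˘            | 1           | PO, PP             | PO, PP      | DR, PO, PP˘            | DR, PO, PP˘
TPP    | DC          | DR                     | DR, PO, PP  | PP                 | NTPP        | DR, PO, TPP, TPP˘, 1′  | DR, PO, PP˘
NTPP   | DC          | DC                     | DR, PO, PP  | NTPP               | NTPP        | DR, PO, PP             | 1
TPP˘   | DR, PO, PP˘ | EC, PO, PP˘            | PO, PP˘     | PO, TPP, TPP˘, 1′  | PO, PP      | PP˘                    | NTPP˘
NTPP˘  | DR, PO, PP˘ | PO, PP˘                | PO, PP˘     | PO, PP˘            | O           | NTPP˘                  | NTPP˘

Table 6.2. The composition table of De.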


6.2 Equational reasoning 


In Sec. 2 we looked at the algebraic characterisation of topological spaces in 
terms of Closure Algebras and their complementary Interior Algebras. Since 
these algebras can be defined by purely equational axioms, this representation 
suggests that it should be possible to use some form of equational inference to 
reason about topological relationships among regions. 

In general (according to Proposition 3.22) the elements of a closure algebra
can correspond to arbitrary subsets of a topological space. However, in order
that the domain of regions be compatible with the topological interpretations
of region-based axiomatic theories (such as the BCAs discussed in Sec. 4.1
and 4.2), we will often want to identify and reason about either regular open
or regular closed sets. In the first case one should assert an equation x =
int(cl(x)) for each region variable x; and in the second case one should assert
x = cl(int(x)). In either case we can define a large vocabulary of relations in
terms of equations of an interior/closure algebra.

If we are dealing with regions corresponding to regular closed sets then the 
following definitions of binary topological relations can be given: 


(3.78) xDCy ⟺ −(x · y) = 1,
(3.79) xDRy ⟺ −(int(x) · int(y)) = 1,
(3.80) xPy ⟺ −x + y = 1,
(3.81) xP˘y ⟺ x + −y = 1,
(3.82) xNTPy ⟺ −x + int(y) = 1,
(3.83) xNTP˘y ⟺ int(x) + −y = 1,
(3.84) xEQy ⟺ x = y.


(The extension of NTP coincides with NTPP, except that 1 NTP 1 is true.)

But C itself (as well as many other relations, including O) cannot be defined 
by an interior algebraic equation. This follows from the general observation 
that purely equational constraints are always consistent with any purely equa- 
tional theory (there must always be at least a trivial one-element model, in 
which all constants denote the same individual). Thus if the negation of some 
constraint can be expressed as an equation, then the constraint itself cannot be 
equationally expressible (otherwise that constraint would be consistent with its 
own negation). 

So to define C (and O) we need to employ disequalities: 


(3.85) xCy ⟺ −(x · y) ≠ 1,
(3.86) xOy ⟺ −(int(x) · int(y)) ≠ 1.


Moreover, all the RCC-8 relations can be defined by some combination of
the equations given in (3.78)-(3.84) and negations of these equations. Those not
already specified can be defined as follows:


(3.87) xECy ⟺ (−(int(x) · int(y)) = 1) ∧ (−(x · y) ≠ 1),
(3.88) xPOy ⟺ (−(int(x) · int(y)) ≠ 1) ∧ (−x + y ≠ 1) ∧ (x + −y ≠ 1),
(3.89) xTPPy ⟺ (−x + y = 1) ∧ (x ≠ y) ∧ (−x + int(y) ≠ 1),
(3.90) xTPP˘y ⟺ (x + −y = 1) ∧ (x ≠ y) ∧ (int(x) + −y ≠ 1).


For many applications we will also want to specify that certain regions are
non-empty. This is easily done using the disequality −x ≠ 1. Various
other useful binary RCC relations are expressible by means of interior algebra
equations. For example, EQ(x + y, 1) can be expressed by x ∪ y = 1.
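
The equational definitions can be exercised on a small finite closure algebra. The following is a minimal sketch, assuming a seven-point space in which odd points are open and each even point has a three-point minimal neighbourhood, with regions taken to be non-empty regular closed sets; `khalimsky`, `interior`, `closure`, `neg` and `rcc8` are hypothetical names, and the suffix `i` marks converse relations. Each test below checks whether a term equals 1, i.e. the whole space X.

```python
def khalimsky(n):
    """Points 0..n-1; minimal open neighbourhoods as described in the lead-in."""
    X = frozenset(range(n))
    nbhd = {p: frozenset({p}) if p % 2 else frozenset({p - 1, p, p + 1}) & X
            for p in X}
    return X, nbhd

X, NBHD = khalimsky(7)

def interior(s):
    return frozenset(p for p in s if NBHD[p] <= s)

def neg(s):
    return X - s

def closure(s):
    return neg(interior(neg(s)))

def is_regular_closed(s):
    return closure(interior(s)) == s

def DC(x, y):  return neg(x & y) == X                      # (3.78)
def DR(x, y):  return neg(interior(x) & interior(y)) == X  # (3.79)
def P(x, y):   return (neg(x) | y) == X                    # (3.80)
def NTP(x, y): return (neg(x) | interior(y)) == X          # (3.82)

def rcc8(x, y):
    """Classify two non-empty regular closed sets using the tests above."""
    if x == y:
        return "EQ"
    if DC(x, y):
        return "DC"
    if DR(x, y):
        return "EC"          # in contact (not DC) but interiors disjoint
    if P(x, y):
        return "NTPP" if NTP(x, y) else "TPP"
    if P(y, x):
        return "NTPPi" if NTP(y, x) else "TPPi"
    return "PO"
```

For instance, {0, 1, 2} and {2, 3, 4} share only the closed point 2, so they come out as EC, while {2, 3, 4} is a tangential proper part of {0, 1, 2, 3, 4}.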


The problem of reasoning with topological relations can thus be reduced to
one of reasoning with algebraic equations and disequalities; and this in turn
can be reduced to the problem of testing consistency of sets of equations and
disequalities. Moreover, the following lemma tells us that the consistency of
such sets can be determined as long as we have a means of computing whether
a given equation follows from a set of equations:


LEMMA 3.59 A set of algebraic equalities and disequalities,
{x_1 = y_1, ..., x_m = y_m, z_1 ≠ w_1, ..., z_n ≠ w_n}, is inconsistent just in case
x_1 = y_1, ..., x_m = y_m ⊢ z_i = w_i for some i such that 1 ≤ i ≤ n.
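
The shape of this reduction can be illustrated in a deliberately simplified setting. The following is a minimal sketch, assuming that all terms are opaque constants with no algebraic operators, so that entailment between equations reduces to membership in the same equivalence class and can be decided with a union-find structure. A real implementation for closure algebras would need an entailment procedure for the full equational theory in place of `entails`; the function names are hypothetical.

```python
def find(parent, t):
    """Representative of t's equivalence class (with path halving)."""
    parent.setdefault(t, t)
    while parent[t] != t:
        parent[t] = parent[parent[t]]
        t = parent[t]
    return t

def entails(equalities, lhs, rhs):
    """Do the given equalities between constants entail lhs = rhs?"""
    parent = {}
    for a, b in equalities:
        parent[find(parent, a)] = find(parent, b)   # merge the two classes
    return find(parent, lhs) == find(parent, rhs)

def inconsistent(equalities, disequalities):
    """Lemma 3.59: inconsistent iff some disequality's sides are entailed equal."""
    return any(entails(equalities, z, w) for z, w in disequalities)
```

For example, `inconsistent([("x", "y"), ("y", "z")], [("x", "z")])` is `True`, because x = z follows from the two equalities.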





Clearly this kind of approach could be applied to any of the other purely 
equationally defined algebras defined above (in Sec. 2). 

Though equational reasoning has long been a major topic in mathematics and 
computer science and many general techniques are known, it seems that there 
has been little research directed specifically at equational reasoning in this kind 
of spatial algebra. However, an indirect way of implementing such reasoning 
is by means of an encoding into modal logic, described in the next section. 


6.3 Encoding in propositional modal logics 


A propositional modal logic augments the classical propositional logic with 
one or more unary connectives. We assume familiarity with the basics of these 
formalisms. Full details can be found in many texts, such as Hughes and 
Cresswell, 1968; Blackburn et al., 2001. 

We first consider normal modal logics with a single modality. As usual, the
modal necessity operator will be denoted by □, and its dual possibility operator
by ◇ (where ◇p ≡def ¬□¬p). Let ℱ be the set of all (well-formed) propositional
modal formulae (defined in the usual way).








DEFINITION 3.60 A normal propositional modal logic ML is identified with
the set of its theorems. More specifically, ML is a subset of ℱ satisfying the
following conditions:

(ML1) All classical tautologies are in ML.
(ML2) If p ∈ ML and (p → q) ∈ ML, then q ∈ ML.
(ML3) ML is closed under substitution.
(ML4) (Defn. of □) (□p ↔ ¬◇¬p) ∈ ML.
(ML5) (Extensionality) If (p ↔ q) ∈ ML then (◇p ↔ ◇q) ∈ ML.
(ML6) (◇(p ∧ ¬p) → (p ∧ ¬p)) ∈ ML.
(ML7) (◇(p ∨ q) ↔ (◇p ∨ ◇q)) ∈ ML.


This definition of normal modal logics is chosen to make clear the connection
with modal algebras. A more common approach is to define a modal
logic as a set of formulae satisfying conditions ML1-4, together with the
Rule of Necessitation: if p ∈ ML then □p ∈ ML. Normal modal logics
are then defined as those additionally satisfying the Kripke schema, K:
((□p ∧ □(p → q)) → □q) ∈ ML. The two specifications are known to be
equivalent (see e.g. Chellas, 1980, Ch. 4).

Normal modal logics can be interpreted in terms of the well-known Kripke
semantics. Specifically, a model 𝔐 of a normal modal logic ML is a structure
(W, R, v), where W is a set, R a binary relation on W (i.e. (W, R) is a frame),
and v : V → 2^W a valuation function which is extended over ML as follows:


v(¬φ) = W \ v(φ) = {w ∈ W : w ∉ v(φ)},
v(φ ∧ ψ) = v(φ) ∩ v(ψ),
v(⊤) = W,
v(◇φ) = {w ∈ W : (∃u)[u ∈ v(φ) and uRw]},


for all φ, ψ ∈ ML.
The elements of W are often called possible worlds.
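
The extension of a valuation to compound formulae can be computed directly on a finite model. The following is a minimal sketch, assuming formulae are encoded as nested tuples such as ("dia", ("var", "p")) and that R is a set of pairs (u, w) read as uRw, matching the clause for ◇ above; the encoding and the name `evaluate` are hypothetical.

```python
def evaluate(formula, worlds, R, val):
    """Return the set of worlds at which the formula holds."""
    W = frozenset(worlds)
    op = formula[0]
    if op == "var":                       # propositional variable
        return frozenset(val[formula[1]]) & W
    if op == "top":
        return W
    if op == "not":
        return W - evaluate(formula[1], W, R, val)
    if op == "and":
        return evaluate(formula[1], W, R, val) & evaluate(formula[2], W, R, val)
    if op == "dia":                       # w such that uRw for some u in v(φ)
        inner = evaluate(formula[1], W, R, val)
        return frozenset(w for w in W if any((u, w) in R for u in inner))
    if op == "box":                       # defined as the dual of dia
        return evaluate(("not", ("dia", ("not", formula[1]))), W, R, val)
    raise ValueError(f"unknown connective: {op!r}")
```

With W = {0, 1, 2}, R = {(0, 1), (1, 2)} and p true only at world 0, the formula ◇p holds exactly at world 1.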


6.3.1 Modal logics and algebras. There is an intimate connection
between propositional logics and algebras. The set ℱ of all modal formulae can
be regarded as a term algebra (i.e. an absolutely free algebra generated from
the propositional constants) by taking the connectives as (syntactic) operators
on formulae.

To obtain an algebraic perspective on the structure of a particular modal
logic ML we can construct a quotient algebra of ℱ relative to the logical
equivalence relation of ML. (Given an algebra 𝔄 = (A, f_1, ..., f_n), and an
equivalence relation ~, the quotient algebra of 𝔄 relative to ~ is the structure
(A^~, f_1^~, ..., f_n^~). Let x^~ = {y | y ~ x}. Then A^~ = {x^~ | x ∈ A}, and
f_i^~(x_1^~, ..., x_m^~) = y^~ if f_i(x_1, ..., x_m) = y.) Each element of this quotient
algebra will thus correspond to a semantically distinct proposition expressible
in the logic.


DEFINITION 3.61 Given ML ⊆ ℱ, the Lindenbaum-Tarski algebra ℱ_ML of
ML is the quotient algebra of ℱ by the equivalence relation


(3.91) x ~_ML y if and only if (x ↔ y) ∈ ML.


The resulting algebras are modal algebras in the sense of Definition 3.7, and 
thus they are BAOs. 

The Lindenbaum-Tarski construction can also be used to characterise the
equational class of all modal algebras: each equivalence $x \approx_{\mathcal{ML}} y$ corresponds
to a universally quantified equation $\forall v_1, \ldots, v_n [x = y]$, where $v_1, \ldots, v_n$ are
all the propositional variables occurring in either $x$ or $y$.

There is a direct correspondence between modal algebras and modal logics. 
The rule ML5 ensures that $\Diamond$ (and hence $\Box$) is functional; ML6 corresponds
to the algebraic normality condition (3.3) and ML7 to additivity (3.2). The
generality of the correspondence is expressed by the following proposition:


PROPOSITION 3.62 (JÓNSSON, 1993) Let $V(\mathcal{ML})$ be the equational class
generated by $\mathfrak{F}_{\mathcal{ML}}$. The mapping $\mathcal{ML} \mapsto V(\mathcal{ML})$ is a dual isomorphism
from the lattice of all normal modal logics to the lattice of equational classes
of modal algebras.


6.3.2 S4 and interior algebras. One of the better known modal 
logics is S4. This can be defined as a normal modal logic that also satisfies the 
following axiom schemata: 


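(3.92) $(\varphi \lor \Diamond\varphi) \leftrightarrow \Diamond\varphi$,
(3.93) $\Diamond\Diamond\varphi \leftrightarrow \Diamond\varphi$.


S4 is more often defined by the schemata $\Box\varphi \to \varphi$ (T) and $\Box\varphi \to \Box\Box\varphi$ (4),
which are equivalent to those given here.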

Clearly, in the Lindenbaum-Tarski algebra generated from the set of theorems
of S4, these schemata will generate equations of the form of (3.4) and (3.5)
characterising a closure operator. From a semantic point of view, the class of
models of S4 consists of all Kripke frames whose accessibility relation is reflexive
and transitive; thus according to Proposition 3.15, their complex algebras
are exactly the closure algebras.
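The semantic half of this observation can be checked mechanically on small frames. The sketch below is illustrative only; the function names and the example frames are assumptions, and the conditions tested are the usual closure-operator conditions (extensivity and idempotence) together with additivity and normality. It applies the diamond operator of the complex algebra to every subset of a finite frame.

```python
from itertools import combinations


def powerset(W):
    """All subsets of W as frozensets."""
    items = sorted(W)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def diamond(X, R):
    """Complex-algebra operator: {w : some u in X with u R w}."""
    return frozenset(w for (u, w) in R if u in X)


def is_closure_operator(W, R):
    """Check the closure conditions for diamond over all subsets of W."""
    subsets = powerset(W)
    for X in subsets:
        cX = diamond(X, R)
        if not X <= cX:                  # extensive: X is below cl X
            return False
        if diamond(cX, R) != cX:         # idempotent: cl cl X = cl X
            return False
        for Y in subsets:                # additive: cl(X + Y) = cl X + cl Y
            if diamond(X | Y, R) != cX | diamond(Y, R):
                return False
    return diamond(frozenset(), R) == frozenset()   # normal: cl 0 = 0


W = {0, 1, 2}
preorder = {(0, 0), (1, 1), (2, 2), (0, 1), (1, 2), (0, 2)}   # reflexive, transitive
not_transitive = {(0, 0), (1, 1), (2, 2), (0, 1), (1, 2)}     # (0, 2) is missing
```

On the reflexive and transitive frame all conditions hold, whereas dropping the pair (0, 2) breaks idempotence: the closure of {0} is {0, 1}, whose closure is in turn {0, 1, 2}.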

This correspondence is the basis of the encoding of topological relationships 
into S4 proposed in Bennett, 1996 and Bennett, 1997 and also investigated 
in Nutt, 1999. A similar method had previously been used in Bennett, 1994 
to encode modal formulae into intuitionistic propositional logic. This is based 
on the relation of intuitionistic logic to Heyting algebras, which in turn can be 
interpreted over topological spaces (Sec. 2 above). 

Let $\tau \Rightarrow \varphi$ denote the one-to-one mapping from terms of closure algebra to
syntactically isomorphic formulae of S4: specifically $\varphi$ is obtained from $\tau$ by
replacing $-$ by $\neg$, $+$ by $\lor$, $\cdot$ by $\land$, cl by $\Diamond$ and int by $\Box$. The following Lemma
enables us to use deduction in S4 to determine entailment among equations in
closure algebra:


LEMMA 3.63 (BENNETT, 1997) Let $\tau_0 = 1, \ldots, \tau_n = 1$ be equations of
closure algebra and $\tau_i \Rightarrow \varphi_i$. Then


(3.94) $\tau_1 = 1, \ldots, \tau_n = 1 \models \tau_0 = 1$ iff $\Box\varphi_1, \ldots, \Box\varphi_n \vdash_{S4} \varphi_0$.








Because of Lemma 3.59, we can also use the modal encoding for testing
inconsistency of sets of equations and disequations of closure algebra, and
hence for reasoning about topological properties and relationships among
spatial regions.


6.3.3 A bi-modal spatial logic. We now give a brief overview of a
somewhat more expressive modal encoding of topological relationships
proposed in Bennett, 1997. This is obtained by employing a bi-modal logic
incorporating both S4 and the ‘universal’ modal operator (Goranko, V. and
Passy, S., 1992), here denoted by $\blacksquare$ (with its dual being denoted $\blacklozenge$, where
$\blacklozenge\varphi \equiv \neg\blacksquare\neg\varphi$). As before, the S4 operators $\Box$ and $\Diamond$ correspond respectively
to the interior and closure operators, int and cl. The interpretation of $\blacksquare\varphi$
is that $\varphi$ holds at all possible worlds. The universal modal operator is closely
related to the better known S5 modality, which is the logic semantically
determined by taking the accessibility relation to be reflexive, symmetric and
transitive (i.e. an equivalence relation). However, S5 allows the possibility
that the set of worlds is partitioned into several sets of worlds which are
not accessible to each other. Thus, if $\blacksquare$ were an S5 modality, $\blacksquare\varphi$ would
be true at world $w$ as long as $\varphi$ holds in all worlds in the same equivalence
class as $w$, and not necessarily in all worlds. The axiomatisation of $\blacksquare$
is the same as that of $\Box$, with the addition of the following two schemata
(Goranko, V. and Passy, S., 1992; Wolter and Zakharyaschev, 2002):


(3.95) $\blacklozenge\varphi \to \blacksquare\blacklozenge\varphi$,
(3.96) $\blacksquare\varphi \to \Box\varphi$.











A formula of the form $\blacksquare\varphi$ ensures that the
topological condition encoded by $\varphi$ holds at every point in space. Similarly,
$\blacklozenge\varphi$ means that there is some point $p$ satisfying the condition represented by $\varphi$.
$p$ can be thought of as a sample point, which bears witness to some topological
constraint. For instance, where two regions $x$ and $y$ overlap, the corresponding
modal formula $\blacklozenge(\Box(x) \land \Box(y))$ ensures the existence of a point which is in
the interior of both $x$ and $y$.

In terms of the bi-modal logic S5/S4, a set of key RCC relations are
represented as follows:
































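(3.97) $C(x,y) \iff \blacklozenge(x \land y)$,
(3.98) $DC(x,y) \iff \blacksquare(\neg x \lor \neg y)$,
(3.99) $O(x,y) \iff \blacklozenge(\Box(x) \land \Box(y))$,
(3.100) $DR(x,y) \iff \blacksquare\Diamond(\neg x \lor \neg y)$,
(3.101) $P(x,y) \iff \blacksquare(x \to y)$,
(3.102) $\neg P(x,y) \iff \blacklozenge(x \land \neg y)$,
(3.103) $TP(x,y) \iff \blacksquare(x \to y) \land \blacklozenge(x \land \Diamond(\neg y))$,
(3.104) $NTP(x,y) \iff \blacksquare(x \to \Box(y))$,
(3.105) $\text{Non-Empty}(x) \iff \blacklozenge x$,
(3.106) $\text{Regular}(x) \iff \blacksquare(\Box(\neg x) \lor \Diamond(\Box(x)))$.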


All the RCC-8 relations can be expressed in terms of these formulae by using
conjunction and negation. Further details of how this logic can be used for
topological reasoning are given in Bennett, 1997 and Wolter and Zakharyaschev,
2002.
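The encodings can be tried out on a finite model. The sketch below is illustrative and rests on several assumptions: regions are taken to be sets of points of a finite reflexive and transitive frame, closure follows the Kripke clause used earlier (w is in the closure of X when some u in X has uRw), interior is its dual, and the universal operators are read as "everywhere" and "somewhere". The three-point frame and all function names are hypothetical.

```python
def closure(X, R):
    """Diamond: points w reached from some u in X (u R w)."""
    return frozenset(w for (u, w) in R if u in X)


def interior(X, W, R):
    """Box, as the dual of closure."""
    return W - closure(W - X, R)


def everywhere(X, W):
    """Universal box: the condition X holds at every point."""
    return X == W


def somewhere(X):
    """Universal diamond: the condition X holds at some point."""
    return len(X) > 0


def C(x, y):
    return somewhere(x & y)


def DC(x, y, W):
    return everywhere((W - x) | (W - y), W)


def O(x, y, W, R):
    return somewhere(interior(x, W, R) & interior(y, W, R))


def DR(x, y, W, R):
    return everywhere(closure((W - x) | (W - y), R), W)


def P(x, y, W):
    return everywhere((W - x) | y, W)


def TP(x, y, W, R):
    return P(x, y, W) and somewhere(x & closure(W - y, R))


def NTP(x, y, W, R):
    return everywhere((W - x) | interior(y, W, R), W)


def regular(x, W, R):
    return everywhere(interior(W - x, W, R) | closure(interior(x, W, R), R), W)


# Two open points l and r with a shared boundary point m.
W = frozenset({"l", "m", "r"})
R = {("l", "l"), ("m", "m"), ("r", "r"), ("l", "m"), ("r", "m")}
x = frozenset({"l", "m"})
y = frozenset({"m", "r"})
```

In this model x and y share only the boundary point m, so they are connected but do not overlap, which is the externally connected case; the singleton {m} has empty interior and fails the regularity test.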


6.4 A proof system for CRAs 


In this section we will describe a sound and complete logic for contact 
relation algebras, within which general facts about CRAs can be proved. The 
semantics of this logic is relational, introduced by Orlowska, 1991 (and further 
developed in Orlowska, 1996), and the proof system is in the style of Rasiowa 
and Sikorski, 1963. 

Our language $\mathcal{L}$ consists of the disjoint union of the following sets:


i) A set $\{C, 1'\}$ of constants, representing the generating contact relation
and the identity.

ii) An infinite set $V$ of individuum variables.

iii) A set $\{+, \cdot, -, ;, {}^{\smile}\}$ of names for the relational operators.

iv) A set $\{(, )\}$ of delimiters.


With some abuse of language, we use the same symbols for the actual operations.
The terms of the language are defined recursively:


i) $C$ and $1'$ are terms.

ii) If $R$ and $S$ are terms, so are $(R + S)$, $(R \cdot S)$, $(-R)$, $(R ; S)$, $(R^{\smile})$.

iii) No other string is a term.


The set of all terms will be denoted by $\mathcal{T}$. In the sequel, we will follow the usual
conventions of reducing brackets. The set $\mathcal{F}$ of $\mathcal{L}$-formulae is


$\{xRy : R \in \mathcal{T},\ x, y \in V\}$.


A model of $\mathcal{L}$ is a pair $M = (W, m)$, where $W$ is a nonempty set, and
$m : \mathcal{T} \to 2^{W \times W}$ is a mapping such that:


(3.107) $m(C)$ is a contact relation.

(3.108) $m(1')$ is the identity relation on $W$.

(3.109) $m$ is a homomorphism from the algebra of terms to
$(\mathrm{Rel}(W), \cup, \cap, -, ;, {}^{\smile})$.


A valuation $v$ is a mapping from $V$ to $W$. If $xRy$ is a formula, then we say that
$M$ satisfies $xRy$ under $v$, written as $M, v \models xRy$, if $(v(x), v(y)) \in m(R)$.
$xRy$ is called true in the model $M$, if $M, v \models xRy$ for all valuations $v$. $xRy$
is called valid, if it is true in all models.
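To make the relational semantics concrete, the following sketch interprets terms over a small finite set. It is an illustration under stated assumptions: terms are encoded as nested tuples (a hypothetical encoding), the relation `C` below is merely reflexive and symmetric and stands in for m(C) without checking the chapter's full definition of a contact relation, and the function names are ours.

```python
def compose(R, S):
    """Relational composition R ; S."""
    return {(x, y) for (x, z) in R for (z2, y) in S if z == z2}


def meaning(term, W, C):
    """Interpret a term as a binary relation on W.

    Terms: "C", "1'", ("+", R, S), (".", R, S), ("-", R), (";", R, S), ("conv", R)
    """
    full = {(x, y) for x in W for y in W}
    if term == "C":
        return set(C)
    if term == "1'":
        return {(x, x) for x in W}
    op, args = term[0], [meaning(t, W, C) for t in term[1:]]
    if op == "+":
        return args[0] | args[1]
    if op == ".":
        return args[0] & args[1]
    if op == "-":
        return full - args[0]
    if op == ";":
        return compose(args[0], args[1])
    if op == "conv":
        return {(y, x) for (x, y) in args[0]}
    raise ValueError(f"unknown operator: {op}")


def true_in_model(term, x, y, W, C):
    """xRy is true in the model if it holds under every valuation of x and y."""
    rel = meaning(term, W, C)
    if x == y:                      # same variable on both sides
        return all((w, w) in rel for w in W)
    return all((a, b) in rel for a in W for b in W)


W = {0, 1, 2}
C = {(0, 0), (1, 1), (2, 2), (0, 1), (1, 0), (1, 2), (2, 1)}
```

With this interpretation the formula x(C + (-C))y is true in the model, xCy is not (the pair (0, 2) is missing), and x(C ; C)y is true because every pair of points is linked through the point 1.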

The proof system consists of two types of rules: With decomposition rules 
we can decompose formulae into an equivalent sequence of simpler formulae. 
The decomposition rules are the same for every system of relation algebras. The 
specific rules are tailored towards the concrete situation; they modify a sequence 
of formulae and have the status of structural rules. The role of axioms is played 
by axiomatic sequences. 

Proofs have the form of trees: Given a formula $xRy$, we successively apply
decomposition or specific rules; in this way we obtain a tree whose root is $xRy$,
and whose nodes consist of sequences of formulae. A branch of a tree is closed
if it contains a node which contains an axiomatic sequence as a subsequence.
A tree is closed if all its branches are closed.

















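$(\cup)\quad \dfrac{K,\ x(R \cup S)y,\ H}{K,\ xRy,\ xSy,\ H}$

$(-\cup)\quad \dfrac{K,\ x{-}(R \cup S)y,\ H}{K,\ x(-R)y,\ H \mid K,\ x(-S)y,\ H}$

$(\cap)\quad \dfrac{K,\ x(R \cap S)y,\ H}{K,\ xRy,\ H \mid K,\ xSy,\ H}$

$(-\cap)\quad \dfrac{K,\ x{-}(R \cap S)y,\ H}{K,\ x(-R)y,\ x(-S)y,\ H}$

$({}^{\smile})\quad \dfrac{K,\ xR^{\smile}y,\ H}{K,\ yRx,\ H}$

$(-{}^{\smile})\quad \dfrac{K,\ x(-R^{\smile})y,\ H}{K,\ y(-R)x,\ H}$

$(--)\quad \dfrac{K,\ x(--R)y,\ H}{K,\ xRy,\ H}$

$(;)\quad \dfrac{K,\ x(R ; S)y,\ H}{K,\ xRz,\ H,\ x(R ; S)y \mid K,\ zSy,\ H,\ x(R ; S)y}$, where $z$ is any variable

$(-;)\quad \dfrac{K,\ x{-}(R ; S)y,\ H}{K,\ x(-R)z,\ z(-S)y,\ H}$, where $z$ is a restricted variable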





Table 6.3. Decomposition rules. 


Rasiowa-Sikorski systems are, in a way, dual to tableaux: Whereas in the latter
one tries to refute the negation of a formula, the Rasiowa-Sikorski systems
attempt to verify a formula by closing the branches of a decomposition tree
with axiomatic sequences.

The decomposition rules of the system are given in Table 6.3, and the specific
rules for the system are given in Table 6.4. There, a variable $z$ is called restricted
in a rule, if it does not occur in the upper part of that rule. $K$ and $H$ are
finite, possibly empty, sequences of $\mathcal{L}$-formulae. For all $R \in \mathcal{T}$, the following
sequences are axiomatic:


(3.110) $xRy,\ x(-R)y$,
(3.111) $x1'x$.


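$(\text{sym } 1')\quad \dfrac{K,\ x1'y,\ H}{K,\ y1'x,\ H}$

$(\text{tran } 1')\quad \dfrac{K,\ x1'y,\ H}{K,\ x1'z,\ H,\ x1'y \mid K,\ z1'y,\ H,\ x1'y}$, where $z$ is any variable

$(1'_1)\quad \dfrac{K,\ xRy,\ H}{K,\ x1'z,\ H,\ xRy \mid K,\ zRy,\ H,\ xRy}$, where $z$ is any variable

$(1'_2)\quad \dfrac{K,\ xRy,\ H}{K,\ xRz,\ H,\ xRy \mid K,\ z1'y,\ H,\ xRy}$, where $z$ is any variable

$(\text{ref } C)\quad \dfrac{K,\ xCy,\ H}{K,\ x1'y,\ xCy,\ H}$

$(\text{sym } C)\quad \dfrac{K,\ xCy,\ H}{K,\ yCx,\ H}$

$(\text{ext } C)\quad \dfrac{K}{K,\ x(-C)z,\ yCz \mid K,\ y(-C)t,\ xCt \mid K,\ x(-1')y}$, where $z$ and $t$ are restricted variables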
K 


(cut C) 





Table 6.4. Specific rules. 

The following result shows that the logic is sound and complete: 
PROPOSITION 3.64 (DÜNTSCH AND ORLOWSKA, 2000) 

i) All decomposition rules are admissible. 

ii) All specific rules are admissible. 

iii) The axiomatic sequences are valid. 

iv) If a formula is valid then it has a closed proof tree.

An example in Düntsch and Orlowska, 2000 shows that there is a CRA with
infinitely many atoms below $1'$, and thus, by a result of Andréka et al., 1997,
the equational logic of CRAs is undecidable.


7. Conclusion


In this chapter we have examined the topic of region-based spatial repre- 
sentation from a number of perspectives. We have looked at the relationships 
between algebraic models, point-set topology and axiomatic theories of spatial 
regions. The approach of modelling space in terms of a Boolean algebra,
supplemented with additional operations and/or relations, provides a very general
and adaptable analysis. Moreover, such algebraic formalisms provide a powerful
tool for establishing correspondences between relational axiomatic theories
and the models of point-set topology. Specifically, we have seen that Boolean
Contact Algebras (BCAs) have essentially the same expressive capabilities as
theories such as the Region Connection Calculus (Randell et al., 1992b), which 
was developed as a knowledge representation formalism for Artificial Intelli- 
gence. We have presented representation theorems that characterise topological 
models of BCAs. 

It is interesting to note that the properties of topological spaces that char- 
acterise the topological representations of relational theories (according to the 
representation theorems of Sec. 4) do not coincide with those most familiar to 
point-set topologists. This is primarily because the elements of region-based 
theories are modelled as regular subsets of a topological space. Thus, the relevant
properties of such spaces are typically weaker than the better known separation
properties, in that they impose conditions on regular subsets of the space rather
than on points or on open or closed sets in general. Although we believe that
these representations are particularly natural, it is worth noting that there may 
be alternative topological representations where the embedding of the algebraic 
structure in the topology takes a different form (as with the representations of 
Roeper, 1997 and Mormann, 1998). 

As well as considering algebras of regions, we have also seen how the for- 
malism of Relation Algebra provides an algebraic treatment of the relational 
concepts of a theory. This proves to be well suited to representing spatial rela- 
tions, in that a large vocabulary of significant spatial relations can be equation- 
ally defined from just the contact relation, C. The Relation Algebraic analysis 
also serves to provide a foundation for the technique of compositional infer- 
ence, which has been found to be effective in a number of AI applications, for 
reasoning with both temporal (Allen, 1983) and spatial relations (Randell et al., 
1992a; Renz and Nebel, 1999). 

A technique that has proved particularly useful for reasoning about topology 
has been the encoding into modal logic. Again, algebra provides a bridging 
formalism, since propositional logics have a direct correspondence to Boolean 
algebras with operators. Because the principal function of logical languages is 
to describe mechanisms of valid inference, much is known about how such
inferences can be automated, and about the computational complexity of reasoning
algorithms using these systems. Thus the encodings have led to the development
of decision procedures and establishment of complexity results for
reasoning about topological relations (Bennett, 1997; Renz and Nebel, 1997; Renz
and Nebel, 1999; Wolter and Zakharyaschev, 2000; Wolter and Zakharyaschev, 
2002). Modal encodings have also been applied to encode relations in projective 
geometry (Balbiani et al., 1997; Venema, 1999). 

Another promising avenue for extending the use of modal encodings is by the 
use of multi-dimensional modal logics (Segerberg, 1973; Marx and Venema, 
1997; Gabbay et al., 2003), which are multi-modal logics, with different modal- 
ities ranging over orthogonal dimensions of their model structures. These have 
been used to capture both multiple spatial dimensions and the combination of 
space with time. Yet another approach is to employ modal logics in which
spatial relations serve as the accessibility relations of
the modal operators (Cohn, 1993; Lutz and Wolter, 2006).

The current chapter has focused on purely topological aspects of spatial infor- 
mation. However, other geometrical properties have also been treated in terms 
of region-based relational and algebraic theories. An early paper of Tarski, 
1956a showed how the whole of Euclidean geometry could be re-constructed 
by taking regions (rather than points) as the basic spatial entities, and the rela- 
tion of parthood and the property of sphericity as the conceptual primitives. 
A simpler axiomatisation of a theory of this kind is given by Bennett, 2001. 
Though from a theoretical point of view such formalisms are highly interesting,
it is less clear whether they could provide useful mechanisms for computing
inferences. It seems that by adding only a little more than topology to a
representation one easily obtains a computationally intractable theory. For instance,
in Davis et al., 1999 it is shown that reasoning with the RCC-8 relations together
with a convexity predicate is already massively intractable. The question of
the expressive power of region-based theories has been the subject of much
research (e.g. Pratt and Schoop, 2000); Davis, 2006 gives some rather general
results demonstrating the very high expressive power of theories that allow
quantification over regions.

One potentially very useful development of spatial logics, which has yielded 
positive results regarding tractability, is the combination of spatial and temporal 
concepts into a combined spatio-temporal calculus. Certain restricted syntax 
fragments of modal logics that can encode spatial and temporal information can 
express a significant range of spatio-temporal relationships whilst remaining 
tolerably amenable to automated reasoning (Wolter and Zakharyaschev, 2000; 
Bennett et al., 2002). 

An important aspect of space that has not been explicitly considered in this 
chapter is dimensionality. The formalisms presented in this chapter do not 
explicitly constrain the dimensionality either of the regions or the embedding 
space. However, the interpretation of regions as regular sets of a space means
that in such models, regions will all have the same dimension as the whole
space. The dimensionality of the space could be fixed by appropriate axioms
constraining the connection relation, but the dimensionality of regions
would still be uniform. For many applications it would be useful to have a 
richer theory incorporating regions of different dimensionality into its domain. 
Axiomatic topological theories that can handle diverse dimensionalities have 
been proposed in Gotts, 1996, Galton, 1996 and Galton, 1994. However, the 
relationship between axiomatic theories of this kind and topological models has 
not been fully established and is certainly a rich area for further work. 

The more computationally amenable region-based calculi also suffer from
inexpressivity regarding self-connectedness of regions, i.e. the domain can
include multi-piece regions, but single and multi-piece regions cannot be
distinguished within the theory. (The distinction can easily be made in first-order
theories such as the full RCC theory, where we can define


$\forall x[\text{Self-Connected}(x) \leftrightarrow \forall y \forall z[(x = y + z) \to yCz]]$,


but the full RCC theory is undecidable: Dornheim, 1998.) This is closely
related to their inexpressivity in regard to dimensionality. It was shown in
Renz, 1998 that any consistent set of RCC-8 relations has a model in which the 
regions are self-connected regular subsets of a three-dimensional space. How- 
ever, as explained in Grigni et al., 1995, an interpretation over self-connected 
regions of two-dimensional space may not be possible, despite the existence of 
a higher dimensional model. Hence, in this case, enforcing self-connectedness 
of regions would have no effect on consistency unless we also had some
means of enforcing planarity (or linearity) of the space. Developing any 
kind of computationally effective calculus for reasoning about topological re- 
lations between self-connected regions in two-dimensional space has proved 
elusive. Some results developed from a graph-theoretic viewpoint suggest 
that this is at least NP-hard, and may well be undecidable (Kratochvíl, 1991;
Kratochvíl and Matoušek, 1991).

Another constraint on the structure of space which has received attention
is discreteness. There are many applications, such as describing or reasoning
about video images, where one is dealing with a discrete spatial structure.
Axiomatic theories which allow atomic regions have been investigated by
Masolo and Vieu, 1999, Galton, 1999 and Düntsch and Winter, 2006, while Düntsch
and Vakarelov, 2006 present a generalisation of BCAs, in which the extensionality
axiom is dropped, and prove a representation theorem in terms of discrete
proximity spaces. A deep analysis of the connection between relational,
algebraic and topological properties of proximity and contact algebras and their
representation spaces, including the discrete case, can be found in Dimov and
Vakarelov, 2006.

The diversity of spatial formalisms is testament to the richness and depth of
spatial concepts. Indeed Tarski, 1956b suggested that geometrical primitives 
may provide a conceptual basis from which all precise concepts can be defined. 
Although axiomatic, region-based theories of topology are increasingly well 
understood and integrated with related areas of mathematics and knowledge 
representation, many directions for further research remain open.


1. Introduction


Qualitative reasoning is an approach for dealing with commonsense knowl- 
edge without using numerical computation. Instead, one tries to represent 
knowledge using a limited vocabulary such as qualitative relationships be- 
tween entities or qualitative categories of numerical values, for instance, using 
{+, —, 0} for representing real values. An important motivation for using a 
qualitative approach is that it is considered to be closer to how humans repre- 
sent and reason about commonsense knowledge. Another motivation is that it is 
possible to deal with incomplete knowledge. Qualitative reasoning, however, 
is different from fuzzy computation. While fuzzy categories are approxima- 
tions to real values, qualitative categories make only as many distinctions as 
necessary—the granularity depends on the corresponding application. 

Two very important concepts of commonsense knowledge are time and space.
Time, being a scalar entity, is very well suited for a qualitative approach and,
thus, qualitative temporal reasoning emerged early as a lively sub-field of
qualitative reasoning which has generated a lot of research effort and important
results. Space, in turn, is much more complex than time. This is mainly due
to its inherent multi-dimensionality, which leads to a higher degree of freedom
and an increased possibility of describing entities and relationships between
entities. This becomes clear when enumerating natural language expressions
involving space or time. While temporal expressions mainly describe order
and duration (like “before”, “during”, “long”, or “a while”) or a personal or
general temporal category (like “late” or “morning”), spatial expressions are
manifold. They are used for describing, for instance, direction (“left”, “above”),
distance (“far”, “near”), size (“large”, “tiny”), shape (“oval”, “convex”), or
topology (“touch”, “inside”). It is obvious that most spatial expressions in
natural language are purely qualitative.

Although there are doubts that space, because of its multi-dimensionality,
can be adequately dealt with by using only qualitative methods (the poverty
conjecture of Forbus et al., 1987), qualitative spatial reasoning has become an
active research area. Because of the richness of space and its multiple aspects,
however, most work in qualitative spatial reasoning has focused on single as- 
pects of space. The most important aspects of space are topology, orientation, 
and distance. As shown in psychological studies (Piaget and Inhelder, 1948), 
this is also the order in which children acquire spatial notions. Other aspects 
of space include size, shape, morphology, and spatial change (motion). 

Orthogonal to this view is the question of the right spatial ontology. One
line of research considers points as the basic entities, another line considers
extended spatial entities such as spatial regions as basic entities. While it is easier
to deal with points rather than with regions in a computational framework,
taking regions as the basic entities is certainly more adequate for commonsense
reasoning; after all, all physical objects are extended spatial entities.
Furthermore, if points are required, they can be constructed from regions (Biacino
and Gerla, 1991). A further ontological distinction is the nature of the
embedding space. The most common notion of space is n-dimensional continuous
space ($\mathbb{R}^n$). But there are also approaches which consider, e.g., discrete (Galton,
1999) or finite space (Gotts, 1996).

The most popular reasoning methods used in qualitative spatial reasoning are 
constraint based techniques adopted from previous work in temporal reasoning 
(see Sec. 2 for a comprehensive introduction to these techniques). In this 
chapter, we will focus exclusively on these techniques. In order to apply them, 
it is necessary to have a set of qualitative binary basic relations which have 
the property of being jointly exhaustive and pairwise disjoint, i.e., between 
any two spatial entities exactly one of the basic relations holds. The set of all 
possible relations is then the set of all possible unions of the basic relations. 
Reasoning can be done by exploiting composition of relations. For instance, if
the binary relation $R_1$ holds between entities A and B and the binary relation
$R_2$ holds between B and C, then the composition of $R_1$ and $R_2$ restricts the
possible relationship between A and C. Compositions of relations are usually
pre-computed and stored in a composition table.

The rest of the chapter is structured as follows. In Sec. 2, we explain the
general idea of constraint-based reasoning. In Sec. 3, a number of different
constraint calculi are introduced, which cover topology, orientation, and distance.
Since we are interested in reasoning in these spatial calculi, we have to
consider what computational resources are necessary to accomplish that. For this
purpose, we will introduce computational complexity theory and explain how
it can be applied in the context of constraint-based reasoning. As we will see,
almost all qualitative spatial calculi are computationally intractable. However,
it is possible to identify tractable subsets, as we will show in Sec. 5. Based on
these results, we will have a look at the practical efficiency of reasoning in
spatial calculi using tractable fragments in Sec. 6. Finally, in Sec. 7, we consider
the combination of different spatial calculi.


2. Constraint-based methods for qualitative spatial 
representation and reasoning 


Knowledge about entities or about the relationships between entities is often 
given in the form of constraints. For instance, when trying to place furniture 
in a room there are certain constraints on the position of the objects. Unary 
constraints such as “The room is 5 metres in length and 6 in breadth” restrict 
the domain of single variables, the length and the breadth of the room. Binary 
constraints like “The desk should be placed in front of the window”, ternary 
constraints like “The table should be placed between the sofa and the arm- 
chair”, or in general n-ary constraints restrict the domain of 2, 3, or n variables. 
Problems like these can be formalised as constraint satisfaction problems. 


Given a set of $m$ variables $V = \{x_1, \ldots, x_m\}$ over a domain $D$, an $n$-ary
constraint consists of an $n$-ary relation $R_i \subseteq D^n$ and an $n$-tuple of variables
$\langle x_{i_1}, \ldots, x_{i_n} \rangle$, written $R_i(x_{i_1}, \ldots, x_{i_n})$. For binary constraints, we will also
use the infix notation $x_1 R_i x_2$. A (partial) instantiation $f$ of variables to values
is a (partial) function from the set of variables $V$ to the set of values $D$. We
say that an instantiation $f$ satisfies the constraint $R_i(x_{i_1}, \ldots, x_{i_n})$ if and only
if $\langle f(x_{i_1}), \ldots, f(x_{i_n}) \rangle \in R_i$.

A constraint satisfaction problem (CSP) consists of a set of variables $V$ over
a domain $D$ and a set of constraints $\Theta$. The intention is to find a solution which
is an instantiation such that all constraints in $\Theta$ are satisfied.

In this work we restrict ourselves to binary CSPs, i.e., CSPs where only
binary constraints are used. A binary CSP can be represented by a constraint
network which is a labelled digraph where each node is labelled by a variable
$x_i$ or by the variable index $i$ and each directed edge is labelled by a binary
relation. We will use the notation $R_{ij}$ to denote the relation constraining the
variable pair $\langle x_i, x_j \rangle$. By overloading notation, we also use $R_{ij}$ to denote the
constraint $R_{ij}(x_i, x_j)$ itself.

A CSP is consistent if it has a solution. If the domain of the variables is 
finite, CSPs can be solved by backtracking over the ordered domains of the 
single variables. Backtracking works by successively instantiating variables 
with values of the ordered domain until either all variables are instantiated and 
a solution is found or an inconsistency is detected in which case the current 
variable is instantiated with the next value of its domain. If all possible instan- 
tiations of the current variable lead to an inconsistency, the previous variable 
becomes the current variable and the process is repeated. Backtracking is in 
general exponential in the number of variables. The process can be sped up by 
propagating constraints between the variables and eliminating impossible val- 
ues as soon as possible. If the domain of the variables is infinite, backtracking 
over the domain is not possible and other methods have to be applied. 
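The backtracking scheme just described can be sketched as follows. This is a minimal illustration rather than an optimised solver: it assumes a single finite ordered domain shared by all variables and represents each binary constraint extensionally as a set of allowed value pairs; the names `backtrack` and `consistent` are ours.

```python
def consistent(assignment, constraints):
    """Check every constraint whose two variables are both instantiated."""
    for (a, b), allowed in constraints.items():
        if a in assignment and b in assignment:
            if (assignment[a], assignment[b]) not in allowed:
                return False
    return True


def backtrack(variables, domain, constraints, assignment=None):
    """Chronological backtracking for a finite-domain binary CSP.

    variables   : list of variable names, instantiated in this order
    domain      : list of values, tried in this order
    constraints : dict mapping (xi, xj) to the set of allowed value pairs
    Returns a solution as a dict, or None if the CSP is inconsistent.
    """
    if assignment is None:
        assignment = {}
    if len(assignment) == len(variables):
        return dict(assignment)
    var = variables[len(assignment)]           # current variable
    for value in domain:                       # next value of its domain
        assignment[var] = value
        if consistent(assignment, constraints):
            result = backtrack(variables, domain, constraints, assignment)
            if result is not None:
                return result
        del assignment[var]
    return None                                # all values failed: go back


domain = [0, 1, 2]
less_than = {(a, b) for a in domain for b in domain if a < b}
solution = backtrack(["x", "y", "z"], domain,
                     {("x", "y"): less_than, ("y", "z"): less_than})
no_solution = backtrack(["x", "y", "z"], domain,
                        {("x", "y"): less_than, ("y", "z"): less_than,
                         ("z", "x"): less_than})
```

The first network (x < y < z over {0, 1, 2}) has exactly one solution; adding the cyclic constraint z < x makes every instantiation fail, so the search exhausts the domain of the first variable and reports inconsistency.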


2.1 Binary Constraint Satisfaction Problems and 
Relation Algebras 


One way of dealing with infinite domains is using constraints over a finite set
of binary relations. Ladkin and Maddux (1994) proposed to employ relation
algebras developed by Tarski, 1941 for this purpose. A relation algebra consists
of a set of binary relations $\mathcal{R}$ which is closed under several operations on
relations and contains some particular relations. The operations are union ($\cup$),
intersection ($\cap$), composition ($\circ$), complement ($-$), and conversion (${}^{\smile}$), where
conversion and composition are defined as follows:


(4.1) $R \circ S \stackrel{\text{def}}{=} \{\langle x, y \rangle \mid \exists z : \langle x, z \rangle \in R \land \langle z, y \rangle \in S\}$
(4.2) $R^{\smile} \stackrel{\text{def}}{=} \{\langle x, y \rangle \mid \langle y, x \rangle \in R\}$


In the following, we will—by abusing notation—identify sets of relations with 
their union. For example, we identify {R,S,T} with RU SUT. 

The particular binary relations mentioned above are the empty relation $\emptyset$
which does not contain any pair, the universal relation $*$ which contains all
possible pairs, and the identity relation Id which contains all pairs of identical
elements.

We assume that a set of constraints $\Theta$ contains one constraint for each pair
of variables involved in $\Theta$, i.e., if no information is given about the relation
holding between two variables $x_i$ and $x_j$, then the universal relation $*$ constrains
the pair, i.e., $R_{ij} = *$. Another assumption that we make is that whenever a
constraint $R_{ij}$ between $x_i$ and $x_j$ is in $\Theta$, the converse relation constrains $x_j$
and $x_i$, i.e., $(R_{ij})^{\smile} = R_{ji}$.

Determining consistency for CSPs with infinite domains is in general
undecidable (Hirsch, 1999). A partial method for determining inconsistency of a
CSP is the path-consistency method which enforces path-consistency of a CSP
(Montanari, 1974; Mackworth, 1977). A CSP is path-consistent if and only
if for any partial instantiation of any two variables satisfying the constraints
between the two variables, it is possible for any third variable to extend the
partial instantiation to this third variable satisfying the constraints between the
three variables.
A straight-forward way to enforce path-consistency on a binary CSP is to 
strengthen relations by successively applying the following operation until a 
fixed point is reached: 


$\forall k:\ R_{ij} := R_{ij} \cap (R_{ik} \circ R_{kj})$.


The resulting CSP is equivalent to the original CSP in the sense that it has
the same set of solutions. If the empty relation results while performing this
operation, we know that the CSP is inconsistent. Otherwise, the CSP might
or might not be consistent. Provided that the composition of relations can be
computed in constant time, the algorithm sketched has a running time of $O(n^5)$,
where $n$ is the total number of nodes in the graph. More advanced algorithms
can enforce path-consistency in time $O(n^3)$ (Mackworth and Freuder, 1985).
Fig. 4.1 shows the $O(n^3)$ time path-consistency algorithm by van Beek, 1992,
which uses a queue to keep track of those triples of variables that might be
affected by the changes made and which have to be analysed again.


2.2 Relation Algebras based on JEPD Relations 


Of particular interest are relation algebras that are based on finite sets of
jointly exhaustive and pairwise disjoint (JEPD) relations. JEPD relations are
sometimes called atomic, basic, or base relations. We refer to them as basic
relations. Since any two entities are related by exactly one of the basic relations,
they can be used to represent definite knowledge with respect to the given level
of granularity. Indefinite knowledge can be specified by unions of possible
basic relations. In this chapter we denote a set of basic relations with $\mathcal{B}$ and
it should be clear from the context which particular set of basic relations we
are referring to. If the set of relations formed by generating all unions over
these basic relations is closed under composition and converse, then this set of
relations is the carrier of a relation algebra. We denote the set of all relations
by $2^{\mathcal{B}}$, alluding to the fact that we identify sets of relations with their unions.

For these relation algebras, the universal relation is the union over all basic 
relations. Converse, complement, intersection and union of relations can easily 
be obtained by performing the corresponding set theoretic operations. Compo- 
sition of basic relations has to be computed using the semantics of the relations. 
Algorithm: PATH-CONSISTENCY
Input: A set $\Theta$ of binary constraints over the variables $x_1, x_2, \ldots, x_n$
Output: path-consistent set equivalent to $\Theta$, or fail, if inconsistency is detected

1. $Q \leftarrow \{(i, j, k), (k, i, j) \mid i < j,\ k \neq i,\ k \neq j\}$;
   ($i$ indicates the $i$-th variable of $\Theta$. Analogously for $j$ and $k$)
2. while $Q \neq \emptyset$ do
3.     select and delete a path $(i, k, j)$ from $Q$;
4.     if REVISE($i, k, j$) then
5.         if $R_{ij} = \emptyset$ then return fail
6.         else $Q \leftarrow Q \cup \{(i, j, k), (k, i, j) \mid k \neq i,\ k \neq j\}$

Function: REVISE($i, k, j$)
Input: three variables $i$, $k$ and $j$
Output: true, if $R_{ij}$ is revised; false otherwise.
Side effects: $R_{ij}$ and $R_{ji}$ revised using the operations $\cap$ and $\circ$
   over the constraints involving $i$, $k$, and $j$.

1. oldR := $R_{ij}$;
2. $R_{ij} := R_{ij} \cap (R_{ik} \circ R_{kj})$;
3. if (oldR = $R_{ij}$) then return false;
4. $R_{ji} := R_{ij}^{\smile}$;
5. return true.


Figure 4.1. Van Beek’s PATH-CONSISTENCY algorithm. 


Composition of unions of basic relations can be obtained by computing the 
union of the composition of the basic relations. Usually, compositions of the 
basic relations are pre-computed and stored in a composition table. 

The best known example of such a relation algebra is the Interval Algebra
introduced by Allen, 1983, which defines 13 different basic relations between
convex intervals on a directed line. The basic relations and a graphical depiction
are given in Table 4.1. Even though the interval algebra was introduced for
temporal representation and reasoning, there are a number of spatial calculi which
are derived from the interval algebra. Some of them we will mention in this
chapter.
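As an illustration of how a composition table can be obtained from the
semantics of the relations, the following Python sketch derives the table of the
interval algebra by brute-force enumeration. It rests on two assumptions that
are not part of the source: converse relations are written with a trailing "i"
(e.g. "mi" for met-by), and intervals are enumerated with integer endpoints
from 0 to 5, which suffices because three intervals have at most six distinct
endpoints. All function names are hypothetical.

```python
from itertools import combinations, product


def cmp(a, b):
    """Qualitative comparison of two endpoints."""
    return "<" if a < b else ">" if a > b else "="


# Endpoint signatures (X- ? Y-, X- ? Y+, X+ ? Y-, X+ ? Y+) for the
# relation of x to y, following the endpoint relations of Table 4.1.
SIGNATURE = {
    ("<", "<", "<", "<"): "<", (">", ">", ">", ">"): ">",
    ("<", "<", "=", "<"): "m", (">", "=", ">", ">"): "mi",
    ("<", "<", ">", "<"): "o", (">", "<", ">", ">"): "oi",
    (">", "<", ">", "<"): "d", ("<", "<", ">", ">"): "di",
    ("=", "<", ">", "<"): "s", ("=", "<", ">", ">"): "si",
    (">", "<", ">", "="): "f", ("<", "<", ">", "="): "fi",
    ("=", "<", ">", "="): "=",
}


def allen(x, y):
    """Basic relation between intervals x = (x-, x+) and y = (y-, y+)."""
    (x0, x1), (y0, y1) = x, y
    return SIGNATURE[(cmp(x0, y0), cmp(x0, y1), cmp(x1, y0), cmp(x1, y1))]


def composition_table(num_endpoints=6):
    """Composition of basic relations, computed from the semantics."""
    intervals = list(combinations(range(num_endpoints), 2))
    table = {}
    for x, y, z in product(intervals, repeat=3):
        key = (allen(x, y), allen(y, z))
        table.setdefault(key, set()).add(allen(x, z))
    return table


def compose(r, s, table):
    """Composition of unions: the union of the basic compositions."""
    out = set()
    for a, b in product(r, s):
        out |= table[(a, b)]
    return out
```

Pre-computing `composition_table()` once and looking entries up in `compose`
mirrors the usual practice of storing the compositions of basic relations.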

We say that a relation $R$ is a refinement of a relation $S$ if and only if
$R \subseteq S$. Given, for instance, a union of relations $\{R_1, R_2, R_3\}$, then the relation
$\{R_1, R_2\}$ is a refinement of the former relation. This definition carries over to
constraints and to sets of constraints. Then, a set of constraints $\Theta'$ is a refinement
of $\Theta$ if and only if both CSPs have the same variables and for all relations $R'_{ij}$
Interval             Symbol          Pictorial      Endpoint
Base Relation                        Example        Relations

x before y           $<$             xxx            $X^- < Y^-$, $X^- < Y^+$,
y after x            $>$                 yyy        $X^+ < Y^-$, $X^+ < Y^+$
x meets y            $m$             xxxx           $X^- < Y^-$, $X^- < Y^+$,
y met-by x           $m^{\smile}$        yyyy       $X^+ = Y^-$, $X^+ < Y^+$
x overlaps y         $o$             xxxx           $X^- < Y^-$, $X^- < Y^+$,
y overlapped-by x    $o^{\smile}$      yyyy         $X^+ > Y^-$, $X^+ < Y^+$
x during y           $d$               xxx          $X^- > Y^-$, $X^- < Y^+$,
y includes x         $d^{\smile}$    yyyyyyyy       $X^+ > Y^-$, $X^+ < Y^+$
x starts y           $s$             xxx            $X^- = Y^-$, $X^- < Y^+$,
y started-by x       $s^{\smile}$    yyyyyyyy       $X^+ > Y^-$, $X^+ < Y^+$
x finishes y         $f$                  xxx       $X^- > Y^-$, $X^- < Y^+$,
y finished-by x      $f^{\smile}$    yyyyyyyy       $X^+ > Y^-$, $X^+ = Y^+$
x equals y           $=$             xxxx           $X^- = Y^-$, $X^- < Y^+$,
                                     yyyy           $X^+ > Y^-$, $X^+ = Y^+$





Table 4.1. The thirteen basic relations of Allen’s interval algebra. 


constraining the pair $x_i, x_j$ in $\Theta'$ and all relations $R_{ij}$ constraining the same
variables in $\Theta$, we have $R'_{ij} \subseteq R_{ij}$. $\Theta'$ is said to be a consistent refinement of
$\Theta$ if and only if $\Theta'$ is a refinement of $\Theta$ and both $\Theta$ and $\Theta'$ are consistent. A
consistent scenario $\Theta_s$ of a set of constraints $\Theta$ is a consistent refinement of $\Theta$
where all the constraints of $\Theta_s$ are assertions of basic relations.

This chapter deals with determining consistency of binary constraint
satisfaction problems that are based on JEPD relations. Let $\mathcal{B}$ be a finite set of JEPD
binary relations. The consistency problem CSPSAT($\mathcal{S}$) for sets $\mathcal{S} \subseteq 2^{\mathcal{B}}$ over a
(possibly infinite) domain $D$ is defined as follows:


Instance: A set $V$ of variables over a domain $D$ and a finite set $\Theta$ of binary
constraints $R(x_i, x_j)$, where $R \in \mathcal{S}$ and $x_i, x_j \in V$.


Question: Is there an instantiation of all variables in $\Theta$ with values from $D$
such that all constraints are satisfied?


In the general case CSPSAT is undecidable (Hirsch, 1999), but in many
interesting cases it is possible to prove decidability or even tractability of CSPSAT($\mathcal{S}$).

If CSPSAT is decidable for a certain subset $\mathcal{S} \subseteq 2^{\mathcal{B}}$, then it is possible to
decide CSPSAT for other subsets of $2^{\mathcal{B}}$ by using a non-deterministic algorithm.
This is done by selecting refinements of the relations such that the refinements
are contained in $\mathcal{S}$. For example, suppose $\mathcal{S} \subseteq 2^{\mathcal{B}}$ contains the relations
$S_1, \ldots, S_n$, the relation $R \in 2^{\mathcal{B}}$ is not contained in $\mathcal{S}$, and $R = S_1 \cup S_3 \cup S_4$.
Then, the constraint $R(x, y)$ can be processed by guessing non-deterministically
one of the relations $S_1$, $S_3$, and $S_4$. In general, a subset $\mathcal{S} \subseteq 2^{\mathcal{B}}$ splits another
subset $\mathcal{T} \subseteq 2^{\mathcal{B}}$ exhaustively if for every relation $T$ of $\mathcal{T}$ there are refinements
Algorithm: CONSISTENCY

Input: A set $\Theta$ of binary constraints over the variables $x_1, x_2, \ldots, x_n$ and a
subset $\mathcal{S} \subseteq 2^{\mathcal{B}}$ that splits $2^{\mathcal{B}}$ exhaustively and for which CSPSAT($\mathcal{S}$) is decidable.
Output: true, iff $\Theta$ is consistent.


1. PATH-CONSISTENCY($\Theta$)

2. if $\Theta$ contains the empty relation then return false

3. else choose an unprocessed constraint $R(x, y)$ and
   split $R$ into $S_1, \ldots, S_k \in \mathcal{S}$ such that $S_1 \cup \ldots \cup S_k = R$

4. if no constraint can be split then return DECIDE($\Theta$)

5. for all refinements $S_l$ ($1 \leq l \leq k$) do

6.     replace $R(x, y)$ with $S_l(x, y)$ in $\Theta$

7.     if CONSISTENCY($\Theta$) then return true


Figure 4.2. Backtracking algorithm for deciding consistency. 


$S_1, \ldots, S_k \in \mathcal{S}$ such that $T = S_1 \cup \ldots \cup S_k$. If $\mathcal{S}$ splits $\mathcal{T}$ exhaustively, it is
obvious that decidability of CSPSAT($\mathcal{S}$) implies decidability of CSPSAT($\mathcal{T}$).
Furthermore, it implies that CSPSAT($\mathcal{T}$) can be decided in polynomial time
on a non-deterministic Turing machine, if CSPSAT($\mathcal{S}$) can be decided in
polynomial time.

The non-deterministic algorithm sketched above can be turned into a
deterministic one by employing a backtracking scheme. The backtracking algorithm
given in Fig. 4.2 is a generalisation of the one proposed by Ladkin and
Reinefeld, 1992, and relies on a set $\mathcal{S}$ that splits $2^{\mathcal{B}}$ exhaustively. Note that a set $\mathcal{S}$
splits $2^{\mathcal{B}}$ exhaustively if and only if $\mathcal{S}$ contains all base relations. $\mathcal{S}$ is called
the split set. The backtracking algorithm uses a function DECIDE which is a
sound and complete decision procedure for CSPSAT($\mathcal{S}$). The (optional)
procedure PATH-CONSISTENCY in line 1 is used as forward-checking and restricts
the remaining search space. Nebel, 1997 showed that this restriction preserves
soundness and completeness of the algorithm, provided the split set is closed
under intersection, composition, and converse. If the decision procedure
DECIDE runs in polynomial time, CONSISTENCY is exponential in the number
of constraints of $\Theta$. If enforcing path-consistency is sufficient for deciding
CSPSAT($\mathcal{S}$), DECIDE($\Theta$) in line 4 is not necessary and one can return true at
this point.
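The backtracking scheme of Fig. 4.2 can be sketched in Python as follows. This
is a minimal illustration under stated assumptions, not the implementation of
Ladkin and Reinefeld: a constraint counts as "unprocessed" when `split` returns
more than one part, forward checking is passed in as an optional `propagate`
function, the final `return False` is made explicit, and the point-algebra
helpers `split_into_basic` and `decide_points` are hypothetical stand-ins for a
split set and a DECIDE procedure.

```python
from itertools import product


def consistency(theta, split, decide, propagate=lambda t: t):
    """Backtracking search over refinements, following Fig. 4.2.

    theta     -- dict mapping variable pairs (i, j) to frozensets of
                 basic relations
    split     -- returns relations of the split set whose union is rel
    decide    -- decision procedure for networks over the split set
    propagate -- optional forward checking, e.g. path-consistency
    """
    theta = propagate(dict(theta))
    if any(not rel for rel in theta.values()):
        return False
    for pair, rel in theta.items():
        parts = split(rel)
        if len(parts) > 1:      # rel is not yet in the split set
            break
    else:
        return decide(theta)    # nothing left to split
    for part in parts:
        refined = dict(theta)
        refined[pair] = part
        if consistency(refined, split, decide, propagate):
            return True
    return False


def split_into_basic(rel):
    """Split set = basic relations: split a union into its members."""
    return [frozenset(b) for b in sorted(rel)]


def decide_points(theta):
    """Brute-force DECIDE for networks of basic point relations."""
    holds = {"<": lambda a, b: a < b,
             "=": lambda a, b: a == b,
             ">": lambda a, b: a > b}
    variables = sorted({v for pair in theta for v in pair})
    for values in product(range(len(variables)), repeat=len(variables)):
        val = dict(zip(variables, values))
        if all(holds[next(iter(rel))](val[i], val[j])
               for (i, j), rel in theta.items()):
            return True
    return False
```

Choosing which constraint to split and in which order to try the parts are the
two places where heuristics can be plugged in.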

The efficiency of the backtracking algorithm depends on several factors. One 
of them is, of course, the size of the search space which has to be explored. A 
common way of measuring the size of the search space is the average branching 
factor, i.e., the average number of branches each node in the search space has. 
For the backtracking algorithm described in Fig. 4.2 this depends on the average 
number of relations of the split set $\mathcal{S}$ into which a relation has to be split. The
fewer splits on average, the better; i.e., it is to be expected that the efficiency of
the backtracking algorithm depends on the split set $\mathcal{S}$ and its branching factor.
Another factor is how the search space is explored. The backtracking algorithm 
of Fig. 4.2 offers two possibilities of applying heuristics. One is in line 3 where 
the next unprocessed constraint can be chosen, the other is in line 5 where the 
next refinement can be chosen. These two choices influence the search space 
and the path through the search space. Good choices should increase efficiency 
of the backtracking algorithm. 

Other fundamental reasoning problems are the minimal label problem
CSPMIN, i.e., the problem of finding the strongest entailed relation for each pair of
variables from a given set of constraints, and the entailment problem CSPENT, i.e.,
deciding whether a particular constraint is entailed by a set of constraints. As was
shown for the corresponding temporal problems (see the next section), the
entailment problem, the minimal label problem, and the consistency problem are
equivalent under polynomial Turing reductions (Vilain et al., 1989; Golumbic
and Shamir, 1993).
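One standard way to see how entailment and minimal labels reduce to
consistency is sketched below in Python. It is an illustration only, not the
construction of the cited papers: it assumes the point algebra as the calculus
and uses a brute-force consistency check as the oracle; all names are
hypothetical.

```python
from itertools import product

BASE = frozenset("<=>")
HOLDS = {"<": lambda a, b: a < b,
         "=": lambda a, b: a == b,
         ">": lambda a, b: a > b}


def is_consistent(n, theta):
    """Brute-force consistency oracle for a point-algebra network.

    theta is a list of ((i, j), rel) constraints over variables 0..n-1,
    where rel is an iterable of basic relations.
    """
    return any(
        all(any(HOLDS[b](v[i], v[j]) for b in rel) for (i, j), rel in theta)
        for v in product(range(n), repeat=n)
    )


def entails(n, theta, pair, rel):
    """theta entails rel on pair iff adding the complement is inconsistent."""
    complement = BASE - frozenset(rel)
    return not is_consistent(n, theta + [(pair, complement)])


def minimal_label(n, theta, pair):
    """Strongest entailed relation: basic relations that remain possible."""
    return frozenset(b for b in BASE if is_consistent(n, theta + [(pair, b)]))
```

Each query uses only a constant number of calls to the consistency oracle per
pair of variables, which is the shape of a polynomial Turing reduction.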


3. Spatial constraint calculi 


In qualitative spatial reasoning it is common to consider a particular aspect 
of space such as topology, direction, or distance and to develop a system of 
qualitative relationships between spatial entities which cover this aspect of 
space to some degree and which appear to be useful from an applicational or 
from a cognitive perspective. If these relations are based on a set of jointly 
exhaustive and pairwise disjoint basic relations which is closed under several 
operations, it is possible to apply constraint based methods for reasoning over 
these relations (see Sec. 2). For this it is only necessary to give a composition 
table; either for all relations or for the basic relations plus a procedure for 
computing the compositions of complex relations. 

The composition table should be obtained using the formal semantics of the 
relations. Otherwise it is not possible to verify correctness and completeness 
of the inferences. Formal semantics of the relations are also necessary for find- 
ing efficient reasoning algorithms which are essential for most applications. 
Without formal semantics it is sometimes not even possible to show that rea- 
soning over a system of relations is decidable (e.g., the 9-intersection relations 
of Egenhofer, 1991, as shown by Grigni et al., 1995). 

In the following subsections we survey some important approaches to the
main aspects of space: topology, direction, and distance. Instead of
summarising many different approaches, we focus on those approaches which have been
formally analysed.
Table 4.2. Topological interpretation of the eight base relations of RCC-8. All spatial regions
are regular closed, i.e., $x = c(i(x))$ and $y = c(i(y))$. $i(\cdot)$ specifies the topological interior of a
spatial region, $c(\cdot)$ the topological closure.


3.1 Topology 


Topological distinctions between spatial entities are a fundamental aspect of 
spatial knowledge. Topological distinctions are inherently qualitative which 
makes them particularly interesting for qualitative spatial reasoning. Although 
there is a large body of work on topology developed in mathematics, this is not 
very well suited for qualitative spatial reasoning. Mathematical research on 
topology is not concerned with reasoning over topological relationships and as 
such does not provide us with any reasonable topological calculi and reasoning 
mechanisms (Gotts et al., 1996). 

Topological approaches to qualitative spatial reasoning usually describe re- 
lationships between spatial regions rather than points, where spatial regions are 
subsets of some topological space. Most existing approaches on formalising 
topological properties of spatial regions are based on work from Whitehead, 
1929, Clarke, 1981 and Clarke, 1985, who axiomatised mereotopology using a 
single primitive relation, the binary connectedness relation. Some approaches 
also distinguish between a mereological primitive, the parthood relation, and 
a topological primitive, the connected relation (Borgo et al., 1996). Using 
these primitive relations it is possible to define many other relations. A set of 
jointly exhaustive and pairwise disjoint relations which can be defined in all 
approaches of this kind are the eight relations DC, EC, PO, EQ, TPP, NTPP,
TPP$^{-1}$, NTPP$^{-1}$. In the best known approach in this domain, the Region
Connection Calculus by Randell et al., 1992, these relations are known as the 
RCC-8 relations. In Table 4.2 we defined the RCC-8 relations using the inte- 
rior and exterior of spatial regions. Sample instances of the relations are given 
in Fig. 4.3. The relation symbols are abbreviations of their meanings: DisCon- 
nected, Externally Connected, Partially Overlapping, EQual, Tangential Proper 
Part, Non-Tangential Proper Part and the converse relations of the latter two 
relations. 
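To make the eight relations concrete, the following Python sketch classifies
the RCC-8 relation between two closed disks in the plane, which are regular
closed regions. Restricting regions to disks, the tolerance `eps`, and the
ASCII names `TPPi` and `NTPPi` for the converse relations are assumptions made
for this illustration only.

```python
import math


def rcc8_disks(c1, r1, c2, r2, eps=1e-9):
    """RCC-8 relation between closed disks x = (c1, r1) and y = (c2, r2).

    c1, c2 are centre points, r1, r2 are positive radii.
    """
    d = math.dist(c1, c2)
    if d > r1 + r2 + eps:
        return "DC"                      # no common point
    if abs(d - (r1 + r2)) <= eps:
        return "EC"                      # boundaries touch from outside
    if d <= eps and abs(r1 - r2) <= eps:
        return "EQ"
    if abs(d - abs(r1 - r2)) <= eps:     # one disk touches the other inside
        return "TPP" if r1 < r2 else "TPPi"
    if d < abs(r1 - r2):                 # strictly inside, no contact
        return "NTPP" if r1 < r2 else "NTPPi"
    return "PO"
```

Because the eight relations are jointly exhaustive and pairwise disjoint, the
function returns exactly one label for every pair of disks.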
[Figure: eight panels, each showing two regions $x$ and $y$ with
$(x, y) \in$ DC, EC, TPP, TPP$^{-1}$ (top row) and
$(x, y) \in$ PO, EQ, NTPP, NTPP$^{-1}$ (bottom row).]


Figure 4.3. Two-dimensional examples for the eight basic relations of RCC-8. 


What distinguishes the different approaches and what thereby influences 
the definable relations is the interpretation of the connectedness relation and 
the properties of the considered regions. Some approaches distinguish between 
open and closed regions (Randell and Cohn, 1989; Asher and Vieu, 1995), which
makes it possible, for instance, to define different kinds of contact. Asher and Vieu, 1995
distinguished between strong contact (two regions have points in common) and 
weak contact (two regions are disjoint but their topological closures share com- 
mon points). Other approaches do not make this distinction and treat regions 
which are open, closed, or neither equally (Randell et al., 1992). The Region 
Connection Calculus (Randell et al., 1992) considers only the topological clo- 
sure of regions: two regions are connected if their topological closures share a 
common point. Cohn et al., 1997 argue that this definition is more appropriate 
for commonsense spatial reasoning since there is “no reason to believe that 
some physical objects occupy closed regions and others open”. Orthogonal to 
the interpretation of the connectedness relation is the distinction of what regions 
are considered. A very common restriction is to use only non-empty regular 
regions. As shown by Asher and Vieu, 1995, models based on Clarke’s con- 
nectedness relation require all regions to be nonempty and regular. However, 
it is possible to specify additional properties of regions such as dimensionality, 
internal connectedness, i.e., whether a region consists of one piece or of
multiple pieces, or the existence of holes. The different approaches are compared
in Cohn and Varzi, 1998 and Cohn and Varzi, 1999. In particular, the RCC-8 
constraint language uses non-empty, regular closed regions which are subsets 
of a regular connected topological space. Regions do not have to be internally 
connected and are allowed to have holes. 

All of these approaches have in common that the relations are axiomatised 
and defined in first-order logic which provides them with formal semantics. 
The formal properties of first-order theories based on a connectedness relation 
were studied by Grzegorczyk, 1951, Dornheim, 1998, Pratt and Schoop, 1998
and Schoop, 1999. These are very expressive approaches and lead easily to
undecidability of formal reasoning, i.e., logical implication in these theories is not
decidable in general. When restricting oneself to less expressive languages
such as constraint calculi, the computational costs are, of course, lower.
Constraint calculi can be regarded as the special case of first-order sentences where
only existentially quantified region variables are used. Using the RCC-8
relations, we can state constraints such as, for example, DC($x_1, x_2$), TPP$^{-1}$($x_2, x_3$),
{EC, PO}($x_3, x_1$). These are interpreted over the domain of regular closed
regions of any regular topological space, such as the n-dimensional
Euclidean space. Now, given the composition table of RCC-8, we can use the
algorithms introduced in the previous section in order to decide whether the set
of constraints is consistent, or whether other constraints are logically implied.
A prerequisite, however, is that a method for deciding consistency for some
subset of the relation system formed by the RCC-8 relations can be found.
Bennett, 1994 gave an encoding of the RCC-8 relations in propositional modal
logic and thus proved that reasoning over the RCC-8 constraint language is
decidable. In fact, this technique can be used as a decision method for RCC-8
constraint systems.

A different approach to defining topological relations was given by Egen- 
hofer, 1991 in the area of spatial information systems. Egenhofer defined binary 
relations according to the 9 different intersections of the interior, exterior, and 
boundary of regions, hence, called 9-intersection. Depending on the regions 
that are used, many different relations can be defined in this way (Egenhofer
et al., 1994; Egenhofer and Franzosa, 1994). If only two-dimensional, internally
connected regular regions without holes are considered and only emptiness or 
non-emptiness of the intersection is taken into account, this results in the same 
set of eight basic relations as definable in the above described approaches. In 
contrast to the connection based approaches, this approach is not provided with 
formal semantics which makes it very difficult to study its formal properties. 
For instance, attempts were made to identify sound and complete algorithms 
for reasoning over the eight relations defined by Smith and Park, 1992 and 
Egenhofer and Sharma, 1993, while it was taken for granted that path-consistency 
decides consistency if only basic relations are used. As shown by Grigni et al., 
1995, this is not the case for Egenhofer’s definition of the eight topological 
relations. 


3.2 Orientation 


Orientation is, like topology, very well suited for a qualitative approach. 
In everyday (non-technical) communication, orientation of spatial entities with 
respect to other spatial entities is usually given in terms of a qualitative category 
Figure 4.4. Orientation relations between points: (a) cone-based (b) projection-based (c)
double-cross. 


like “to the left of” or “northeast of” rather than using a numerical expression like
“53 degrees” (which is certainly more common in technical communication, as
in aviation). Unlike the topological relations we discussed in the previous
section, orientation of spatial entities is a ternary relationship depending on the
located object, the reference object, and the frame of reference, which can be
specified either by a third object or by a given direction. In the literature one
distinguishes between three different kinds of frames of reference: extrinsic
(external factors impose an orientation on the reference object), intrinsic (the
orientation is given by some inherent property of the reference object), and
deictic (the orientation is imposed by the point of view from which the reference
object is seen) (Hernandez, 1994, p. 45). If the frame of reference is given,
orientation can be expressed in terms of binary
relationships with respect to the given frame of reference.

Most approaches to qualitatively dealing with orientation are based on points 
as the basic spatial entities and consider only two-dimensional space. Frank, 
1991 suggested different methods for describing the cardinal direction of a point 
with respect to a reference point in a geographic space, i.e., directions are in 
the form of “north”, “east”, “south”, and “west” depending on the granularity. 
These are, however, just labels which can be equally termed as, for instance, 
“front”, “right”, “back”, and “left” in a local space. Frank distinguishes between 
two different methods for determining the different sectors corresponding to 
the single directions: the cone-based method and the projection-based method 
(see Fig. 4.4). The projection-based approach allows us to represent the nine 
different relations (n, ne, e, se, s, sw, w, nw, eq) in terms of the point algebra
by specifying a point algebraic relation for each of the two axes separately.
This provides the projection-based approach (which is also called the cardinal
algebra; see Ligozat, 1998) with formal semantics which were used by Ligozat,
1998 to study its computational properties. In particular, Ligozat found that
reasoning with the cardinal algebra is NP-complete (see Sec. 4 below) and,
Figure 4.5. Two different Star calculi, one with 8 lines forming 33 relations and one with 4
lines forming 17 relations. 


further, identified a maximal tractable subset of the cardinal algebra by using 
the concept of preconvex relations, a method which has already been used for 
Allen’s interval algebra (Ligozat, 1996). 
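The projection-based representation can be illustrated with a short Python
sketch that obtains the cardinal relation of a point from one point-algebra
relation per axis. The coordinate convention (x grows to the east, y grows to
the north) and the names `point_rel`, `NAMES` and `cardinal` are assumptions of
this illustration.

```python
def point_rel(a, b):
    """Point-algebra relation between two coordinates."""
    return "<" if a < b else ">" if a > b else "="


# (relation on the x-axis, relation on the y-axis) -> cardinal relation
NAMES = {
    ("=", "="): "eq",
    ("=", ">"): "n", (">", ">"): "ne", (">", "="): "e", (">", "<"): "se",
    ("=", "<"): "s", ("<", "<"): "sw", ("<", "="): "w", ("<", ">"): "nw",
}


def cardinal(p, ref):
    """Cardinal relation of point p with respect to reference point ref."""
    return NAMES[(point_rel(p[0], ref[0]), point_rel(p[1], ref[1]))]
```

Since each of the nine relations is a pair of point relations, reasoning about
a cardinal constraint can be decomposed into the two axes.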

A generalisation of these calculi was proposed and analysed by Renz and 
Mitra, 2004. Their calculus, the Star calculus (see Fig. 4.5), is based on
$n$ lines $l_i$ with given angles $\delta_i$ (for arbitrary $n$), which define $2n$ sectors
and $4n + 1$ basic relations. The number of lines and the angles of the sectors
can be adapted to the given application, so the Star calculus can be used for
representing and reasoning about qualitative directions of arbitrary granularity. 
The Star calculus has some interesting properties. For example, it can be shown 
that when having three or more lines it is possible to emulate a coordinate system 
which is due to having the lines as separate basic relations. This removes 
the distinction made between qualitative and quantitative representation and 
also means that qualitative reasoning methods like path-consistency cannot 
be complete for deciding consistency. Renz and Mitra therefore proposed to 
combine the lines and the sectors and to consider them as new basic relations. In 
both cases the full calculus is NP-hard while reasoning over the basic relations 
is tractable. 
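The count of $4n + 1$ basic relations (2n half-lines, 2n sectors, and equality)
can be made tangible with the following Python sketch. The numbering of the
relations used here is an assumption of this illustration and not necessarily
the one used by Renz and Mitra: even numbers denote half-lines, odd numbers
denote sectors, and $4n$ denotes equality. Angles are assumed to be distinct
values in degrees from the range [0, 180).

```python
import math


def star_relation(p, q, angles, eps=1e-9):
    """Relation of point q with respect to point p for the given lines.

    Returns 2k if q lies on the k-th half-line (counter-clockwise),
    2k + 1 if q lies in the sector following that half-line, and 4n
    if p and q coincide.
    """
    n = len(angles)
    if p == q:
        return 4 * n
    half_lines = sorted(a % 360 for a in angles)
    half_lines += [a + 180 for a in half_lines]   # opposite directions
    theta = math.degrees(math.atan2(q[1] - p[1], q[0] - p[0])) % 360
    for k, h in enumerate(half_lines):
        gap = abs(theta - h)
        if min(gap, 360 - gap) <= eps:            # on a half-line
            return 2 * k
    for k, h in enumerate(half_lines):
        upper = half_lines[k + 1] if k + 1 < 2 * n else 360
        if h < theta < upper:                     # inside a sector
            return 2 * k + 1
    return 4 * n - 1                              # sector wrapping past 360
```

With the four lines at 0, 45, 90 and 135 degrees, the points of a small grid
around the reference point realise all 17 relations.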

A further point-based approach was developed by Freksa, 1992, the so-called 
double-cross calculus, which defines the direction of a located point to a refer- 
ence point with respect to a perspective point. Within this approach three axes 
are used: one is specified by the perspective point and the reference point, the 
other two axes are orthogonal to the first one and are specified by the reference 
point and the perspective point. These axes define 15 different ternary basic 
relations (see Fig. 4.4c). The computational properties of this calculus have 
been studied by Scivos and Nebel, 2001. It turned out that the consistency 
problem is NP-hard even if only the 15 basic relations are used. 
Figure 4.6. Orientation relations between extended entities: (a) rectangle algebra (b) direction-
relation matrix. 


Developing orientation relations between extended spatial entities is much 
more difficult than between points. Extended objects often have their own 
intrinsic directions like a natural front. Also, the direction between extended 
objects with complex shapes such that, for instance, their convex hulls inter- 
sect is not at all clear. Even for simpler objects there is often no agreement 
about which natural language expression describes their orientational relation- 
ship best. Therefore, most approaches use approximations to the spatial re- 
gions or use only spatial regions of a particular kind. One approach which has 
been chosen by many researchers results in restricting all regions to be rectan- 
gles whose sides are parallel to the axes determined by the frame of reference 
(Guesgen, 1989; Papadias and Theodoridis, 1997; Balbiani et al., 1998). In this 
approach all regions can be represented by their projections to the defining axes 
which corresponds to having Allen’s interval algebra (Allen, 1983) for each axis 
separately, i.e., every relation is a pair of two interval relations (see Fig. 4.6a). 
For two-dimensional space this results in 13 × 13 = 169 different basic relations (also
called the rectangle algebra; see Balbiani et al., 1998) whose formal semantics 
are provided by the interval algebra. 
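A basic relation of the rectangle algebra can be computed as a pair of interval
relations, one per axis, as the following Python sketch shows. The
representation of rectangles as `(xmin, ymin, xmax, ymax)` tuples, the trailing
"i" for converse interval relations, and the function names are assumptions of
this illustration.

```python
def allen(x, y):
    """Basic Allen relation of interval x = (start, end) with respect to y."""
    (xs, xe), (ys, ye) = x, y
    if xe < ys:
        return "<"
    if ye < xs:
        return ">"
    if xe == ys:
        return "m"
    if ye == xs:
        return "mi"
    if (xs, xe) == (ys, ye):
        return "="
    if xs == ys:
        return "s" if xe < ye else "si"
    if xe == ye:
        return "f" if xs > ys else "fi"
    if ys < xs and xe < ye:
        return "d"
    if xs < ys and ye < xe:
        return "di"
    return "o" if xs < ys else "oi"


def rectangle_relation(a, b):
    """Pair of interval relations between axis-parallel rectangles.

    a and b are (xmin, ymin, xmax, ymax) tuples; the result is the
    relation of the x-projections followed by that of the y-projections.
    """
    ax0, ay0, ax1, ay1 = a
    bx0, by0, bx1, by1 = b
    return (allen((ax0, ax1), (bx0, bx1)), allen((ay0, ay1), (by0, by1)))
```

For instance, a rectangle whose right edge touches the left edge of another
while the two overlap vertically is related by the pair (m, o).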

Balbiani et al., 1998 and Balbiani et al., 1999a studied the formal properties of
the rectangle algebra (and also of the block algebra, which is the n-dimensional
extension of the interval algebra; see Balbiani et al., 1999b). NP-completeness
of the algebra carries over from the interval algebra. Balbiani et al., 1998
and Balbiani et al., 1999a identified a tractable subset of the rectangle algebra
following a line of reasoning which has been introduced by Ligozat, 1996,
namely, by considering convex and preconvex relations. Unlike for the interval
algebra, the set of preconvex relations is not closed under the fundamental
operations. Thus, Balbiani et al. extended the concept of preconvexity by
distinguishing between weakly and strongly preconvex relations, and showed
that the set of strongly preconvex relations is a tractable subset of the rectangle
algebra for which path-consistency is sufficient for deciding consistency. In
fact, the rectangle algebra represents not just the orientation between two
rectangles but also their topological relations. Hence, the rectangle algebra can
be regarded as an approach to combining topology and orientation. However,
because of the large number of relations of the rectangle algebra (a total
of $2^{169}$ relations), reasoning even over a tractable subset can be very inefficient.

An interesting but less expressive approach to representing orientational 
relationships between extended spatial entities was introduced by Goyal and
Egenhofer, 2001. Their calculus consists of a 3 × 3 direction-relation
matrix which represents the 9 sectors formed by the minimal bounding axes of
an extended spatial entity (see Fig. 4.6b). For each sector it is possible to 
specify whether the located object is contained in the sector or not, or (non- 
qualitatively) to which degree the located object is contained in the sector. 

Skiadopoulos and Koubarakis, 2005 developed reasoning algorithms for this 
calculus and analysed its computational properties, but their algorithms are not 
based on constraint based methods like path-consistency. 
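The qualitative variant of the direction-relation matrix can be sketched in
Python as follows. This is a simplified illustration under assumptions that are
not taken from Goyal and Egenhofer: both objects are axis-parallel rectangles
given as `(xmin, ymin, xmax, ymax)`, a sector counts as occupied only if the
located object overlaps it with positive area, and rows are ordered from north
to south.

```python
import math


def direction_matrix(ref, obj):
    """3 x 3 matrix of 0/1 entries for obj relative to ref.

    Rows run north, middle, south; columns run west, middle, east.
    An entry is 1 if obj overlaps the corresponding sector.
    """
    rx0, ry0, rx1, ry1 = ref
    ox0, oy0, ox1, oy1 = obj

    def bands(lo, hi, a, b):
        # Overlap of [lo, hi] with the bands below a, between a and b,
        # and above b.
        cuts = [(-math.inf, a), (a, b), (b, math.inf)]
        return [min(hi, c1) - max(lo, c0) > 0 for c0, c1 in cuts]

    cols = bands(ox0, ox1, rx0, rx1)    # west, middle, east
    rows = bands(oy0, oy1, ry0, ry1)    # south, middle, north
    return [[int(r and c) for c in cols] for r in reversed(rows)]
```

A located object lying entirely to the north-east of the reference object thus
produces a matrix with a single 1 in the top right corner.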


3.3 Distance 


Together with topology and orientation, distance is one of the most important 
aspects of space. Unlike the other two, distance is a scalar entity. Dealing with 
distance information is an important cognitive ability in our everyday life. In 
order to grab something, for instance, we must be good at judging distances.
When communicating about distances, we usually use qualitative categories 
like “A is close to B” or qualitative distance comparatives like “A is closer to 
B than to C”, but also numerical values like “A is about one meter away from 
B”. As indicated by the above examples, one can distinguish between absolute 
distance relations (the distance between two spatial entities) and relative dis- 
tance relations (the distance between two spatial entities as compared to the 
distance to a third entity). While absolute distance can be represented either 
qualitatively or quantitatively, relative distance is purely qualitative. When 
representing absolute distance in a qualitative way, this also depends on the 
scale of space which is used. Montello, 1993 suggests four different kinds of 
scales of space: figural space, vista space, environmental space, and geographic 
space. 

Most approaches to qualitative distance consider points as the basic entities. 
Absolute distance relations are obtained, e.g., by dividing the real line into sev- 
eral sectors such as “very close”, “close”, “commensurate”, “far”, and “very 
far” depending on the chosen level of granularity (Hernandez et al., 1995). 
Relative distance can be obtained by comparing the distance to a given refer- 
ence distance which results in ternary relations such as “closer than”, “equidis- 
tant”, or “farther than”. Reasoning about qualitative distances leads to several 
Figure 4.7. Different approaches to representing positional information: (a) absolute distances
combined with cone-based orientation (Clementini et al., 1997) (b) relative distances combined 
with projection-based orientation (Isli and Moratz, 1999). 


difficulties. For instance, given a sequence of collinear points $p_1, \ldots, p_n$ such
that $p_i$ is close to $p_{i+1}$ for every $i$, for which $n$ is $p_n$ far from $p_1$? Moreover,
combining distance relations does not only depend on the distances themselves but
also on the position of the corresponding points. For instance, if point B is
far from A and C is far from B, then C can be very far from A if A, B, and
C are aligned and if B is between A and C; or C can be close to A if the
angle between AB and BC is small. Therefore, it seems advisable to study
distance in combination with orientation. This combination is called positional
information.
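The dependence on the angle can be checked numerically with the law of
cosines. The Python sketch below is only an illustration; it interprets "the
angle between AB and BC" as the angle at B between the segments BA and BC, and
the numbers used are arbitrary.

```python
import math


def distance_ac(ab, bc, angle_at_b_deg):
    """Distance |AC| from |AB|, |BC| and the angle at B (law of cosines)."""
    beta = math.radians(angle_at_b_deg)
    return math.sqrt(ab * ab + bc * bc - 2 * ab * bc * math.cos(beta))


# B between A and C on a line: the two "far" distances add up.
aligned = distance_ac(10, 10, 180)
# Small angle at B: C ends up close to A although both legs are "far".
folded = distance_ac(10, 10, 10)
```

With both legs of length 10, the aligned case gives a distance of 20, while an
angle of 10 degrees at B gives a distance below 2.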

One approach for developing a position calculus is by Clementini et al., 1997, 
who combine a cone-based orientation approach with absolute distance relations
(see Fig. 4.7a). Clementini et al. present different procedures for computing the 
composition of two positional relations (A, B) and (B, C). They consider three 
special cases where BC is the same, opposite, or orthogonal direction to AB. 
Another approach is by Isli and Moratz, 1999, who propose several position 
calculi on various levels of granularity by combining relative distance relations 
with different approaches to orientation such as the projection-based approach 
(see Fig. 4.7b) or the double-cross calculus. The computational properties of 
these approaches have not been studied yet. 


4. Computational complexity 


Since we are interested in automated reasoning with the spatial calculi de- 
scribed above, it is a good idea to get an understanding of how computationally 
demanding reasoning in these calculi is. Here computational complexity theory
is the right theoretical tool. 

In the field of computational complexity (Papadimitriou, 1994), computa- 
tional problems are classified according to their need for resources for solving 
them, usually the running time and the memory consumption. This allows one to
compare the complexity of different problems and to design algorithms for a
whole class of problems. For classifying computational problems, they are
usually expressed as decision problems, i.e., problems that require a simple yes/no
answer. Such problems can be equivalently viewed as formal languages over
some alphabet $\Sigma$, where the language is formed by all yes-instances.

Most problems can be easily translated into an equivalent decision problem. 

Assume for example the problem of finding a satisfying truth assignment for 
a propositional formula. The corresponding decision problem is the problem 
SAT: given a set of variables $V$ and a propositional formula $\phi$ over $V$ in CNF, is
there a satisfying truth assignment for $\phi$? The complexity of a decision problem
is usually measured according to the worst-case running time or memory con- 
sumption of the best possible algorithm. If we now can prove lower bounds on 
the runtime for the decision problem then these lower bounds apply obviously 
to the original problem as well. 

Running time as well as memory consumption of an algorithm depends on the 
size n of its input, i.e., on the size of the problem instance, and can be expressed 
as a function f(n). For classifying algorithms according to their running time, 
the asymptotical behaviour is more important than f itself. This is specified in 
terms of the O-notation which gives an upper bound on the running time within 
a constant factor (Cormen et al., 1990). An algorithm with a running time of 
O(n?) or faster is usually considered to be efficient. In areas like database 
systems where instances have a very large size, a running time of O(n?) is too 
slow. In these areas efficient algorithms should have a linear running time. 


4.1 Tractability and NP-completeness 


There is a large number of different complexity classes that are used to cat- 
egorise decision problems (Johnson, 1990). Particularly important is the class 
of decision problems that can be solved in polynomial time using a determin- 
istic algorithm. This complexity class is called P and it is considered to be the 
class of efficiently solvable problems. Problems in P are also called tractable
problems; problems outside P are called intractable problems.

Interestingly, there exists a large class of problems for which nobody has 
found polynomial-time algorithms yet, but it appears equally hard to prove that 
no such algorithms exist. In order to capture these problems, one extends the 
notion of algorithm. The class of problems solvable in polynomial time using 
a non-deterministic algorithm is called NP, which is equivalent to specifying 
that a given solution of an NP problem can be verified in polynomial time 
using a deterministic algorithm. In the sequel, an algorithm will always be a 
deterministic algorithm, unless otherwise stated. It is clear that P is a subset 
of NP, but it is not known whether P is a proper subset of NP or whether P is
equal to NP, which is called the P ≟ NP problem.

An important method of comparing problems is specifying a reduction from
one problem to another. Given two problems A, B ⊆ Σ*, problem A can be
reduced to problem B by giving a constructive transformation f : Σ* → Σ*
such that f(x) ∈ B if and only if x ∈ A. If f can be computed in polynomial
time, the reduction is a polynomial (time) reduction. If A is polynomially
reducible to B (written as A ≤_p B), then any polynomial time algorithm for
solving B can be used to solve A. Thus, for showing that a particular decision
problem A is in P, it is sufficient to find another problem B ∈ P such that
A ≤_p B.

A decision problem A is said to be NP-hard if any other problem in NP can
be polynomially reduced to A. An NP-hard problem which is itself contained
in NP is called NP-complete. NP-complete problems are the most difficult
problems in NP. In fact, most of the problems for which nobody has found
an efficient algorithm yet, but which have resisted all attempts to prove
them intractable, fall into this class. In order to prove a decision problem
A to be NP-hard, it is sufficient to find another NP-hard problem that can be
polynomially reduced to A. The first problem that was identified to be
NP-complete is the SAT problem (Cook, 1971). In this work we use the following
NP-complete propositional decision problems (Garey and Johnson, 1979):


Given: A set of variables V and a propositional formula φ over V in CNF such
that each clause of φ has exactly three literals.

Questions:

1 Is there a satisfying truth assignment for φ? (3SAT)

2 Is there a satisfying truth assignment for φ such that each clause has
at least one true literal and at least one false literal?
(NOT-ALL-EQUAL-3SAT)

3 Is there a satisfying truth assignment for φ such that each clause has
exactly one true literal? (ONE-IN-THREE-3SAT)
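
The three questions differ only in how many true literals a clause may have.
The following Python sketch makes this explicit with a brute-force decision
procedure. It is an illustration only: the integer encoding of literals (+v
for a variable, -v for its negation) and all function names are assumptions
of this sketch, and the exhaustive search takes exponential time.

```python
from itertools import product

def literal_is_true(lit, assignment):
    """A literal is a non-zero int: +v stands for v, -v for its negation."""
    value = assignment[abs(lit)]
    return value if lit > 0 else not value

def decide(clauses, clause_ok):
    """Brute-force decision: try all 2^|V| assignments (exponential time).

    clause_ok receives the number of true literals of a clause and says
    whether the clause requirement of the particular problem is met.
    """
    variables = sorted({abs(lit) for clause in clauses for lit in clause})
    for values in product([False, True], repeat=len(variables)):
        assignment = dict(zip(variables, values))
        if all(clause_ok(sum(literal_is_true(l, assignment) for l in c))
               for c in clauses):
            return True
    return False

def three_sat(clauses):
    return decide(clauses, lambda true_lits: true_lits >= 1)

def not_all_equal_3sat(clauses):
    return decide(clauses, lambda true_lits: 1 <= true_lits <= 2)

def one_in_three_3sat(clauses):
    return decide(clauses, lambda true_lits: true_lits == 1)
```

For instance, the four clauses (1, 2, 3), (1, 2, -3), (1, -2, 3), (1, -2, -3)
force variable 1 to be true, so the instance is a positive instance of 3SAT,
but it is a negative instance of both restricted variants.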


Some variants of the propositional satisfiability problem are solvable in
polynomial time. This includes the 2SAT problem, the propositional
satisfiability problem of Krom formulae, and the HORNSAT problem, the
propositional satisfiability problem of Horn formulae, which is of particular
importance in this work. It is generally believed that P ≠ NP, and, hence,
that NP-complete problems are intractable. This is also the assumption of this
work. So far, every known algorithm for an NP-complete problem has a
super-polynomial worst-case running time.
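
To illustrate why HORNSAT is tractable, the following Python sketch decides
satisfiability of a set of Horn clauses by forward chaining (repeated unit
propagation on positive literals). The clause representation as (body, head)
pairs and the function name are assumptions of this sketch. The simple loop
runs in polynomial (here quadratic) time; it is not the most efficient known
procedure.

```python
def horn_sat(clauses):
    """Decide satisfiability of Horn clauses by forward chaining.

    Each clause is a pair (body, head): the variables in `body` occur
    negatively, `head` is the single positive variable, or None if the
    clause has no positive literal. Returns the set of variables that
    must be true (the minimal model), or None if unsatisfiable.
    """
    true_vars = set()
    changed = True
    while changed:
        changed = False
        for body, head in clauses:
            if set(body) <= true_vars:       # all premises are forced true
                if head is None:
                    return None              # a purely negative clause is violated
                if head not in true_vars:
                    true_vars.add(head)      # the conclusion is forced true
                    changed = True
    return true_vars
```

Every variable added to `true_vars` is forced by the clauses, so a violated
negative clause proves unsatisfiability; otherwise setting all remaining
variables to false yields a model.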
Figure 4.8. Typical phase-transition behaviour of randomly generated instances.


4.2 Phase Transitions 


Having proved a problem to be NP-complete is not the end of the computational
analysis of a problem, but rather its beginning. NP-completeness is just
a worst-case measure of a problem. Assuming P ≠ NP, it means that for any
algorithm there exist instances which cannot be solved in polynomial time. It
is possible that only one in a million instances is very hard and that the
other instances can be solved easily.

There are several ways to deal with NP-complete problems. One way is to 
develop efficient approximation algorithms which are correct but not complete 
for deciding either solubility or insolubility. Another way is to use complete 
algorithms which require exponential time in the worst case and to develop
heuristics which solve many instances efficiently. In all cases the effectiveness 
of new algorithms and heuristics should be verified using a large number of 
instances. Since it is usually not easy to obtain a large number of real-world in- 
stances, many researchers generate instances randomly with respect to different 
control parameters. 

Cheeseman et al., 1991 found that randomly generated instances of the NP- 
complete problems they studied had a very special behaviour: when ordering 
these instances according to a particular problem-dependent parameter, there 
are three different regions with respect to the solubility of the instances that 
occur when changing the parameter. In one region instances are soluble with 
a very high probability, in one region instances are insoluble with a very high 
probability, and in between these two regions there is a very small region where 
the probability of solubility of these instances changes abruptly from very high 
to very low (see Fig. 4.8(a)). Cheeseman et al., 1991 called this region the 
phase-transition. In this region a small change of the local parameter leads to
a large change in the solubility of the instances. 

Cheeseman et al., 1991 further found that almost all hard instances are located
in the phase-transition region. In general, instances in the phase transition
appear to be harder than instances in the other two regions (see Fig. 4.8(b)).
This is because instances in the soluble region are under-constrained and for
this reason any search method finds a solution very fast without much
backtracking.
Similarly, instances in the insoluble region are over-constrained and for this 
reason search methods fail quite early when searching through the space of 
possible solutions. In some studies, however, it turned out that some under- 
constrained instances are particularly hard (Gent and Walsh, 1996). 

The behaviour of randomly generated instances of NP-complete problems 
described by Cheeseman et al. was found by many researchers for many NP- 
complete problems, although satisfiability problems were the most studied 
problems. A typical parameter for satisfiability problems that causes a phase 
transition is the ratio of clauses-to-variables. An interesting selection of papers 
on the topic can be found in Hogg et al., 1996. 
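
The effect of the clauses-to-variables ratio can be observed even on tiny
instances. The following Python sketch generates random 3SAT instances at
several ratios and estimates the fraction of soluble instances by exhaustive
search. The generation model (three distinct variables per clause, random
signs), the parameter values, and the function names are assumptions of this
sketch; with so few variables the transition is far less sharp than in the
published experiments.

```python
import random
from itertools import product

def random_3sat(n_vars, n_clauses, rng):
    """Draw clauses over three distinct variables with random signs."""
    clauses = []
    for _ in range(n_clauses):
        variables = rng.sample(range(1, n_vars + 1), 3)
        clauses.append(tuple(v if rng.random() < 0.5 else -v
                             for v in variables))
    return clauses

def satisfiable(clauses, n_vars):
    """Brute-force check; only feasible for very small n_vars."""
    for values in product([False, True], repeat=n_vars):
        # literal l is true iff the value of variable |l| matches its sign
        if all(any(values[abs(l) - 1] == (l > 0) for l in c)
               for c in clauses):
            return True
    return False

def solubility_curve(n_vars, ratios, samples, seed=0):
    """Estimate the probability of solubility for each ratio."""
    rng = random.Random(seed)
    curve = {}
    for ratio in ratios:
        n_clauses = round(ratio * n_vars)
        hits = sum(satisfiable(random_3sat(n_vars, n_clauses, rng), n_vars)
                   for _ in range(samples))
        curve[ratio] = hits / samples
    return curve
```

Calling, e.g., `solubility_curve(8, [1.0, 4.0, 8.0], samples=20)` returns one
estimated probability per ratio; instances with few clauses per variable are
almost always soluble, instances with many clauses per variable almost never.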


4.3 How to prove NP-hardness and NP membership for 
spatial CSPs 


In order to prove NP-hardness of a decision problem, in our case the
consistency problem of a set of spatial constraints CSPSAT, it is sufficient
to find another NP-hard problem that can be polynomially reduced to the
problem at hand. Usually this has to be done in a different way for each new
problem, and in many cases it is a difficult task to find a new transformation
and to prove that it is a one-to-one transformation. The difficulty of this
task can be estimated when considering that new NP-hardness proofs often
deserve a publication.

When looking at spatial CSPs over different sets of relations it is striking that 
they all have the same structure with different relations. One might expect that 
the same reduction with different parameters can be used for different sets of 
relations and that a general transformation scheme can be used. In this section 
we present a scheme which we developed for proving NP-hardness of different
subsets of RCC-8 and which seems to be general in the sense that the
parameters of the scheme can be found by exhaustive search over possible
relations, no matter what the relations are. So the transformations could
essentially be identified automatically for any system of relations.

Our scheme uses a transformation from a propositional satisfiability problem
to CSPSAT(S), where S is a subset of a system of relations 2^B, by constructing
a set of spatial constraints Θ for every instance I of the propositional
satisfiability problem, such that Θ is consistent if and only if I is a
positive instance. The propositional satisfiability problems we use are 3SAT,
the problem of deciding whether there is a satisfying truth assignment for a
set of clauses where each clause has exactly three literals, as well as two
variants of 3SAT where truth assignments of particular types are required.
These variants are NOT-ALL-EQUAL-3SAT, the problem of deciding whether there
is a truth assignment such that for every clause at least one literal is
assigned true and at least one literal is assigned false, and
ONE-IN-THREE-3SAT, the problem of deciding whether there is a truth assignment
such that for every clause exactly one literal is assigned true. All three
decision problems are NP-hard (Schaefer, 1978).

The different transformations have in common that every variable v of the
propositional satisfiability problem is transformed to two constraints
x_v {R_t, R_f} y_v and x_¬v {R_t, R_f} y_¬v corresponding to the positive and
the negative literal of v, where R_t and R_f are relations of S with
R_t ∩ R_f = ∅. v is assigned true if and only if x_v {R_t} y_v holds and
assigned false if and only if x_v {R_f} y_v holds. Since the two literals
corresponding to a variable need to have opposite assignments, we have to make
sure that x_v {R_t} y_v holds if and only if x_¬v {R_f} y_¬v holds, and vice
versa, for which additional polarity constraints are required. In addition,
every literal occurrence l of the propositional satisfiability problem is
transformed to the constraint x_l {R_t, R_f} y_l, where x_l {R_t} y_l holds if
and only if l is assigned true. In order to assure the correct assignment of
positive and negative literal occurrences with respect to the corresponding
variable, polarity constraints are required again. For instance, if the
variable v is assigned true, i.e., x_v {R_t} y_v holds, then x_p {R_t} y_p
must hold for every positive literal occurrence p of v, and x_n {R_f} y_n must
hold for every negative literal occurrence n of v. Further, clause constraints
have to be added to ensure that the clause requirements of the specific
propositional satisfiability problem are satisfied. For example, if {i, j, k}
is a clause of an instance of ONE-IN-THREE-3SAT, then exactly one of the
constraints x_i {R_t} y_i, x_j {R_t} y_j, and x_k {R_t} y_k must hold.
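
The bookkeeping of this scheme can be sketched in Python as follows. The
sketch only builds the skeleton of the constraint set: two constraints per
variable, one constraint per literal occurrence, and the pairs that have to be
linked by polarity and clause constraints. The concrete relations R_t and R_f
and the concrete polarity and clause constraints depend on the calculus and
are not modelled here; the naming of constraints and the function name are
assumptions of this sketch.

```python
def build_skeleton(clauses):
    """clauses: list of 3-tuples of non-zero ints (+v / -v literals).

    Returns (constraints, polarity_links, clause_groups), where
      constraints    maps a constraint name to its relation {"R_t", "R_f"},
      polarity_links lists pairs of constraint names that must take the
                     same value ("same") or opposite values ("opposite"),
      clause_groups  lists, per clause, the names of its three
                     literal-occurrence constraints.
    """
    constraints, polarity_links, clause_groups = {}, [], []
    for v in sorted({abs(lit) for clause in clauses for lit in clause}):
        constraints[("var", v)] = {"R_t", "R_f"}    # positive literal of v
        constraints[("var", -v)] = {"R_t", "R_f"}   # negative literal of v
        polarity_links.append((("var", v), ("var", -v), "opposite"))
    for c_index, clause in enumerate(clauses):
        group = []
        for position, lit in enumerate(clause):
            name = ("occ", c_index, position)       # one literal occurrence
            constraints[name] = {"R_t", "R_f"}
            # an occurrence must agree with the literal it is an occurrence of
            polarity_links.append((name, ("var", lit), "same"))
            group.append(name)
        clause_groups.append(group)
    return constraints, polarity_links, clause_groups
```

For an instance with |V| variables and m clauses the skeleton contains
2|V| + 3m constraints, so its size is linear in the size of the instance.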

According to this scheme, all we have to do in order to find a transformation
is to identify relations R_t, R_f ∈ S, the polarity constraints which make it
possible to propagate the assignment of literal occurrences to other literal
occurrences, and the clause constraints which ensure that properties of
clauses also hold for their transformation. These constraints can be found by
exhaustively assigning and testing the polarity CSP of Fig. 4.9(a) and, based
on this, the clause CSP of Fig. 4.9(b). If it is possible to identify the
polarity and the clause constraints, then we have found a polynomial
transformation from a propositional satisfiability problem to the consistency
problem of S.

Figure 4.9. The polarity constraints (a) ensure that positive and negative
literals of the same variable have opposite assignments. The clause
constraints (b) ensure that the clause requirements of the particular 3SAT
problem are satisfied.

The next step is to show that this transformation is a many-to-one
transformation such that whenever we have a positive instance of the
propositional satisfiability problem we get a positive instance of the
consistency problem. Unlike finding polarity and clause constraints, this part
of the NP-hardness proof cannot be automated as it depends on the domains we
are using. However, since the CSP we get is very structured with only the
polarity constraints and the clause constraints, this can be easy to show in
many cases. It is actually an advantage that the domain we are using is
infinite as it allows us to treat the different polarity and clause
constraints almost independently. Examples for transforming propositional
satisfiability problems to CSPSAT(S) for different subsets S of RCC-8 can be
found in Renz and Nebel, 1999.

The next step in the complexity analysis of a given spatial calculus is to
prove NP membership of CSPSAT. Recall that in order to show that a decision
problem is a member of NP, we have to show that a possible solution can be
checked in polynomial time. So for showing that CSPSAT is in NP for a system
of relations 2^B over a domain D it is sufficient to show that CSPSAT(B) over
D is a member of NP. While for many NP-complete problems the NP-membership
proof is easier than showing NP-hardness, it is the other way around for
spatial CSPs. This is due to the fact that we might have to check arbitrary
spatial entities which might not even be representable in a computational
framework (see e.g., Renz, 1998) and due to the infinity of the domain D.
Proving NP membership can be very difficult and has to be done for each system
of relations and for each domain separately. Consider for example the RCC-8
relations and Egenhofer’s relations. The composition tables of the relations
are the same, but the domains are different. While the RCC-8 domain consists
of regular subsets of a topological space, Egenhofer’s domains consist of
connected two-dimensional regions without holes, which is much more
restricted. The consequence of this is that while RCC-8 is in NP (Renz, 1998),
NP membership of Egenhofer’s calculus is still an open problem (Grigni et al.,
1995). Instead of proving NP membership by showing that a given solution can
be verified in polynomial time, we can also give a polynomial time decision
procedure for CSPSAT(B) over D and show that whenever the decision procedure
recognises an instance Θ as consistent, there is an instantiation of all
variables in Θ with values of the domain D such that all constraints of Θ are
satisfied. Having a polynomial decision procedure is a stronger result and
implies NP membership. In the next section we will look at how such decision
procedures can be identified.


5. Identifying tractable subsets of spatial CSPs 


Reasoning about most interesting spatial calculi is NP-hard. This, however,
is often true only for the full calculus, i.e., if all relations 2^B can be
used. If we restrict ourselves to subsets S ⊆ 2^B of the full set of
relations, it might be possible that reasoning over this subset is tractable.
Ideally we are interested in finding maximal tractable subsets of 2^B, which
are those subsets which are tractable and which become NP-hard if any other
relation is added. These represent the boundary between tractability and
NP-hardness. Some subsets are obviously tractable such as the set of relations
that contain the identity relation as a disjunct. The subsets that are most
interesting are those that contain all the basic relations B. So as a minimal
requirement and as a first step we have to show that the set of basic
relations is tractable.

For RCC-8, Renz and Nebel, 1999 showed tractability of the basic relations 
by developing a polynomial transformation of RCC-8 constraints into SAT 
formulae. Those RCC-8 relations that transform into a Horn formula together 
form a tractable subset. Altogether 64 relations were identified in this way,
among them all the basic relations. While we could try to develop a new
algorithm or a new transformation for every spatial calculus and for different 
subsets of them, it is highly desirable that the path-consistency algorithm (see 
Sec. 2.1) can be used for deciding consistency of tractable subsets. If this is 
the case then consistency can be decided purely by algebraic operations on 
the relations without having to fall back to the infinite domains. And we have 
to deal with the domains only once for proving that path-consistency decides 
consistency. Obviously, this again depends strongly on the domains and the 
relations that are used and cannot be generalised. Therefore we have to find a 
new tractability proof for every set of basic relations over every domain. This 
can be very complicated as we have to deal with infinite domains. 
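
The following Python sketch shows what deciding consistency "purely by
algebraic operations" looks like. Since the RCC-8 composition table is too
large for a short example, the sketch uses the point algebra over a linear
order (basic relations <, =, >) as a stand-in calculus; this choice, the
network representation, and all names are assumptions of this sketch. The
algorithm only uses composition, converse and intersection and never touches
the underlying domain.

```python
from itertools import product

BASE = "<=>"                                  # basic relations (point algebra)
CONVERSE = {"<": ">", "=": "=", ">": "<"}
COMPOSE = {                                   # composition of basic relations
    ("<", "<"): "<",  ("<", "="): "<", ("<", ">"): BASE,
    ("=", "<"): "<",  ("=", "="): "=", ("=", ">"): ">",
    (">", "<"): BASE, (">", "="): ">", (">", ">"): ">",
}

def converse(r):
    return frozenset(CONVERSE[b] for b in r)

def compose(r, s):
    return frozenset(c for a in r for b in s for c in COMPOSE[(a, b)])

def path_consistency(n, constraints):
    """constraints maps (i, j) to a string of basic relations, e.g. "<=".

    Returns the path-consistent network as an n x n matrix of frozensets,
    or None if the empty relation is derived.
    """
    net = [[frozenset("=" if i == j else BASE) for j in range(n)]
           for i in range(n)]
    for (i, j), r in constraints.items():
        net[i][j] = net[i][j] & frozenset(r)
        net[j][i] = converse(net[i][j])
        if not net[i][j]:
            return None
    changed = True
    while changed:
        changed = False
        for i, k, j in product(range(n), repeat=3):
            # refine R_ij by the composition of R_ik and R_kj
            refined = net[i][j] & compose(net[i][k], net[k][j])
            if refined != net[i][j]:
                if not refined:
                    return None
                net[i][j], net[j][i] = refined, converse(refined)
                changed = True
    return net
```

For example, the constraints x ≤ y, y ≤ z, z ≤ x are refined to x = y = z,
whereas x < y, y < z, x > z lead to the empty relation.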

For RCC-8, for example, the proof that the path-consistency algorithm de- 
cides consistency for the basic relations (actually for a larger set of relations) 
was done as follows (Renz and Nebel, 1999). First, it was analysed how ap- 
plying path-consistency can lead to an inconsistency. Then it was shown that 
whenever the path-consistency algorithm detects an inconsistency, positive unit 
resolution applied to the SAT encoding of RCC-8 produces the empty clause. 
In Renz, 1998 an algorithm was presented which computes an instantiation for 
all variables of any consistent set of constraints over the RCC-8 basic relations. 
This algorithm works for Euclidean spaces in all dimensions d. For d ≥ 3 it
also works for connected regions without holes.

Once it has been shown that path-consistency decides consistency for the 
basic relations, it is possible to try to extend the set of relations and to identify 
larger tractable subsets. There are basically two general methods which can 
be used for extending tractability of subsets of relations to larger subsets: the 
closure method (Renz and Nebel, 1999) and the refinement method (Renz, 
1999). We will describe these methods in the following section. In particular 
the refinement method seems very powerful and will be presented in more detail. 


5.1 Closure of sets of relations 


Given a system of relations 2^B, the number of subsets S ⊆ 2^B that we
might have to analyse for a computational analysis is huge, namely,
2^(2^|B|). This number can be slightly reduced if only those subsets are
considered that contain all the basic relations and possibly the universal
relation. Fortunately, we can reduce the number of subsets further by noting
that the computational complexity associated with an arbitrary subset S is
identical to the complexity associated with the closure of this subset under
composition, intersection, and converse, denoted by Ŝ, an observation that
was first used in determining a maximal tractable subset of Allen’s interval
calculus (Nebel and Bürckert, 1995, Theorem 14). Renz and Nebel, 1999 proved
this for arbitrary systems of relations and came up with the following
theorem.


THEOREM 4.1 Let C be a set of binary relations that is closed under
composition, intersection, and converse. Then for any subset S ⊆ C that
contains the universal relation, the problem CSPSAT(Ŝ) can be polynomially
reduced to CSPSAT(S).


Note that Theorem 4.1 holds only if there exists an infinite supply of fresh 
variables; this is not always the case (e.g., bounded variable problems which 
are studied in logic and model theory). Another requirement of Theorem 4.1 
is the possibility to specify more than one constraint for each pair of variables. 
Otherwise the identity relation must be contained in S. The following corollary 
specifies how Theorem 4.1 will be used. 


COROLLARY 4.2 Let S be a subset of 2^B.

1 CSPSAT(Ŝ) ∈ P if and only if CSPSAT(S) ∈ P.

2 CSPSAT(S) is NP-hard if and only if CSPSAT(Ŝ) is NP-hard.


The first statement of Corollary 4.2 can be used to increase the number 
of elements of tractable subsets of CSPSAT considerably. With the second 
statement of Corollary 4.2 NP-hardness proofs of CSPSAT can be used to 
exclude certain relations from being in any tractable subset of CSPSAT. In
any case, we will have to analyse only those subsets that are closed under 
composition, converse and intersection. 
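
Computing the closure of a set of relations is a simple fixed-point
iteration. The following Python sketch illustrates it on the point algebra
over a linear order (basic relations <, =, >), which is used here as a small
stand-in calculus; this choice and all names are assumptions of this sketch.
For a calculus such as RCC-8 only the two tables would have to be exchanged.

```python
BASE = "<=>"                                  # basic relations (point algebra)
CONVERSE = {"<": ">", "=": "=", ">": "<"}
COMPOSE = {                                   # composition of basic relations
    ("<", "<"): "<",  ("<", "="): "<", ("<", ">"): BASE,
    ("=", "<"): "<",  ("=", "="): "=", ("=", ">"): ">",
    (">", "<"): BASE, (">", "="): ">", (">", ">"): ">",
}

def converse(r):
    return frozenset(CONVERSE[b] for b in r)

def compose(r, s):
    return frozenset(c for a in r for b in s for c in COMPOSE[(a, b)])

def closure(relations):
    """Close a set of relations under composition, intersection, converse."""
    closed = set(relations)
    while True:
        new = set()
        for r in closed:
            new.add(converse(r))
            for s in closed:
                new.add(r & s)            # intersection
                new.add(compose(r, s))    # composition
        if new <= closed:                 # fixed point reached
            return closed
        closed |= new
```

Starting from the single relation {<, =}, the closure consists of the four
relations {<, =}, {>, =}, {=} and {<, =, >}. By Corollary 4.2 it is enough to
analyse one representative of all subsets that have the same closure.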

The computational analysis of RCC-8 shows how powerful this method is.
The closure of the set of 64 relations that transform to Horn formulae consists
of 148 relations (called Ĥ_8) and turns out to be a maximal tractable subset of
RCC-8. Furthermore, it has been shown (Renz and Nebel, 1999) that
path-consistency decides CSPSAT(Ĥ_8).


5.2 The refinement method 


In this subsection we present a general method for proving tractability of
reasoning over disjunctions of a JEPD set B of binary relations over a domain
D which are atoms of a relation algebra, i.e., a method for proving tractability
of CSPSAT(S) for sets S ⊆ 2^B (see Sec. 2.2). In order to do so, this method
requires a subset T of 2^B for which path-consistency is already known to
decide CSPSAT(T). Then the method checks whether it is possible to refine every
constraint involving a relation in S according to a particular refinement scheme
to a constraint involving a relation in T without changing consistency. The
following definition will be central for this method.


DEFINITION 4.3 (REDUCTION BY REFINEMENT)
Let S, T ⊆ 2^B. S can be reduced by refinement to T, if the following two
conditions are satisfied:

1 for every relation S ∈ S there is a relation T_S ∈ T with T_S ⊆ S,

2 every path-consistent set Θ of constraints over S can be refined to a set
Θ' of constraints over T by replacing x_i S x_j ∈ Θ with x_i T_S x_j ∈ Θ' for
i < j, such that enforcing path-consistency to Θ' does not result in an
inconsistency.


Note that in the above definition constraints x_i S x_j are refined only for
i < j. This is no restriction, as by enforcing path-consistency the converse
constraint x_j S~ x_i will also be refined. Rather it offers the possibility
of refining, e.g., converse relations to other than converse sub-relations,
i.e., if, for instance, R is refined to r, R~ can be refined to a relation
other than r~. This property of a set of relations can be used to derive its
tractability.


LEMMA 4.4 If path-consistency decides CSPSAT(T) for a set T ⊆ 2^B, and S
can be reduced by refinement to T, then path-consistency decides CSPSAT(S).


Proof. Let Θ be a path-consistent set of constraints over S. Since S can be
reduced by refinement to T, there is by definition a set Θ' of constraints over
T which is a refinement of Θ such that enforcing path-consistency to Θ' does
not result in an inconsistency. Path-consistency decides CSPSAT(T), so Θ' is
consistent, and, hence, Θ is also consistent. ∎


Since path-consistency can be enforced in cubic time, it is sufficient for
proving tractability of CSPSAT(S) to show that S can be reduced by refinement
to a set T for which path-consistency decides CSPSAT(T). Note that for refining
a constraint x S y (S ∈ S) to a constraint x T_S y (T_S ∈ T), it is not
required that T_S is also contained in S. Thus, with respect to common
relations the two sets S and T are independent of each other. This is in
contrast to Theorem 4.1 which states that the tractability of a set of
relations implies the tractability of its closure.

We will now present a method for showing that a set of relations S ⊆ 2^B
can be reduced by refinement to another set T ⊆ 2^B. In order to manage the
different refinements, a refinement matrix is introduced that contains for every
relation S ∈ S all specified refinements.


DEFINITION 4.5 (REFINEMENT MATRIX)
A refinement matrix M of S has |S| × 2^|B| Boolean entries such that for
S ∈ S, R ∈ 2^B, M[S][R] = true only if R ⊆ S.


For example, if we want to build a refinement matrix which states that
the relation {DC, EC, PO, TPP} can be refined to the relations {DC, TPP}
and {DC}, then we set M[{DC, EC, PO, TPP}][R] to true only for R =
{DC, TPP} and for R = {DC}, and to false for all other relations R ∈ 2^B.
M is called the basic refinement matrix if M[S][R] = true if and only if
S = R.

Renz, 1999 proposes the algorithm CHECK-REFINEMENTS (see Fig. 4.10)
which takes as input a set of relations S and a refinement matrix M of S. The
algorithm uses triples of relations T = (R_12, R_23, R_13) which represent sets
of constraints {x R_12 y, y R_23 z, x R_13 z} for some variables x, y, z. It
computes all possible path-consistent triples of relations R_12, R_23, R_13 of
S (step 4), and enforces path-consistency (using a standard procedure
PATH-CONSISTENCY) to every refinement T' = (R'_12, R'_23, R'_13) for which
M[R_ij][R'_ij] = true for all i, j ∈ {1, 2, 3}, i < j (steps 5, 6). If one of
these refinements results in the empty relation, the algorithm returns fail
(step 7). Otherwise, the resulting relations R''_12, R''_23, R''_13 are added
to M by setting M[R_ij][R''_ij] = true for all i, j ∈ {1, 2, 3}, i < j
(step 8). This is repeated until M has reached a fixed point (step 9), i.e.,
enforcing path-consistency on any possible refinement does not result in new
relations anymore. If no inconsistency is detected in this process, the
algorithm returns succeed.

A similar algorithm, GET-REFINEMENTS, returns the revised refinement
matrix if CHECK-REFINEMENTS returns succeed and the basic refinement
matrix if CHECK-REFINEMENTS returns fail. Since B is a finite set of
relations, M can be changed only a finite number of times, so both algorithms
always terminate. If n = |2^B| is the total number of relations, then there
are at most n^3 possible triples of relations in step 4, at most n^3 possible
refinements of each triple in step 5, and at most n^2 iterations of the while
loop. Thus, a rough estimation of the worst-case running time of both
algorithms leads to O(n^8).

Algorithm: CHECK-REFINEMENTS

Input: A set S and a refinement matrix M of S.

Output: fail if the refinements specified in M can make a path-consistent
triple of constraints over S inconsistent; succeed otherwise.

1.  changes ← true
2.  while changes do
3.      oldM ← M
4.      for every path-consistent triple
            T = (R_12, R_23, R_13) of relations over S do
5.          for every refinement T' = (R'_12, R'_23, R'_13) of T
                with oldM[R_12][R'_12] = oldM[R_23][R'_23] =
                oldM[R_13][R'_13] = true do
6.              T'' ← PATH-CONSISTENCY(T')
7.              if T'' = (R''_12, R''_23, R''_13) contains the empty
                    relation then return fail
8.              else do M[R_12][R''_12] ← true,
                        M[R_23][R''_23] ← true,
                        M[R_13][R''_13] ← true
9.      if M = oldM then changes ← false
10. return succeed

Figure 4.10. Algorithm CHECK-REFINEMENTS.
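
The following Python sketch follows the steps of Fig. 4.10. It is an
illustration under several assumptions: the point algebra over a linear order
(basic relations <, =, >) is used as a small stand-in calculus, the refinement
matrix is stored as a dictionary from each relation of S to the set of its
refinements, all relations in S are assumed to be non-empty, and the function
combines CHECK-REFINEMENTS and GET-REFINEMENTS by returning the revised
matrix on success and None on failure.

```python
from itertools import product

BASE = "<=>"                                  # basic relations (point algebra)
CONVERSE = {"<": ">", "=": "=", ">": "<"}
COMPOSE = {                                   # composition of basic relations
    ("<", "<"): "<",  ("<", "="): "<", ("<", ">"): BASE,
    ("=", "<"): "<",  ("=", "="): "=", ("=", ">"): ">",
    (">", "<"): BASE, (">", "="): ">", (">", ">"): ">",
}

def converse(r):
    return frozenset(CONVERSE[b] for b in r)

def compose(r, s):
    return frozenset(c for a in r for b in s for c in COMPOSE[(a, b)])

def pc_triple(r12, r23, r13):
    """Enforce path-consistency on {x r12 y, y r23 z, x r13 z}."""
    while True:
        n13 = r13 & compose(r12, r23)
        n12 = r12 & compose(n13, converse(r23))
        n23 = r23 & compose(converse(n12), n13)
        if (n12, n23, n13) == (r12, r23, r13):
            return r12, r23, r13
        r12, r23, r13 = n12, n23, n13

def check_refinements(S, M):
    """S: list of non-empty relations; M: dict relation -> set of refinements.

    Returns the revised matrix (succeed) or None (fail).
    """
    M = {s: set(M[s]) for s in S}                         # work on a copy
    triples = [t for t in product(S, repeat=3)
               if pc_triple(*t) == t]                     # step 4
    changes = True                                        # step 1
    while changes:                                        # step 2
        old = {s: set(refs) for s, refs in M.items()}     # step 3
        for r12, r23, r13 in triples:
            for refinement in product(old[r12], old[r23], old[r13]):  # step 5
                result = pc_triple(*refinement)           # step 6
                if not all(result):                       # step 7
                    return None
                for rel, new in zip((r12, r23, r13), result):
                    M[rel].add(new)                       # step 8
        changes = M != old                                # step 9
    return M                                              # step 10
```

As an example, refining every relation of the point algebra by removing the
basic relation = (and keeping {=} itself) passes the check, whereas refining
{<, =} to {=} while keeping {<, >} fails because the path-consistent triple
({<, =}, {<, >}, {<, =}) becomes inconsistent.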


LEMMA 4.6 Let Θ be a path-consistent set of constraints over S and M a
refinement matrix of S. For every refinement Θ' of Θ with x_i R' x_j ∈ Θ'
only if x_i R x_j ∈ Θ, i < j, and M[R][R'] = true, the following holds: if
CHECK-REFINEMENTS(S, M) returns succeed, enforcing path-consistency
to Θ' does not result in an inconsistency.


If CHECK-REFINEMENTS returns succeed and GET-REFINEMENTS returns
M’, we have pre-computed all possible refinements of every path-consistent 
triple of variables as given in the refinement matrix M’. Thus, applying these 
refinements to a path-consistent set of constraints can never result in an incon- 
sistency when enforcing path-consistency. 
THEOREM 4.7 Let S, T ⊆ 2^B, and let M be a refinement matrix of S. Let
M' be the refinement matrix returned by GET-REFINEMENTS(S, M). If for
every S ∈ S there is a T_S ∈ T with M'[S][T_S] = true, then S can be reduced
by refinement to T.


By Lemma 4.4 and Theorem 4.7 we have that the procedures CHECK-REFINEMENTS
and GET-REFINEMENTS can be used to prove tractability for sets of
relations.


COROLLARY 4.8 Let S, T ⊆ 2^B be two sets such that path-consistency
decides CSPSAT(T), and let M be a refinement matrix of S. Let M' be the
refinement matrix returned by GET-REFINEMENTS(S, M). If for every S ∈ S
there is a T_S ∈ T with M'[S][T_S] = true, then path-consistency decides
CSPSAT(S).


If a suitable refinement matrix can be found, CHECK-REFINEMENTS can 
be used to immediately verify that reasoning over the given set of relations 
is tractable. One problem with this method is that the algorithms, though 
polynomial, are not very efficient. Especially for large sets of relations the 
algorithms are very slow. Fortunately, the algorithms are used for determining 
tractability of reasoning over sets of relations and not for the reasoning process 
itself. Renz, 2002 proposed a faster version of the algorithm which uses a 
refinement array instead of a refinement matrix which reduces the runtime of 
the algorithm to O(n* logn). 

In the following subsection we show how the refinement method can be 
applied to RCC-8 for proving certain subsets to be tractable. For RCC-8 it 
will lead to a complete analysis of tractability by identifying all three maximal 
tractable subsets. 


5.3 Applying the refinement method 


The refinement method requires for any input set of relations S ⊆ 2^B a
subset T ⊆ 2^B for which path-consistency is known to decide consistency and
a refinement strategy S → T. Assuming that a set T is known, the main tasks
are to find a candidate set S and a refinement strategy, i.e., we have to find
for every relation of S a relation of T and apply the refinement algorithm
using the different refinement strategies.

Candidate sets S_i can be found by using the closure method and the known
NP-hard relations. Renz, 1999 identified candidate sets for RCC-8 by computing
the largest subsets of RCC-8 that contain the basic relations and the
universal relation, are closed under the operators, and do not contain any of
the known NP-hard relations, i.e., the relations that can be used for the
NP-hardness proofs. This resulted in only three candidate sets (which are
called C_8, Q_8 and the already known maximal tractable subset Ĥ_8) which can
be tested using the refinement method, provided that a refinement strategy can
be found.

One way of finding a refinement strategy is to use a greedy method of
extending partial refinement strategies: first refine only one or a few
relations, fill the refinement matrix/array using the refinement algorithm,
and, if no inconsistency occurs, add some more refinements. This can be
repeated until a working refinement strategy is found or until it is shown
that no refinement strategy exists. It might also be possible that a
particular refinement strategy, the identity refinement strategy, is
applicable. The identity refinement strategy refines each relation R ∈ S to
the relation R' = R \ ID, where ID ∈ B is the identity relation. Renz, 1999
observed that all relations that are contained in the two candidate sets for
RCC-8 which are not contained in Ĥ_8 can be refined to relations of Ĥ_8 by
removing the identity relation. It turned out that applying the refinement
algorithm to the candidate sets of RCC-8 leads to refinement matrices that
contain a basic relation for each relation of the candidate sets. This shows
that each of these candidate sets can be refined to the set of basic relations
and, therefore, that the candidate sets are tractable and can be decided by
the path-consistency algorithm. Renz, 1999 also applied the identity
refinement matrix to the known maximal tractable subset ORD-Horn of the
interval algebra (Nebel and Bürckert, 1995) and it turned out that the
refinement method also works for the interval algebra. The refinement method
has recently been extended to identify tractable subsets automatically given
only the composition table of the relations (Renz, 2007).

Now we have all the tools for identifying (maximal) tractable subsets of a
system of spatial relations. In the next section we show how these sets can
be used for finding fast solutions to intractable CSPSAT instances.


6. Practical efficiency of reasoning methods 


In the previous section we described how to find tractable subsets of the usu- 
ally NP-hard spatial calculi. For most of the tractable subsets path-consistency 
or even simpler methods are sufficient for deciding consistency, so except for 
very large instances or for calculi over a large set of relations, there are usually 
no efficiency problems when considering instances that contain only relations 
of a tractable subset. Efficiency problems occur, however, if we go outside the 
tractable subsets and enter the NP-hard territory. As we will see later, instances 
of an NP-hard problem can often be solved very fast in practice. There are 
basically four reasons for this. The first is that the interleaved application
of path-consistency during the backtracking search is often very powerful and
already eliminates many labels that cannot lead to a solution. The second
reason is that large tractable subsets reduce the size of the backtracking
search tree by several orders of magnitude. This results from the possibility
of splitting relations into tractable sub-relations instead of splitting them
into all contained basic relations. The third reason is that different
heuristics and strategies can be applied for solving hard instances. Often
there is a heuristic for which a hard instance turns out to be easy, so the
more heuristics and strategies are available, the higher the likelihood that
one of them can solve an instance fast. The last reason is the observation that
instances outside the phase-transition region are in almost all cases very easy
to solve. In the following we will discuss these points in more detail and show
results from an empirical investigation of the practical efficiency of RCC-8.


6.1 Generating Test Instances 


In order to test the practical efficiency of reasoning algorithms, it is neces- 
sary to generate a large number of test instances. Ideally these should be real 
instances of existing applications. If such an application is not available, in- 
stances have to be generated systematically or randomly. Since many instances 
are easy to solve, it is important to try to generate instances that are as hard 
as possible. When randomly generating instances, there is usually a parameter 
that produces a phase-transition of the probability of satisfiability of the gener- 
ated instances, i.e., when increasing the value of the parameter, the probability 
changes from almost 1 to almost 0 (or vice versa) within a very small range of 
the parameter (see Fig. 4.8). Almost all instances outside the phase-transition 
region are very easy to solve while the phase-transition region contains most 
hard instances (Cheeseman et al., 1991). The most useful instances for em- 
pirical study of reasoning algorithms can therefore be found in and around the 
phase-transition region, which has to be empirically determined. 

For randomly generated RCC-8 instances it turned out that one phase- 
transition is induced by the degree d of nodes, i.e., how many edges for each 
node of the constraint graph are randomly instantiated on average (Renz and 
Nebel, 2001). The phase-transition turns out to be around d = 10. Another 
way of generating hard instances is to randomly generate instances that contain 
only relations that are outside the tractable subsets. This, however, is mainly
for testing the behaviour of the algorithms in extreme cases and is not very
representative for practical purposes, for which it might be better to analyse
a uniform distribution of the relations. Another important factor when generating
random instances is to make sure that the instances are not trivially flawed 
(Achlioptas et al., 1997), i.e., the probability that small inconsistent sub-CSPs, 
such as inconsistent triples, are contained in the instances should not be high 
and should not determine the phase-transition. 

The following empirical results are taken from Renz and Nebel, 2001 and
show how randomly generated RCC-8 instances can be solved very efficiently.
The random RCC-8 instances were generated according to the model A(n, d, l),
where n is the number of variables, d the average degree and l the average
number of base relations per relation. The relations were selected among all
RCC-8 relations; for RCC-8, l = 4.0 means that all relations are selected with
equal probability.
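
The model A(n, d, l) can be illustrated with a minimal sketch. The details
below are assumptions made for illustration, not taken from Renz and Nebel,
2001: each edge is instantiated independently with probability d/(n - 1), each
base relation enters a label independently with probability l/8, and empty
labels are redrawn (so the average label size is only approximately l). The
function name `random_instance` and the spelling of the converse relations as
`TPPi` and `NTPPi` are hypothetical.

```python
import random

BASE = ("DC", "EC", "PO", "TPP", "NTPP", "TPPi", "NTPPi", "EQ")

def random_instance(n, d, l, seed=None):
    """Return {(i, j): frozenset of base relations} for i < j (illustrative)."""
    rng = random.Random(seed)
    edge_prob = d / (n - 1)       # expected degree of every node is d
    label_prob = l / len(BASE)    # expected label size is roughly l
    constraints = {}
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() >= edge_prob:
                continue          # edge stays unconstrained (universal relation)
            label = frozenset()
            while not label:      # assumption: redraw empty labels
                label = frozenset(b for b in BASE if rng.random() < label_prob)
            constraints[(i, j)] = label
    return constraints

instance = random_instance(n=100, d=10.0, l=4.0, seed=0)
```

With l = 4.0 every base relation is included with probability 1/2, so every
non-empty RCC-8 relation is equally likely, which matches the remark above.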


6.2 Testing Algorithms 


When testing algorithms on the generated instances, several properties are 
interesting and should be observed. One is of course the time it takes to solve the 
instances, but it is also important to compare the number of nodes of the back- 
tracking search space that were visited while solving instances. This value is 
important for comparing algorithms on different machines as the run-time dif- 
fers from machine to machine and also depends on other factors such as the load 
and the available memory of the machine used for the test. Instead of using 
only the average values (runtime, visited nodes, etc.) we also look at different 
percentiles, i.e., we order the values and look at the values of the elements at 
position 50%, 70%, 90%, or 99%. Since we are dealing with an NP-complete 
problem for which some instances take a very long time to solve, taking the 
average only would be too erratic. In the following we mainly look at 99% 
percentile instances as these give a good indication of the performance for the 
hardest among the instances. 
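
A small sketch shows why percentiles are more informative than the average
here. The node counts are invented for illustration, and the nearest-rank
definition of a percentile is one common convention, assumed here rather than
taken from the experiments.

```python
def percentile(values, p):
    """Nearest-rank percentile for an integer p between 1 and 100."""
    ordered = sorted(values)
    rank = max(1, -(-p * len(ordered) // 100))  # ceiling of p * N / 100
    return ordered[rank - 1]

# Hypothetical visited-node counts: many easy instances and one very hard one.
visited_nodes = [5] * 90 + [50] * 9 + [100_000]

mean = sum(visited_nodes) / len(visited_nodes)
summary = {p: percentile(visited_nodes, p) for p in (50, 70, 90, 99)}
```

In this invented sample a single outlier dominates the average, while the
percentiles describe what happens for the bulk of the instances and for the
hardest few separately.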


6.3 Effect of using large tractable subsets 


A very important factor in obtaining more efficient solutions to instances of 
an NP-hard spatial reasoning problem is the use of large tractable subsets of the 
NP-hard set of relations. The backtracking algorithms split each constraint into 
sub-constraints that contain only relations of a tractable subset where each split 
spans a new subtree of the search space. Using large tractable subsets makes 
it possible to split the constraints into fewer sub-constraints, thus reducing the 
number of subtrees and the size of the search space. This can be measured in 
terms of the average branching factor of a search tree. For RCC-8, using the 
set of basic relations for splitting the constraints leads to an average branching 
factor of b = 4 which corresponds in this case to the average number of basic 
relations in each of the 256 RCC-8 relations. For the maximal tractable subsets 
$\widehat{\mathcal{H}}_8$, $\mathcal{Q}_8$, and $\mathcal{C}_8$, the average branching factors are
$b = 1.4375$, $b = 1.516$, and $b = 1.523$, respectively. The average size of
the search spaces can be computed as $b^{(n^2-n)/2}$, where $n$ is the number of
variables. As can be seen in Table 4.3, this results in considerably smaller
search spaces. This, however, is not fully reflected in the empirical results
because of the effect of the interleaved applications of the path-consistency
algorithm at each node of the search tree, which eliminates inconsistent
relations from the constraints and has a similar effect of reducing the search
space. Both methods together, path-consistency and large tractable subsets,
already lead to quite impressive results for solving randomly generated RCC-8
instances. In Fig. 4.11 we see the 99% percentile running times for solving
instances of the phase-transition region using different tractable subsets.
The maximal tractable subsets lead to considerably faster solutions, but not
as much faster as suggested by Table 4.3.

#regions | $\mathcal{B}$ (4.0) | $\widehat{\mathcal{B}}$ (2.5) | $\widehat{\mathcal{H}}_8$ (1.4375)
5 | $10^{6}$ | 9537 | 37
7 | $4.4 \times 10^{12}$ | $2.3 \times 10^{8}$ | 2040
10 | $1.2 \times 10^{27}$ | $8.1 \times 10^{17}$ | $10^{7}$
20 | $2.5 \times 10^{114}$ | $4.1 \times 10^{75}$ | $8.8 \times 10^{29}$

Table 4.3. Average size of the search space depending on the number of variables and the
branching factor of the split set.

Figure 4.11. 99% percentile running times for solving RCC-8 instances of the phase-transition
region using different tractable subsets (d = 8.0 to d = 10.0, 2,500 instances per data point).
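
The numbers behind Table 4.3 can be checked with a few lines of arithmetic.
The sketch below assumes that a constraint network over n variables has
(n^2 - n)/2 edges and that every edge is split with average branching factor
b; the helper names are hypothetical.

```python
from itertools import combinations

BASE = ("DC", "EC", "PO", "TPP", "NTPP", "TPPi", "NTPPi", "EQ")

def average_label_size(base):
    """Average number of base relations over all 2^|base| relations."""
    sizes = [k for k in range(len(base) + 1) for _ in combinations(base, k)]
    return sum(sizes) / len(sizes)

def search_space_size(n, b):
    """b ** (number of edges) for a network over n variables."""
    return b ** ((n * n - n) // 2)

avg_b = average_label_size(BASE)              # branching factor for basic splits
for n in (5, 7, 10, 20):
    row = [search_space_size(n, b) for b in (4, 2.5, 1.4375)]
    print(n, " ".join(f"{size:.3g}" for size in row))
```

`average_label_size(BASE)` evaluates to 4.0, the branching factor quoted for
splitting into basic relations, and `search_space_size(5, 4)` is 4**10, i.e.
about one million.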


6.4 Effect of different heuristics 


Another factor for obtaining faster solutions is to use different heuristics 
for choosing the path through the search space. There are two positions in 
the backtracking algorithms where a heuristic choice can be made. One is the 
order in which the constraints are selected, the other choice is the order of 
the sub-relations when splitting a constraint. For both choices we can apply 
different heuristics which influence the search space and the path through the 
search space. It is clear that the choice of the heuristics has more effect on
consistent instances. In order to determine that an instance is consistent, it is
sufficient to find one path from the root of the search tree to a consistent leaf.
So if the perfect heuristic choice is made at all nodes, any consistent instance
can be solved without backtracking. For inconsistent instances, all leaves of
the search tree must be inconsistent, so inconsistency is determined fastest
when it can be detected early on in the search tree. We chose two different
heuristics for the ordering of constraints and two for the ordering of
sub-relations (Nebel, 1997):


static/dynamic: Constraints are processed according to a heuristic evaluation
of their constrainedness, which is determined statically before the
backtracking starts or dynamically during the search.


local/global: The evaluation of the constrainedness is based on a local heuristic
weight criterion or on a global heuristic criterion (van Beek and Manchak,
1996).


Figure 4.12. Percentile 99% CPU time of the different heuristics for solving A(n, d, 4.0)
(d = 8.0 to d = 10.0, 2,500 instances per data point).


In Fig. 4.12 the 99% percentiles are shown for the different combinations of
heuristics; the second column of Table 4.4 shows the number of hard instances
for each combination. Hard instances are considered to be those that cannot
be solved within 10,000 visited nodes in the search space. It can be seen that
although some combinations are better than others, they are all quite successful
and the differences are not enormous. Their real advantage is described in the
following section.

Figure 4.13. Fastest solution of the hard instances when running all heuristics in parallel.


6.5 Effects of combining different strategies 


We denote as a strategy a choice of tractable subset for splitting, a heuristic for 
constraint selection and a heuristic for sub-relation ordering. As described in the 
previous section, every consistent instance can be solved without backtracking 
if the right heuristic choice is made at each node. Therefore it is not surprising 
that some instances can be solved faster by one strategy while other instances 
are solved faster by other strategies. This means that it might be possible to 
solve more instances efficiently by combining different strategies than by each 
strategy alone. We tested this hypothesis by running all strategies on the set 
of all hard instances identified in the experiment described above. It turns out 
that almost all of these hard instances can be solved by at least one strategy 
(see Table 4.4). We also looked at which strategy gives the first response, 
i.e., which strategy solves each instance fastest, which is shown in the same 
table. In most cases, the first response comes very fast, usually with fewer
than 300 visited nodes in the search space (see Fig. 4.13). It is surprising
that the inconsistent instances can be solved particularly fast, which shows a
clear advantage of combining different strategies over random methods with
restarts. Random methods are in fact useless for inconsistent instances
because these methods are not complete. In order to push
our methods even further, we also looked at how well the different strategies 
complement each other, and tried to find the combination of strategies which 
solves the instances with the least accumulated number of nodes. It turns
out that four strategies ($\widehat{\mathcal{H}}_8$/static/global,
$\widehat{\mathcal{H}}_8$/dynamic/local, $\mathcal{C}_8$/dynamic/local,
$\mathcal{B}$/static/local) complement each other particularly well. By combining
these four strategies, almost all instances in the phase transition region can
be solved by restricting the combined number of visited nodes to a value which
is linear in the size of the instances. We tested this for CSPs up to a size of
500 variables, i.e., CSPs with about 25,000 relations. At that point the
increased run-time of the interleaved path-consistency computations turned out
to be the limiting factor.

A(n, d, 4.0)
Heuristics | # Hard Instances | Solved Instances | 1. Response
$\widehat{\mathcal{H}}_8$/sta/loc | 64 | 91.88% | 19.80%
$\widehat{\mathcal{H}}_8$/sta/glo | 42 | 94.67% | 12.56%
$\widehat{\mathcal{H}}_8$/dyn/loc | 52 | 93.40% | 24.37%
$\widehat{\mathcal{H}}_8$/dyn/glo | 100 | 87.31% | 13.58%
$\mathcal{C}_8$/sta/loc | 81 | 89.72% | 6.35%
$\mathcal{C}_8$/sta/glo | 58 | 92.64% | 5.20%
$\mathcal{C}_8$/dyn/loc | 78 | 90.10% | 5.96%
$\mathcal{C}_8$/dyn/glo | 108 | 86.63% | 6.60%
$\mathcal{Q}_8$/sta/loc | 81 | 89.72% | 9.77%
$\mathcal{Q}_8$/sta/glo | 54 | 93.15% | 12.06%
$\mathcal{Q}_8$/dyn/loc | 74 | 90.61% | 10.15%
$\mathcal{Q}_8$/dyn/glo | 104 | 86.80% | 12.82%
$\widehat{\mathcal{B}}$/sta/loc | 68 | 91.37% | 1.40%
$\widehat{\mathcal{B}}$/sta/glo | 89 | 88.71% | 1.27%
$\widehat{\mathcal{B}}$/dyn/loc | 70 | 91.12% | 0.89%
$\widehat{\mathcal{B}}$/dyn/glo | 162 | 79.44% | 0.89%
$\mathcal{B}$/sta/loc | 163 | 79.31% | 0.51%
$\mathcal{B}$/sta/glo | 222 | 71.83% | 0.25%
$\mathcal{B}$/dyn/loc | 209 | 73.48% | 0.51%
$\mathcal{B}$/dyn/glo | (303) | - | 0.13%
combined | 788 | 99.87% |

Table 4.4. The second column shows the number of hard instances for each heuristic; there are
788 hard instances in total. Column three shows the percentage of solved hard instances for
each heuristic and column four the percentage of first response when orthogonally running all
heuristics. Note that sometimes different heuristics are equally fast. Therefore the sum is more
than 100%.
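
How the "solved" and "first response" figures of such an experiment can be
derived is sketched below. The instance names, strategy labels and node counts
are invented for illustration; only the 10,000-node limit is taken from the
text.

```python
NODE_LIMIT = 10_000

def combine(results, limit=NODE_LIMIT):
    """results[instance][strategy] = nodes visited by that strategy.

    Returns the fraction of instances solved by at least one strategy within
    the limit, and how often each strategy gave the first response.
    """
    solved, first = 0, {}
    for counts in results.values():
        best = min(counts.values())
        if best > limit:
            continue                   # no strategy succeeds within the limit
        solved += 1
        for strategy, nodes in counts.items():
            if nodes == best:          # ties credit every fastest strategy
                first[strategy] = first.get(strategy, 0) + 1
    return solved / len(results), first

# Hypothetical measurements for three hard instances.
results = {
    "inst1": {"H8/sta/glo": 120, "C8/dyn/loc": 15_000, "B/sta/loc": 40_000},
    "inst2": {"H8/sta/glo": 25_000, "C8/dyn/loc": 80, "B/sta/loc": 80},
    "inst3": {"H8/sta/glo": 12_000, "C8/dyn/loc": 11_000, "B/sta/loc": 30_000},
}
fraction, first = combine(results)
```

In this toy data each strategy alone solves one instance out of three, whereas
the combination solves two, and the tie on `inst2` is why first-response
percentages can add up to more than 100%.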


6.6 Discussion 


We have seen that even though spatial reasoning with RCC-8 is an NP-complete
problem, we were able to solve almost all of the hardest instances
identified in our experiments in reasonable time. This is only possible through
the use of the maximal tractable subsets that we identified by a theoretical anal- 
ysis of the reasoning problem. For RCC-8 this turns out to work particularly 
well, which is due to the existence of three different maximal tractable subsets 
for RCC-8 but also because RCC-8 is a rather small algebra. Empirical studies 
for Allen’s interval algebra (Allen, 1983) which has 13 basic relations but only 
one maximal tractable subset which contains all basic relations, show that rea- 
soning is still much more efficient in practice when using the maximal tractable 
subclass than without (Nebel, 1997), but the overall practical efficiency was 
not as impressive as for RCC-8. Nevertheless, identifying maximal tractable 
subsets that contain all basic relations is an essential part if more efficient solu- 
tions to an NP-complete spatial or temporal reasoning problem are to be found. 
Although attempts have been made to identify maximal tractable subsets that
do not contain all basic relations (Krokhin et al., 2003), these subsets cannot be
used for obtaining more efficient solutions to the general NP-complete problem,
as it is not possible to split each relation into members of these maximal
tractable subsets. Another important finding is that combining different strategies leads to
much better results than trying to optimise one strategy. In that respect we can 
conclude that the more strategies the better. This includes analysing different 
heuristics as well as using different tractable subsets which even includes using 
subsets of tractable subsets. 


7. Combination of spatial calculi 


There has been a large amount of research on qualitative spatial calculi; a
fraction of it has been described in this chapter, and more in other chapters of
this book. The usefulness of these research efforts, however, largely depends
on how well this research can make its way into practical applications. Without
a doubt, space is one of the fundamental aspects of our daily life and of our
physical world, and therefore qualitative spatial representation and reasoning
should be an essential part of many applications. It is remarkable, however,
that up to now there are relatively few real applications. One reason for this
lack of applications is that research has mainly focused on understanding and 
analysing single, isolated aspects of space, like distance, direction, or topology. 
The spatial calculi we presented so far all fall into this “single aspect” category. 

On the other hand, almost all possible applications require different aspects 
of space and not only topology or only direction. Future research on qualitative 
spatial representation and reasoning should focus strongly on combining differ- 
ent aspects of space, on developing and analysing spatial calculi over different 
aspects of space, and on methods for dealing with these calculi. In this section
we will present some promising first attempts in this direction.
The first approach is by Gerevini and Renz, 2002, who combine topology
and size information and who introduce several modifications of the existing
constraint algorithms for dealing with different kinds of constraints. A second
approach is by Renz, 2001, who combines directional and topological
information for one-dimensional intervals by adding direction of intervals to
the interval algebra.


7.1 Different ways of combining multiple aspects of space 


Constraint-based approaches in principle support the use of different kinds 
of constraints if they work on the same domains. This is relatively straight- 
forward if finite domains are used where the constraint algorithms manipulate 
the domains of the variables. For qualitative spatial reasoning where infinite 
domains are used and constraint algorithms work on relation-symbols instead 
of restricting domains, this turns out to be a difficult problem. The reason for
this is that relations over one aspect are not independent of relations over
another aspect. For example, if the distance between two objects is large, they
cannot overlap. If one object is contained in another, it must be smaller.
These two simple examples show that topology is independent neither of distance
nor of size. These restrictions and dependencies must be enforced on the
relational level; they must therefore be analysed when developing a combined
calculus and precomputed like a composition table.

One way of developing a calculus for multiple aspects of space is to take
the relations for each aspect, for example two sets of basic relations
$\mathcal{R} = \{R_1, \ldots, R_n\}$ and $\mathcal{S} = \{S_1, \ldots, S_m\}$, and form new
relations as the cross product $\mathcal{R} \times \mathcal{S}$. Some of the new relations
will be empty and can be removed.
The advantage of this approach is that the dependencies of the different aspects 
are implicitly encoded in the new composition table and that all the existing 
reasoning algorithms can be used. The disadvantage is the large number of rela- 
tions that result from this approach (which makes reasoning and also analysing 
the combined calculus very time and space consuming). An example for this 
approach is by Pujari et al., 1999 in the area of temporal reasoning where the in- 
terval algebra is combined with relative durations of intervals. Another example 
is by Renz, 2001, who added direction of intervals to the interval algebra. 
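
The cross-product construction can be sketched for the basic RCC-8 relations
and three basic size relations. The dependency table used here is an
assumption based only on the containment rule (a region contained in another
one must be smaller); the names `SIZEREL`, `TPPi` and `NTPPi` are chosen for
this illustration.

```python
RCC8 = ("DC", "EC", "PO", "TPP", "NTPP", "TPPi", "NTPPi", "EQ")
SIZE = ("<", "=", ">")

# Assumed dependencies: proper parts are smaller, their converses larger,
# equal regions have equal size, and DC, EC, PO allow any size relation.
SIZEREL = {
    "TPP": {"<"}, "NTPP": {"<"},
    "TPPi": {">"}, "NTPPi": {">"},
    "EQ": {"="},
    "DC": set(SIZE), "EC": set(SIZE), "PO": set(SIZE),
}

# Cross product of both sets of basic relations, keeping non-empty pairs only.
combined = [(r, s) for r in RCC8 for s in SIZE if s in SIZEREL[r]]
print(len(RCC8) * len(SIZE), "pairs,", len(combined), "non-empty")
```

Under these assumptions 14 of the 24 pairs remain. With larger calculi the
product grows quickly, which is the disadvantage mentioned above.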

An alternative approach is to treat the different aspects separately and to 
develop new reasoning algorithms for combining different sets of constraints 
and their dependencies. Different aspects and different granularities can then be 
added in a modular way without having an explosion in the number of relations. 
One problem here is how to keep track of the interactions between the different
sets of relations, i.e., how relations of different sets can be composed,
intersected, etc. This approach will be further discussed in the following
section, where we take the combination of topology and qualitative size as an
example.

Table 4.5. Interdependencies of basic RCC-8 relations (r) and basic QS relations (s).

In any case, it is essential that different aspects can only be combined when 
they use the same underlying spatial entities, such as points or regular regions. 


7.2 Combining topological and size information 


When having two sets of basic relations $A$ and $B$ over a domain $D$, where
both of them split $D \times D$ exhaustively, it is clear that the relations of the
two sets taken together are not pairwise disjoint and, hence, cannot be
independent of each other. Instead of looking at the intersections of all the
relations and treating them as a new set of JEPD relations, we will present in
this section methods for taking the sets separately and propagating their
interactions. For this it is necessary to first look at all the interactions
that can possibly occur. As an example we take the work by Gerevini and Renz,
2002, who combined RCC-8 with qualitative size relations. Given a set $V$ of
spatial region variables, a set of QS-constraints over $V$ is a set of
constraints of the form $\mathit{size}(x)\ S\ \mathit{size}(y)$, where $S \in$ QS,
$\mathit{size}(x)$ is the size of the region $x$, $\mathit{size}(y)$ is the size of the
region $y$, and $x, y \in V$, with QS $= \{<, >, \leq, \geq, \neq, =, <=>\}$. The
interactions between the two sets of relations are rather simple and are mainly
due to the fact that regions which are contained in other regions must be
smaller than the containing region. All interactions can be found in Table 4.5.
Sizerel(r) is the qualitative size relation entailed by an RCC-8 relation r,
while Toprel(s) is the RCC-8 relation entailed by a qualitative size relation s.
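
The two mappings can be sketched as follows. Since the entries of Table 4.5
are not reproduced here, the sketch assumes that they follow the containment
rule just described; relations are represented as sets of basic relations, and
the function names and the spellings `TPPi` and `NTPPi` are hypothetical.

```python
RCC8 = frozenset({"DC", "EC", "PO", "TPP", "NTPP", "TPPi", "NTPPi", "EQ"})
SIZE = frozenset({"<", "=", ">"})

# Assumed entailments: proper parts are smaller, converses larger, EQ equal.
_SIZE_OF = {"TPP": "<", "NTPP": "<", "TPPi": ">", "NTPPi": ">", "EQ": "="}

def sizerel(t):
    """Size relation entailed by a topological relation t."""
    out = set()
    for r in t:
        out |= {_SIZE_OF[r]} if r in _SIZE_OF else SIZE
    return frozenset(out)

def toprel(s):
    """Topological relation entailed by a size relation s."""
    if not s:
        return frozenset()
    return frozenset(r for r in RCC8 if r not in _SIZE_OF or _SIZE_OF[r] in s)

def refine(t, s):
    """Restrict a (topological, size) pair by its mutual entailments."""
    t2 = t & toprel(s)
    s2 = s & sizerel(t2)
    return t2, s2

t, s = refine(frozenset({"TPP", "EQ", "PO"}), frozenset({">"}))
```

In this example the strict size relation removes both TPP and EQ from the
topological relation, leaving only PO, while a pair such as ({EQ}, {<})
would be refined to a pair of empty relations.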

Now we consider pairs of relations as new relations, i.e., we consider
constraints of the form $xRy$ where $R = (R_A, R_B)$ with $R_A \in A$ and
$R_B \in B$. This is equivalent to having two sets of constraints $\Theta$ and
$\Sigma$ over the same set of variables and the same domain, where $\Theta$
contains the RCC-8 constraints and $\Sigma$ the qualitative size constraints. Even
if both sets are independently consistent, the two sets taken together are not
necessarily consistent, as their interactions must be considered.

A natural method for deciding the consistency of a set of RCC-8 constraints
and a set of QS-constraints would be to first extend each set of constraints
with the constraints entailed by the other set, and then independently check
the consistency of the extended sets by using a path-consistency algorithm.
However, as the example below shows, this method is not complete for
$\widehat{\mathcal{H}}_8$ constraints.

Another possibility would be to compute the strongest entailed relations
(minimal relations) between each pair of variables before propagating
constraints from one set to the other. However, this method has the
disadvantage that it is computationally expensive, as the best known algorithm
for computing the minimal network of a set of constraints over either
$\widehat{\mathcal{H}}_8$, $\mathcal{C}_8$ or $\mathcal{Q}_8$ requires $O(n^5)$ time.

Finally, a third method could be based on iteratively using path-consistency 
as a preprocessing technique and then propagating the information from one 
set to the other. A similar method is used by Ladkin and Kautz to combine 
qualitative and metric constraints in the context of temporal reasoning (Kautz 
and Ladkin, 1991). Note that imposing path-consistency is sufficient for
consistency checking of a set of constraints over $\widehat{\mathcal{H}}_8$,
$\mathcal{C}_8$, $\mathcal{Q}_8$, and QS, but is incomplete for computing the minimal
relations (van Beek, 1992; Renz and Nebel, 1999). The following example shows
that the information would need to be propagated more than once, and
furthermore it is not clear whether in general this method would be complete
for detecting inconsistency.


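EXAMPLE 4.9 Consider the set $\Theta$ formed by the following $\widehat{\mathcal{H}}_8$ constraints
$x_0\{\mathsf{TPP}, \mathsf{EQ}\}x_2$, $x_1\{\mathsf{TPP}, \mathsf{EQ}, \mathsf{PO}\}x_0$, $x_1\{\mathsf{TPP}, \mathsf{EQ}\}x_2$, $x_4\{\mathsf{TPP}, \mathsf{EQ}\}x_3$,
and the set $\Sigma$ formed by the following QS-constraints
$\mathit{size}(x_0) < \mathit{size}(x_2)$, $\mathit{size}(x_3) \leq \mathit{size}(x_1)$, $\mathit{size}(x_2) \leq \mathit{size}(x_4)$.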


We have that $\Theta$ and $\Sigma$ are independently consistent, but their union
is not consistent. Moreover, the following propagation scheme does not detect
the inconsistency: (a) enforce path-consistency on $\Theta$ and $\Sigma$
independently; (b) extend $\Sigma$ with the size constraints entailed by the
constraints in $\Theta$; (c) extend $\Theta$ with the topological constraints
entailed by the constraints in $\Sigma$; (d) enforce path-consistency on $\Theta$
and $\Sigma$ again. In order to detect that $\Theta \cup \Sigma$ is inconsistent,
we need an additional propagation of constraints from the topological set to
the size set.
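
The size side of such a propagation scheme can be illustrated with a small
path-consistency sketch. It assumes that every QS relation is a set over the
three basic relations <, = and >, composed as in the point algebra; this is a
plain fixpoint iteration written for readability, not the queue-based
algorithm, and all names are hypothetical.

```python
from itertools import permutations

ALL = frozenset("<=>")
COMP = {  # composition of basic size relations
    ("<", "<"): "<", ("<", "="): "<", ("<", ">"): "<=>",
    ("=", "<"): "<", ("=", "="): "=", ("=", ">"): ">",
    (">", "<"): "<=>", (">", "="): ">", (">", ">"): ">",
}
CONV = {"<": ">", "=": "=", ">": "<"}

def compose(r, s):
    return frozenset(c for a in r for b in s for c in COMP[(a, b)])

def converse(r):
    return frozenset(CONV[a] for a in r)

def path_consistency(n, constraints):
    """Refine size relations over variables 0..n-1; None if one becomes empty."""
    rel = {(i, j): ALL for i in range(n) for j in range(n) if i != j}
    for (i, j), r in constraints.items():
        rel[(i, j)] = rel[(i, j)] & frozenset(r)
        rel[(j, i)] = converse(rel[(i, j)])
    if not all(rel.values()):
        return None
    changed = True
    while changed:
        changed = False
        for i, k, j in permutations(range(n), 3):
            refined = rel[(i, j)] & compose(rel[(i, k)], rel[(k, j)])
            if refined != rel[(i, j)]:
                if not refined:
                    return None
                rel[(i, j)] = refined
                rel[(j, i)] = converse(refined)
                changed = True
    return rel

# A cycle of four "smaller or equal" constraints forces all sizes to be equal.
cycle = {(0, 1): "<=", (1, 2): "<=", (2, 3): "<=", (3, 0): "<="}
network = path_consistency(4, cycle)
```

The derived equalities are exactly the kind of new size information that
would then have to be handed back to the topological constraints, which is why
a single exchange between the two sets may not suffice.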

Instead of directly analysing the complexity and completeness of the
propagation scheme illustrated in the previous example, Gerevini and Renz, 2002
proposed a new method for dealing with combined topological and qualitative
size constraints. In particular, they propose an $O(n^3)$ time and $O(n^2)$ space
algorithm, BIPATH-CONSISTENCY, for imposing path-consistency on a set of
constraints in RCC-8 $\cup$ QS. BIPATH-CONSISTENCY solves CSPSAT for any
input set $\Theta$ of topological constraints in either $\widehat{\mathcal{H}}_8$,
$\mathcal{C}_8$ or $\mathcal{Q}_8$, combined with any set of size constraints in QS
involving the variables of $\Theta$. Thus, although this framework is more
expressive than a purely topological one over the same set of relations (and
therefore has a larger potential applicability), the problem of deciding
consistency can be solved without additional worst-case cost.

Algorithm: BIPATH-CONSISTENCY

Input: A set $\Theta$ of RCC-8 constraints, and a set $\Sigma$ of QS-constraints over the
variables $x_1, x_2, \ldots, x_n$ of $\Theta$.

Output: fail, if $\Sigma \cup \Theta$ is not consistent; path-consistent sets equivalent to $\Sigma$
and $\Theta$, otherwise.

1. $Q \leftarrow \{(i, j) \mid i < j\}$; ($i$/$j$ indicates the $i$-th/$j$-th variable of $\Theta$.)
2. while $Q \neq \emptyset$ do
3.     select and delete an arc $(i, j)$ from $Q$;
4.     for $k \neq i$, $k \neq j$ ($k \in \{1, \ldots, n\}$) do
5.         if BIREVISION($i, j, k$) then
6.             if $R_{ik} = \emptyset$ then return fail
7.             else add $(i, k)$ to $Q$;
8.         if BIREVISION($k, i, j$) then
9.             if $R_{kj} = \emptyset$ then return fail
10.            else add $(k, j)$ to $Q$.

Function: BIREVISION($i, k, j$)

Input: three region variables $i$, $k$ and $j$

Output: true, if $R_{ij}$ is revised; false otherwise.

Side effects: $R_{ij}$ and $R_{ji}$ revised using the operations $\cap$ and $\circ$ over the
constraints involving $i$, $k$, and $j$.

1. if one of the following cases holds, then return false:
   (a) Toprel($s_{ik}$) $\cap\ t_{ik} = U_t$ and Sizerel($t_{ik}$) $\cap\ s_{ik} = U_s$,
   (b) Toprel($s_{kj}$) $\cap\ t_{kj} = U_t$ and Sizerel($t_{kj}$) $\cap\ s_{kj} = U_s$
2. $oldt := t_{ij}$; $olds := s_{ij}$;
3. $t_{ij} := (t_{ij} \cap \mathrm{Toprel}(s_{ij})) \cap ((t_{ik} \cap \mathrm{Toprel}(s_{ik})) \circ (t_{kj} \cap \mathrm{Toprel}(s_{kj})))$;
4. $s_{ij} := (s_{ij} \cap \mathrm{Sizerel}(t_{ij})) \cap ((s_{ik} \cap \mathrm{Sizerel}(t_{ik})) \circ (s_{kj} \cap \mathrm{Sizerel}(t_{kj})))$;
5. if $s_{ij} \neq olds$ then $t_{ij} := (t_{ij} \cap \mathrm{Toprel}(s_{ij}))$;
6. if ($oldt = t_{ij}$) and ($olds = s_{ij}$) then return false;
7. $t_{ji} := \mathrm{Converse}(t_{ij})$; $s_{ji} := \mathrm{Converse}(s_{ij})$;
8. return true.


Figure 4.14. BIPATH-CONSISTENCY.


BIPATH-CONSISTENCY is a modification of Vilain and Kautz' path-consistency
algorithm (Vilain and Kautz, 1986; Vilain et al., 1989) as described
by Bessière, 1996, which in turn is a slight modification of Allen's algorithm
(Allen, 1983). The main novelty of the algorithm is that BIPATH-CONSISTENCY
operates on a graph of pairs of constraints. The vertices of the graph are
constraint variables, which in our context correspond to spatial regions. Each
edge of the graph is labelled by a pair of relations formed by a topological
relation in RCC-8 and a size relation in QS. The function BIREVISION($i, k, j$)
has the same role as the function REVISE used in path consistency algorithms
for constraint networks (e.g., Mackworth, 1977). The main difference is that
BIREVISION($i, k, j$) considers pairs of (possibly interdependent) constraints,
instead of single constraints.

A formal description of BIPATH-CONSISTENCY is given in Fig. 4.14, where
$R_{ij}$ is a pair formed by a relation $t_{ij}$ in RCC-8 and a relation $s_{ij}$
in QS; $R_{ij} = \emptyset$ when $t_{ij} = \emptyset$ or $s_{ij} = \emptyset$; $U_t$
indicates the universal relation in RCC-8 and $U_s$ the universal relation in QS.

Gerevini and Renz, 2002 prove soundness and completeness of BIPATH- 
CONSISTENCY for the maximal tractable subsets of RCC-8 combined with 
qualitative size relations. 


THEOREM 4.10 Given a set $\Theta$ of constraints in either $\widehat{\mathcal{H}}_8$,
$\mathcal{C}_8$ or $\mathcal{Q}_8$, and a set $\Sigma$ of constraints in QS involving
variables in $\Theta$, consistency of $\Theta \cup \Sigma$ can be decided using the
BIPATH-CONSISTENCY algorithm in $O(n^3)$ time and $O(n^2)$ space, where $n$ is
the number of variables involved in $\Theta$ and $\Sigma$.


Using the BIPATH-CONSISTENCY algorithm, combined sets of constraints can
be solved in cubic time just like the normal path-consistency algorithm for
each of the two sets alone, i.e., they can be solved without additional
worst-case cost. Soundness and completeness of BIPATH-CONSISTENCY do not
hold automatically and have to be proved anew for each combination of different
relations. Sometimes, however, the computational properties of a combined
calculus can be more favourable than those of its components alone, if the
interactions with the other type of relations refine relations that make
deciding consistency NP-hard to relations for which it is tractable. As can be
seen in Table 4.5, whenever we have a definite qualitative size constraint
$\mathit{size}(x)\ S\ \mathit{size}(y)$ with $S \in \{<, >, =\}$ and this constraint is
combined with an RCC-8 constraint $xRy$ resulting in $xR'y$, then $R'$ will
contain basic relations of at most one of the sets $\{\mathsf{TPP}, \mathsf{NTPP}\}$,
$\{\mathsf{TPP}^{-1}, \mathsf{NTPP}^{-1}\}$, $\{\mathsf{EQ}\}$ and possibly some relations of
the set $\{\mathsf{DC}, \mathsf{EC}, \mathsf{PO}\}$. In other words, any relation
$R \in$ RCC-8 $\setminus\ \widehat{\mathcal{H}}_8$ will be refined to a relation
$R' \in \widehat{\mathcal{H}}_8$ and therefore it is possible that sets of constraints
for which it is NP-hard to decide consistency become tractable after adding
constraints over a different set of relations.

The opposite is of course also possible, namely that combining two calculi
results in a calculus that has a higher complexity than both alone. This is the
case for another combination that Gerevini and Renz analysed, namely, combining
RCC-8 with metric size information. They considered metric size constraints of
different kinds: metric relative size constraints
$\mathit{size}(x)\ R\ \alpha \cdot \mathit{size}(y)$, where $\alpha$ is a positive rational
number; size difference constraints $\mathit{size}(x) - \mathit{size}(y) \in I$, where $I$
is a continuous interval of rational numbers; and domain size constraints
$\mathit{size}(x) \in I$. The main difference of combining RCC-8 with metric size
information as compared to qualitative size information is that it is possible
to express that a set of regions completely fills another region. Combining
RCC-8 with any of these metric size calculi, which are all independently
tractable, leads to NP-hardness even when combined with only the RCC-8 basic
relations. Without the PO relation, i.e., considering only the 7 other RCC-8
basic relations, the combination is tractable though.

This was an example of how different calculi can be combined. Future
research within qualitative spatial representation and reasoning should deal
with modularising different aspects and different granularities in a similar
way to how topology and size were combined here, studying their interactions
and developing algorithms for reasoning about combined calculi. Different
applications could then select the modules they need, together with the
interactions between these modules, and combine them using algorithms like
BIPATH-CONSISTENCY. If possible, these combinations should have favourable
computational properties and should enable efficient solutions.


7.3 Combining topological and directional information 
for intervals 


In this section we give an example of a combination of two aspects of
space that relies upon forming new relations out of two given sets of relations.
One of the two aspects of space we are looking at is the interval algebra (IA)
(Allen, 1983), which was originally defined for temporal reasoning. However,
for applications that require only a one-dimensional spatial representation,
it makes sense to use the interval algebra spatially. Some possible
applications come from the area of traffic management. Roads and railway lines
can be regarded as one-dimensional routes, and air and sea traffic also mainly
operates on given routes. A single route can be represented as a
one-dimensional space and the vehicles on a route as intervals. The main
difference between vehicles on a route and the intervals of the interval
algebra is that vehicles, and also routes, have a direction. This can be
captured by extending the interval algebra with the direction of intervals
(Renz, 2001). Direction in a one-dimensional space is quite simple, as there
are only two directions: front and back, or same direction and different
direction.

Figure 4.15. Four structurally different instantiations of the relation “x behind y” with directed
intervals.

A straightforward way of dealing with directed intervals would be to add
additional constraints on the direction of intervals to constraints over the
Interval Algebra and treat the two types of constraints separately while
propagating information from one type to the other (similar to what has been
done in Gerevini and Renz, 1998). We say that an interval has positive
direction if it has the same direction as the underlying line and negative
direction otherwise. Possible direction constraints could then be unary
constraints like “x has positive/negative direction” or binary constraints
like “x and y have the same/opposite direction”. This approach, however, is
not possible, since the Interval Algebra loses its property of being a
relation algebra when directed intervals are permitted. This can easily be
seen when considering the “behind” relation of Fig. 4.15. The actual converse
of “x behind y” is a subset of “y is behind or in front of x”, which cannot be
expressed within the Interval Algebra. If we use “y is behind or in front of
x” as the converse of “x behind y”, whose converse is again “x is behind or in
front of y”, then applying the converse operation ($\cdot^{\smile}$) twice
leads to a relation different from the original one. This contradicts one of
the requirements of relation algebras, $(R^{\smile})^{\smile} = R$ (Ladkin and
Maddux, 1994). The contradiction does not occur when we refine the “behind”
relation into two disjoint sub-relations “behind$_=$” and “behind$_{\neq}$”,
where the subscript indicates that both intervals have the same ($=$) or
opposite ($\neq$) direction. The converses of these two relations are
“in-front-of$_=$” and “behind$_{\neq}$”, respectively. Applying the converse
operation again leads back to the original relations.
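
The behaviour of the converse can be checked by brute force on a small model. The following Python sketch is only an illustration under stated assumptions: directed intervals are triples `(start, end, direction)` over six integer points, `direction` is `+1` for positive and `-1` for negative direction, only disjoint intervals are classified, and “behind”/“in-front-of” is read relative to the direction of the second argument. The function names are hypothetical.

```python
from itertools import product


def relation(x, y):
    """Classify disjoint directed intervals x, y = (start, end, direction).

    Returns (part, d) where part is read relative to the direction of y and
    d records whether x and y have the same ("=") or opposite ("!=") direction.
    """
    (xs, xe, xd), (ys, ye, yd) = x, y
    if xe < ys:        # x lies to the left of y on the underlying line
        part = "behind" if yd == +1 else "in-front-of"
    elif ye < xs:      # x lies to the right of y
        part = "in-front-of" if yd == +1 else "behind"
    else:
        return None    # touching or overlapping intervals are outside this sketch
    return part, "=" if xd == yd else "!="


def converses(points=range(6)):
    """Map each refined relation r to the set of relations of (y, x) when x r y holds."""
    intervals = [(s, e, d) for s, e in product(points, repeat=2) if s < e
                 for d in (+1, -1)]
    conv = {}
    for x, y in product(intervals, repeat=2):
        r = relation(x, y)
        if r is not None:
            conv.setdefault(r, set()).add(relation(y, x))
    return conv


CONV = converses()
undirected_behind = CONV[("behind", "=")] | CONV[("behind", "!=")]
```

In this model each refined relation has exactly one converse, whereas `undirected_behind` mixes an “in-front-of” case with a “behind” case, which is the situation that cannot be expressed in the Interval Algebra.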

Since a relation algebra must be closed under composition, intersection, and
converse, we have to make the same distinction for all other IA relations as
well. This leads to the definition of the directed intervals algebra DIA
(Renz, 2001). It consists of the 26 basic relations given in Table 4.6, which
result from refining each IA relation into two sub-relations specifying either
same or opposite direction of the involved intervals, and of all possible
unions of the basic relations. This gives a total of $2^{26}$ DIA relations.
Converse relations are given in the same table entry. If a converse relation
is not explicitly given, the corresponding relation is its own converse. We
denote the set of 26 DIA basic relations as $\mathcal{B}$. Then
$\mathrm{DIA} = 2^{\mathcal{B}}$. Complex relations which are the union of more
than one basic relation $R_1, \ldots, R_k$ are written as
$\{R_1, \ldots, R_k\}$. The union of all basic relations, the universal
relation, is denoted $\{*\}$.

A DIA basic relation $R = I_d$ consists of two parts: the interval part $I$,
which is a spatial interpretation of the Interval Algebra, and the direction
part $d$, which gives the mutual direction of both intervals, either $=$ or
$\neq$. If a complex relation $R$ consists of basic relations with the same
direction part $d$, we can combine the interval parts and write
$R = \{I^1, \ldots, I^k\}_d$ instead of $R = \{I^1_d, \ldots, I^k_d\}$. We
write $R_e$ (resp. $R_n$) in order to refer to the union of the interval parts
of every sub-relation of a complex relation $R$ where the direction part is
$\{=\}$ (resp. $\{\neq\}$). In this way, every DIA relation $R$ can be written
as $R = \{R_e\}_= \cup \{R_n\}_{\neq}$. $\mathrm{DIA}_I$ denotes the set of
$2^{13}$ possible interval parts of DIA relations.

Directed intervals                    Symbol   Pictorial
basic relation                                 example

x behind= y                           b=       -x->
y in-front-of= x                      f=              -y->

x behind≠ y                           b≠       <-x-
                                                      -y->

x in-front-of≠ y                      f≠       -x->
                                                      <-y-

x meets-from-behind= y                mb=      -x->
y meets-in-the-front= x               mf=          -y->

x meets-from-behind≠ y                mb≠      <-x-
                                                   -y->

x meets-in-the-front≠ y               mf≠      -x->
                                                   <-y-

x overlaps-from-behind= y             ob=      --x-->
y overlaps-in-the-front= x            of=         --y-->

x overlaps-from-behind≠ y             ob≠      <--x--
                                                  --y-->

x overlaps-in-the-front≠ y            of≠      --x-->
                                                  <--y--

x contained-in= y                     c=         -x->
y extends= x                          e=       ----y--->

x contained-in≠ y                     c≠         <-x-
y extends≠ x                          e≠       ----y--->

x contained-in-the-back-of= y         cb=      -x->
y extends-the-front-of= x             ef=      ----y--->

x contained-in-the-back-of≠ y         cb≠      <-x-
y extends-the-back-of≠ x              eb≠      ----y--->

x contained-in-the-front-of= y        cf=           -x->
y extends-the-back-of= x              eb=      ----y--->

x contained-in-the-front-of≠ y        cf≠           <-x-
y extends-the-front-of≠ x             ef≠      ----y--->

x equals= y                           eq=      --x-->
                                               --y-->

x equals≠ y                           eq≠      --x-->
                                               <--y--

Table 4.6. The 26 basic relations of the directed intervals algebra.

Table 4.7. IA basic relations $R$, their reverses $R^r$, and their spatial interpretations $\mathrm{dia}(R)$.


It is important to note that the spatial interpretation of the Interval Algebra
was chosen in such a way that the interval part of a relation $x I_d y$ depends
only on the direction of $y$ and not on the direction of $x$. Therefore, if the
direction of $x$ is reversed, written as $\bar{x}$, then only the direction
part changes, i.e., $x I_d y = \bar{x} I_{\neg d}\, y$. This would not be the
case in a straightforward spatial interpretation of the original temporal
relations. For instance, IA relations like “x started-by y” or “x finished-by
y” depend on the direction of x. Instead, these relations are interpreted
spatially as “x extends-the-front/back-of y” and “x
contained-in-the-front/back-of y”. This interpretation is independent of the
direction of x. When all intervals have the same direction, both
interpretations are equivalent. In order to transform the spatial and the
temporal interval relations (independent of the direction of the intervals)
into each other, we introduce two mutually inverse functions
$\mathrm{dia} : \mathrm{IA} \to \mathrm{DIA}_I$ and
$\mathrm{ia} : \mathrm{DIA}_I \to \mathrm{IA}$, i.e.,
$\mathrm{dia}(\mathrm{ia}(R)) = R$ and $\mathrm{ia}(\mathrm{dia}(R)) = R$. The
mapping is given in Table 4.7.

All relations of the directed intervals algebra are invariant with respect to the
direction of the underlying line, i.e., when reversing the direction of the line,
all relations remain the same. This is obviously not the case for the Interval
Algebra: e.g., if x is before y and one reverses the direction of the time line,
then x is after y. In order to transform DIA relations into the corresponding IA
relations and vice versa, we introduce a unary reverse operator ($\cdot^r$) on
relations $R$ such that $R^r$ specifies the relation which results from $R$ when
reversing the direction of the underlying line. For all relations
$R \in \mathrm{DIA}$ we have that $R^r = R$. For IA relations, the reverse
relation is given in Table 4.7. The reverse of a complex relation is the union
of the reverses of the involved basic relations. The reverse of the composition
($\circ$) of two relations is equivalent to the composition of the reverses of
the two involved relations, i.e., $(R \circ S)^r = R^r \circ S^r$. Applying the
reverse operator twice results in the original relation, i.e., $R^{rr} = R$.
Using the reverse operator we can also specify what happens with a relation
$x I_d y$ if only the direction of $y$ is changed. Then the topological relation
of the intervals stays the same, but the order changes, i.e., “front” becomes
“behind”/“back” and vice versa. The mutual direction also changes. This can be
expressed in the following way:
$x I_d y = x\ \mathrm{dia}(\mathrm{ia}(I)^r)_{\neg d}\ \bar{y}$.

We now have all requirements for computing the composition ($\circ_d$) of DIA
relations using the composition of IA relations (denoted here by $\circ_{ia}$)
as specified by Allen, 1983.

THEOREM 4.11 Let $R_p$, $S_q$ be DIA basic relations.
1 If $q = \{=\}$, then $R_p \circ_d S_q = \mathrm{dia}(\mathrm{ia}(R) \circ_{ia} \mathrm{ia}(S))_p$.
2 If $q = \{\neq\}$, then $R_p \circ_d S_q = \mathrm{dia}(\mathrm{ia}(R)^r \circ_{ia} \mathrm{ia}(S))_{\neg p}$.

The composition of complex relations is, as usual, the union of the
compositions of the contained basic relations. It follows from the closure
properties of the Interval Algebra that DIA is closed under composition,
intersection, converse, and reverse.

We have now obtained a new set of relations together with the necessary
operations that cover the combination of the original two aspects. As opposed
to the combination of topology and size, we can use the standard backtracking
and path-consistency algorithms for reasoning about these relations. However,
since we now have 26 basic relations and $2^{26}$ possible subsets, it is much
more difficult to analyse computational properties for these relations than it
is for the interval algebra. Fortunately, it is partly possible to use the
complexity results of the interval algebra in order to find complexity results
for the directed intervals algebra. NP-hardness of the directed intervals
algebra follows immediately from NP-hardness of the interval algebra.
Tractable subsets can also be identified by exploiting results for the
interval algebra. Among them, most importantly, the set of DIA basic relations
was shown to be tractable. In the proof of this result it is not important
that all non-universal relations are basic relations, only that all
non-universal relations consist of DIA basic relations with the same direction
part. Therefore, tractability for the basic relations can be extended to the
following result.


THEOREM 4.12 Let $\mathcal{S}$ be a tractable subset of the Interval Algebra
which is closed under the reverse operator. Then
$\mathcal{S}^{\pm} = \{\mathrm{dia}(R)_= \mid R \in \mathcal{S}\} \cup \{\mathrm{dia}(R)_{\neq} \mid R \in \mathcal{S}\} \cup \{*\}$
is a tractable subset of the directed intervals algebra.





Renz, 2001 showed that ORD-Horn, the only maximal tractable subset of the
interval algebra that contains all basic relations, is closed under the reverse
operator. Therefore, the above theorem also applies to ORD-Horn. What is
more, it was shown that path-consistency decides consistency for
$\mathcal{H}^{\pm}$, the set of DIA relations which results from ORD-Horn (see
Theorem 4.12).

THEOREM 4.13 Path-consistency decides CSPSAT($\mathcal{H}^{\pm}$).


Apart from these initial results, the complexity of the directed intervals
algebra is open, and maximal tractable subsets have not yet been identified.


8. Conclusions


In this chapter we have shown how spatial information can be represented
using constraint calculi. This is a natural way of using qualitative spatial
calculi and allows us to use methods and techniques developed for constraint
satisfaction problems. We introduced qualitative spatial calculi and
constraint satisfaction methods for reasoning over these calculi, and presented
methods for analysing the computational properties of spatial calculi. These
methods are very general and can be used for different kinds of spatial
calculi. We showed how a computational analysis can lead to very efficient
reasoning even though the reasoning problems are NP-hard.

Using the presented methods we can specify a roadmap for how spatial calculi 
should be analysed in order to find efficient reasoning methods: 


1 Define a useful set of basic relations $\mathcal{B}$ over a certain aspect of
space, on a certain level of granularity and over a certain spatial domain.

2 Formally compute the composition table.

3 Try to find an NP-hardness proof for CSPSAT($2^{\mathcal{B}}$) using the
method of polarity and clause constraints.

4 Try to show that CSPSAT($\mathcal{B}$) is tractable and that path-consistency
decides consistency.

5 Identify larger tractable sets by applying the closure and the refinement
methods. If possible, identify maximal tractable subsets.

6 Using an empirical analysis, identify the combination of strategies that
is most effective in solving instances of CSPSAT($2^{\mathcal{B}}$).


These methods work for calculi based on a single aspect of space. For
most practical applications, however, it is necessary to combine more than one
aspect of space. In this chapter we presented some examples of how relations
can be combined. One way is to form new relations that cover the combined
aspects; another way is to treat the relations separately and to keep track of
and propagate their interactions, as is done by Gerevini and Renz’s
bipath-consistency algorithm.

It is one of the main challenges of qualitative spatial reasoning to provide
general methods for combining different calculi and for analysing the
computational properties of combined calculi. What we presented here is only a
small step in this direction, and much future research is required. The goal of
such an analysis could be a toolbox of calculi for different spatial and also
temporal aspects at different granularities, together with efficient algorithms
for their combinations. Each application could then pick the required sets of
relations and the most efficient algorithms for combining these relations.


1. Modal logics and spatial structures
1.1 What does modal logic have to do with space?


Despite historical links between the foundations of mathematics and the
development of axiomatic geometry, substantial logics for significant spatial
structures have been scarce. Perhaps the best-known examples are both due to
Tarski. The first is his still amazing work on the first-order theory of elementary
Euclidean geometry, including the surprising proof of its decidability, and the
resulting abstract theory of real-closed fields. This was the metamathematical
finale to Hilbert’s Foundations of Geometry, itself the culmination of Euclid’s
Elements. This strand is taken up by several chapters in this handbook (Ch. 2,
Ch. 7), but it will be mentioned only in passing in this chapter. For our
purposes here, the founding event is Tarski’s topological interpretation of
modal logic, culminating in his proof with McKinsey that the simple decidable
modal logic S4 is complete for interpreting modal $\Diamond$ as topological
closure on the reals or any metric space like it. In what follows we
concentrate on the latter modal direction in spatial logics, which is also
represented in several other chapters of the handbook (Ch. 3, Ch. 7, Ch. 10).

It seems fair to say that there are mostly scattered results in this modal
line, suggestive though they may be. To quickly survey several diverse
directions in this line, we recall Segerberg, 1973 on two-dimensional modal
logics, Shehtman, 1983 on logics of physical structures (which was part of
Dragalin’s program of investigating modal logics of geometrical structures in
physical spaces), Goldblatt, 1980 on the logic of Minkowski space-time,
Chellas, 1980 on neighborhood semantics (originally proposed by Montague and
Scott in the 1960s), the appendix of van Benthem, 1983b on calculi for relative
nearness, the work of the ‘Georgian School’ in modal logics of topology (partly
surveyed in Esakia, 2004), Bennett, 1995 on the ‘calculus of regions’, Venema,
1999 on ‘compass logic’ in the two-dimensional plane, and Stebletsova, 2000 and
Stebletsova and Venema, 2001 on modal logics for projective geometry. So
far these ingredients have never added up to one coherent tradition of ‘spatial
logic’, although some attempts have been made occasionally (cf. Anger et al.,
1996). In contrast to this state of affairs, temporal logic has been a thriving
research program for many years (cf. van Benthem, 1995 or Hodkinson and
Reynolds, 2006). One of the goals of this handbook in general and our chapter
in particular is to fill this gap.

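Our starting point is the topological interpretation of modal logic (Tarski,
1938; McKinsey and Tarski, 1944), which we state in the modern
truth-conditional format. The basic language $\mathcal{L}$ has a countable set
$P$ of proposition letters, boolean connectives $\neg, \vee, \wedge, \to$, and
modal operators $\Box, \Diamond$. A topological model, or simply a topo-model,
is a topological space $(X, \tau)$ equipped with a valuation function
$\nu : P \to \mathcal{P}(X)$.

DEFINITION 5.1 (BASIC TOPOLOGICAL SEMANTICS) Truth of modal formulas is
defined inductively at points $x$ in a topo-model $M = (X, \tau, \nu)$:

$$
\begin{array}{lll}
M, x \models p & \text{iff} & x \in \nu(p) \quad \text{for each } p \in P\\
M, x \models \neg\varphi & \text{iff} & \text{not } M, x \models \varphi\\
M, x \models \varphi \wedge \psi & \text{iff} & M, x \models \varphi \text{ and } M, x \models \psi\\
M, x \models \Box\varphi & \text{iff} & \exists U \in \tau\,(x \in U \text{ and } \forall y \in U\colon M, y \models \varphi)\\
M, x \models \Diamond\varphi & \text{iff} & \forall U \in \tau\,(x \in U \rightarrow \exists y \in U\colon M, y \models \varphi).
\end{array}
$$

As usual we can economize by defining, e.g., $\varphi \vee \psi$ as
$\neg(\neg\varphi \wedge \neg\psi)$, and $\Diamond\varphi$ as
$\neg\Box\neg\varphi$. We will do this whenever convenient.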
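
On a finite topological space the truth definition can be evaluated directly, since $\Box$ then amounts to taking the interior and $\Diamond$ to taking the closure of the set of points where the argument holds. The following Python sketch illustrates this; the encoding of formulas as nested tuples, the three-point example space and all function names are assumptions made for the illustration.

```python
from itertools import chain


def interior(A, topology):
    """Union of all open sets contained in A."""
    return frozenset(chain.from_iterable(U for U in topology if U <= A))


def closure(A, topology, X):
    """Complement of the interior of the complement of A."""
    return X - interior(X - A, topology)


def extension(phi, X, topology, val):
    """Set of points of the topo-model (X, topology, val) at which phi is true.

    Formulas are nested tuples: ("p",) for a letter, ("not", f), ("and", f, g),
    ("box", f) and ("dia", f).
    """
    op = phi[0]
    if op == "not":
        return X - extension(phi[1], X, topology, val)
    if op == "and":
        return extension(phi[1], X, topology, val) & extension(phi[2], X, topology, val)
    if op == "box":
        return interior(extension(phi[1], X, topology, val), topology)
    if op == "dia":
        return closure(extension(phi[1], X, topology, val), topology, X)
    return val[op]  # proposition letter


# A small example space: the open sets form a chain.
X = frozenset({1, 2, 3})
TOPOLOGY = [frozenset(), frozenset({1}), frozenset({1, 2}), X]
VAL = {"p": frozenset({1, 3})}

p = ("p",)
inner = extension(("box", p), X, TOPOLOGY, VAL)
boundary = extension(("and", ("dia", p), ("dia", ("not", p))), X, TOPOLOGY, VAL)
```

In this example `inner` is the interior of the region named by `p`, and `boundary` collects the points every open neighbourhood of which meets both `p` and its complement.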














This usual symbolic truth definition has an immediate spatial interpretation.
Given any concrete model, each formula of the language denotes a region of
the topological space being modelled. For instance, take the real plane
$\mathbb{R}^2$ with the standard topology. Consider a valuation function having
some spoon-shaped region as the value of the proposition letter $p$, as
depicted in Fig. 5.1.a. Then the formula $\neg p$ denotes the region not
occupied by the spoon, i.e., the background; the formula $\Box p$ denotes the
interior of the spoon region $p$; and so on, as explained in Fig. 5.1.

Figure 5.1. Each modal formula identifies a region in a topological space. (a) A spoon, $p$.
(b) The container part of the spoon, $\Box p$. (c) The boundary of the spoon,
$\Diamond p \wedge \Diamond\neg p$. (d) The container part of the spoon with its boundary,
$\Diamond\Box p$. (e) The handle of the spoon, $p \wedge \neg\Diamond\Box p$.
In this case the handle does not contain the junction handle-container point. (f) The junction
handle-container point of the spoon, $\Diamond\Box p \wedge \Diamond(p \wedge \neg\Diamond\Box p)$:
a singleton in the topological space.



























































Thus, a simple modal language can define regions in space in a perspicuous
and appealing way, and allow us to check assertions about them. Moreover, the
same modal notation also facilitates spatial reasoning. For instance, the valid
axiom $\Box(p \wedge q) \leftrightarrow (\Box p \wedge \Box q)$ says that two
ways of describing a region (either as the interior of the intersection of two
sets or as the intersection of the interiors of those sets) always amount to
the same thing. Thus, modal logic is also a small inference engine for basic
spatial manipulations.
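
The content of this axiom can be checked exhaustively on a finite space. The Python sketch below does so for one hypothetical four-point topology; the space and all names are chosen only for the illustration. For contrast it also tests the analogous equation for unions, which is not claimed in the text.

```python
from itertools import chain, combinations

X = frozenset({0, 1, 2, 3})
# Open sets of a small example topology (closed under unions and intersections).
TOPOLOGY = [frozenset(s) for s in ([], [0], [1], [0, 1], [0, 1, 2], [0, 1, 2, 3])]


def interior(A):
    """Union of all open sets contained in A."""
    return frozenset(chain.from_iterable(U for U in TOPOLOGY if U <= A))


SUBSETS = [frozenset(c) for k in range(len(X) + 1) for c in combinations(sorted(X), k)]

# Interior of an intersection versus intersection of interiors.
distributes_over_and = all(
    interior(A & B) == interior(A) & interior(B) for A in SUBSETS for B in SUBSETS
)
# The corresponding equation for unions, tested for comparison.
distributes_over_or = all(
    interior(A | B) == interior(A) | interior(B) for A in SUBSETS for B in SUBSETS
)
```

On this space `distributes_over_and` holds for all pairs of subsets, while `distributes_over_or` fails, for instance for the sets {0, 2} and {1}.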

We will consider other modal languages and logics for spatial structures later 
on. For the moment we merely point out that the preceding example contains 
two different perspectives on the encounter between modal logic and space. 
Some modal logicians see topological models as a means of providing new se- 
mantics for existing modal languages, mostly for logic-internal purposes. This 
can be motivated a bit more profoundly by thinking of topologies as models for 
information, making this interest close to central logical concerns. But some- 
one primarily interested in space as such will not worry about the semantics of 
modal languages. She will rather be interested in spatial structures by them- 
selves, and spatial logics will be judged by how well they analyze old structures, 
discover new ones, and help in reasoning about them. Both perspectives will
play a role in our presentation, with the mathematics largely the same, but the
sort of issues suggested sometimes a bit different.

In the remainder of this introductory section we discuss these issues in more
detail, as a first pass through the topic. In Sec. 1.7 we then describe the setup
for the rest of this chapter.


1.2 Relational semantics for modal logic


The standard models for modal logic are the well-known binary relational 
graphs, with necessity interpreted as truth in all accessible worlds, and possi- 
bility as truth in at least one accessible world: 














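$$
\begin{array}{lll}
M, s \models \Box\varphi & \text{iff} & \forall t\,(sRt \rightarrow M, t \models \varphi)\\
M, s \models \Diamond\varphi & \text{iff} & \exists t\,(sRt \text{ and } M, t \models \varphi).
\end{array}
$$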
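
For finite models these two clauses translate directly into a recursive evaluation procedure. The Python sketch below is a minimal illustration; the representation of the accessibility relation as a dictionary of successor sets, the tuple encoding of formulas, and the three-world example are assumptions of the sketch.

```python
def holds(phi, s, R, val):
    """Truth of phi at world s in the relational model (R, val).

    R maps each world to the set of its successors; val maps each proposition
    letter to the set of worlds where it is true. Formulas are nested tuples:
    ("p",), ("not", f), ("and", f, g), ("box", f), ("dia", f).
    """
    op = phi[0]
    if op == "not":
        return not holds(phi[1], s, R, val)
    if op == "and":
        return holds(phi[1], s, R, val) and holds(phi[2], s, R, val)
    if op == "box":   # true in all accessible worlds
        return all(holds(phi[1], t, R, val) for t in R[s])
    if op == "dia":   # true in at least one accessible world
        return any(holds(phi[1], t, R, val) for t in R[s])
    return s in val[op]


# Example: a sees b and c, b sees itself, c is a dead end.
R = {"a": {"b", "c"}, "b": {"b"}, "c": set()}
VAL = {"p": {"b"}}
p = ("p",)

possibly_p_at_a = holds(("dia", p), "a", R, VAL)
necessarily_p_at_a = holds(("box", p), "a", R, VAL)
```

At the dead end `c` every formula of the form `("box", f)` is vacuously true and every `("dia", f)` is false, which is a feature of the relational clauses rather than of the sketch.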











In this chapter we presuppose a basic acquaintance with modal logic in this 
style. We refer to Blackburn et al., 2001 and van Benthem and Blackburn, 2006 
for a quick introduction in a modern spirit. In particular, here are some core 
themes that will occur below. 

The natural measure of expressive power for the basic modal language over
the class of arbitrary relational models is the invariance of all formulas under
bisimulations between models M, w and N, v, which provides the right measure
for structural equivalence as far as the language is concerned. This invariance
analysis can be fine-tuned by playing Ehrenfeucht-Fraïssé-type model
comparison games between models (Doets, 1996), in which the Duplicator player
has a winning strategy in a k-round game iff the two models M, w and N, v
satisfy the same modal formulas up to modal operator depth k. As for
axiomatics, the class of all standard models validates precisely the minimal
modal logic K, whose most noteworthy principle is the above-mentioned
distributivity of modal $\Box$ over conjunction. But deductive power goes up
on special model classes. E.g., the modal logic S4 with axioms
$\Box p \to p$ and $\Box p \to \Box\Box p$ is complete for the class of all
reflexive and transitive frames, and there is a host of other natural stronger
logics. These correspondences between natural conditions on accessibility
relations in graphs and modal axioms of certain shapes can also be studied per
se as a matter of semantic definability. There are even powerful methods for
automatic analysis of modal axioms for their frame content. But in addition to
deductive power and correspondence analysis of the basic language, there is
also expressive power: the ability to say more about the same class of
structures. Many modal languages in use extend the basic propositional
formalism mentioned above by adding operators such as the ‘universal modality’
(‘true in all worlds’), or temporal-style operators like ‘Until’ or ‘Since’.

Modal Logics of Space 221

Finally, to complete this lightning summary, modal languages are designed
with a certain balance in mind. E.g., the basic modal language is like the
language of first-order logic in that it allows for quantification over objects.
But this quantification is only ‘local’ or ‘bounded’, tied by accessibility to
the current world. Trading in some first-order expressive power in this way,
however, comes with a bonus: validity and satisfiability in the basic modal
language are decidable, indeed PSPACE-complete. Moreover, looking at other
key tasks for a logical calculus, it may be noted that model checking for finite
models is PSPACE-complete for the full first-order language, whereas it takes
only polynomial time for modal logic. Likewise, testing two finite models for
the existence of a bisimulation can be done in polynomial time, whereas the
corresponding problem for the complete first-order language is the so-called
Graph Isomorphism Problem, which is known to be in NP but for which no
polynomial-time algorithm is known. More generally, extended modal languages
try to boost expressive power on relevant structures, while skirting the cliffs
of complexity. Well-known examples of such trade-offs much higher up are the
‘guarded fragment’ of first-order logic (Andréka et al., 1998) or the
non-first-order modal ‘μ-calculus’, which enriches the basic modal language
with operators for smallest and greatest fixed-points (Harel et al., 2000).
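
That bisimilarity of finite models can be tested in polynomial time is visible already in a naive procedure: start from all pairs of worlds that agree on the proposition letters and repeatedly delete pairs that violate the back or forth condition. The Python sketch below illustrates this idea and makes no claim to be an efficient algorithm; the model representation and all names are assumptions of the sketch.

```python
def largest_bisimulation(R1, val1, R2, val2):
    """Largest bisimulation between two finite relational models.

    R1, R2 map worlds to successor sets; val1, val2 map worlds to the
    frozenset of proposition letters true there.
    """
    Z = {(s, t) for s in R1 for t in R2 if val1[s] == val2[t]}
    changed = True
    while changed:
        changed = False
        for s, t in list(Z):
            # forth: every successor of s is matched by some successor of t
            forth = all(any((s2, t2) in Z for t2 in R2[t]) for s2 in R1[s])
            # back: every successor of t is matched by some successor of s
            back = all(any((s2, t2) in Z for s2 in R1[s]) for t2 in R2[t])
            if not (forth and back):
                Z.discard((s, t))
                changed = True
    return Z


P = frozenset({"p"})
# One reflexive point versus a two-cycle versus a dead end, all satisfying p.
LOOP, LOOP_VAL = {"a": {"a"}}, {"a": P}
CYCLE, CYCLE_VAL = {"u": {"v"}, "v": {"u"}}, {"u": P, "v": P}
END, END_VAL = {"x": set()}, {"x": P}

loop_vs_cycle = largest_bisimulation(LOOP, LOOP_VAL, CYCLE, CYCLE_VAL)
loop_vs_end = largest_bisimulation(LOOP, LOOP_VAL, END, END_VAL)
```

Each pass of the outer loop either removes a pair or terminates, so the number of passes is bounded by the number of pairs of worlds. In the example the reflexive point is bisimilar to both points of the cycle, although the two models are not isomorphic, and it is not bisimilar to the dead end.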

Even though these features of modal logic have not evolved for specific
spatial reasons, they are often germane to thinking about space. First of all,
binary relational models themselves are a form of geometrization of modal
semantics. Of course, they resemble abstract graphs and diagrams rather than
regions of Euclidean spaces, but still, geometrical intuitions play a role in
understanding how it all works. Indeed, models of this sort can represent
significant spatial structures. An example is the work of Shehtman, 1983 and
Goldblatt, 1980 from the early 1980s (cf. also Ch. 11). Interestingly, in
relativistic space-time, the crucial primitive notion is not the ternary
‘betweenness’ of classical geometry, but the binary relation $Cxy$ of forward
causal accessibility, which runs from a point $x$ to all points $y$ in the
interior of its future light-cone, where causal signals can reach (see
Fig. 5.2.a).

Figure 5.2. Forward modality in Minkowski space-time (a) and validity of the S4.2 Confluence
Axiom (b).

Shehtman and Goldblatt independently proved that the complete modal logic
of forward causal accessibility equals the modal logic S4.2, which extends S4
with the so-called ‘Confluence Axiom’ $\Diamond\Box p \to \Box\Diamond p$. The
latter principle is illustrated in Fig. 5.2.b. It expresses the relativistic
fact that any two different causal futures, as seen from the current point,
even when not causally connected themselves, could potentially still lead to a
common future history. Again we see how modal formulas express significant
facts about space(-time).
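
The frame condition behind the Confluence Axiom can be seen on two small finite frames, which of course only mimic the shape of the situation and are not models of Minkowski space-time. In the Python sketch below, a hypothetical "diamond" frame in which two futures meet again is compared with a "fork" in which they do not; validity is tested by running through all valuations of $p$. All names are assumptions of the sketch.

```python
from itertools import chain, combinations


def closure_rt(worlds, edges):
    """Reflexive-transitive closure of the given edges, as a successor map."""
    R = {w: {w} for w in worlds}
    for s, t in edges:
        R[s].add(t)
    changed = True
    while changed:
        changed = False
        for s in worlds:
            new = set().union(*(R[t] for t in R[s])) - R[s]
            if new:
                R[s] |= new
                changed = True
    return R


def box(A, R):
    return {s for s in R if R[s] <= A}


def dia(A, R):
    return {s for s in R if R[s] & A}


def confluence_axiom_valid(R):
    """Check dia box p -> box dia p at every world, for every valuation of p."""
    worlds = list(R)
    valuations = chain.from_iterable(
        combinations(worlds, k) for k in range(len(worlds) + 1)
    )
    return all(dia(box(set(P), R), R) <= box(dia(set(P), R), R) for P in valuations)


# Two futures b and c of a that meet again in d, versus two futures that never meet.
DIAMOND = closure_rt(list("abcd"), [("a", "b"), ("a", "c"), ("b", "d"), ("c", "d")])
FORK = closure_rt(list("abc"), [("a", "b"), ("a", "c")])

valid_on_diamond = confluence_axiom_valid(DIAMOND)
valid_on_fork = confluence_axiom_valid(FORK)
```

On the fork the axiom fails at `a` when $p$ is true only at `b`, because the other future `c` can no longer reach a $p$-world.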

All technical topics in our survey of relational semantics make spatial sense. 
Bisimulation-invariance analysis of expressive power is very close to thinking 
about geometrical transformations and invariants (van Benthem, 2002), which 
goes back to the foundations of geometry in the 19th century. Also, modal 
logics can represent special styles of spatial reasoning, as we just saw. And 
issues of optimal language design have also emerged already. For instance, the 
above topological semantics for the basic modal language is still ‘local’, not in 
the sense of binary accessibility, but in being restricted to what is true in open 
neighborhoods of the current point. But many natural topological notions do 
not have this local character. E.g., a space is connected if it cannot be split into
two disjoint non-empty clopen sets. This global property of topological spaces cannot
be expressed in the basic modal language. But it can if one adds a universal 
modality. Finally, in all this, the issue of ‘balance’ returns. Modal systems are 
typically attempts at uncovering significant spatial structures, while providing 
low-complexity (decidable) calculi for reasoning with them. 
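
As an illustration of the last point, one formula commonly used in the literature to express connectedness with a universal modality $U$ is $U(\Diamond p \to \Box p) \to (Up \vee U\neg p)$; it is not stated in the text above and is taken here as an assumption. The Python sketch below checks it on two small finite spaces by running through all valuations of $p$. The example spaces and all names are hypothetical.

```python
from itertools import chain, combinations


def interior(A, topology):
    """Union of all open sets contained in A."""
    return frozenset(chain.from_iterable(U for U in topology if U <= A))


def connectedness_formula_valid(X, topology):
    """Check U(dia p -> box p) -> (U p or U not-p) under every valuation of p."""
    for k in range(len(X) + 1):
        for P in map(frozenset, combinations(sorted(X), k)):
            box_p = interior(P, topology)
            dia_p = X - interior(X - P, topology)
            antecedent = dia_p <= box_p      # dia p -> box p holds at every point
            consequent = P == X or not P     # p holds everywhere, or nowhere
            if antecedent and not consequent:
                return False
    return True


# A connected space (open sets form a chain) and a two-point discrete space.
CHAIN_X = frozenset({1, 2, 3})
CHAIN_T = [frozenset(), frozenset({1}), frozenset({1, 2}), CHAIN_X]
DISCRETE_X = frozenset({1, 2})
DISCRETE_T = [frozenset(), frozenset({1}), frozenset({2}), DISCRETE_X]

valid_on_chain = connectedness_formula_valid(CHAIN_X, CHAIN_T)
valid_on_discrete = connectedness_formula_valid(DISCRETE_X, DISCRETE_T)
```

The antecedent forces the region named by $p$ to be clopen, so the check succeeds exactly when the only clopen sets are the empty set and the whole space.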


1.3 Background: the many semantics of modal logic 


In a sense, spatial interpretations of modal logic challenge the existing order.
The now dominant relational semantics is really a product of the 1950s and
1960s. Its historical predecessors include algebraic semantics, which has been
used extensively in the technical literature in the form of boolean algebras
with operators. Venema, 2006 surveys the state of the art. Another earlier
semantics of modality is Gödel’s provability interpretation: a story which is
told with many new historical details in Artemov, 2006. The latter paper is
also an excellent broader source for mathematical uses of modal logic,
including a brief but useful account of spatial ones.

Clearly, our topological semantics is another 1930s challenger. This
modelling was particularly vivid and attractive for the language of
intuitionistic logic, where open sets may be viewed as information stages
concerning some underlying point, an interpretation which returns in much
greater sophistication in the topos semantics (see Ch. 8). The informational
interpretation of topology will not be a major concern in this chapter, but we
do mention topological semantics for epistemic logic briefly in Sec. 3.4, as it
raises some interesting new issues that do not become visible in the standard
binary relational modelling.

Also worth noting is that the topological semantics generalizes easily to the
so-called neighborhood models for modal languages. Here one just assumes
some binary relation $RxY$ associating worlds $x$ with sets of worlds $Y$ (not
necessarily open environments), with the same truth condition as above:

    $\Box\varphi$ is true at world $x$ iff there exists a set $Y$ with $RxY$
    all of whose members $y$ satisfy the formula $\varphi$.


Neighborhood models are used, e.g., to express output relations for
concurrent computation (Peleg, 1987), relations of ‘support’ or ‘dependence’
in logic programming (van Benthem, 1992), or relations of ‘power’ for forcing
a game to end in certain sets of outcomes, starting from some current node
(Pauly, 2001). Neighborhood semantics is an interesting counterpoint to
topological semantics, because it shows what happens further down the road.
The minimal modal logic now loses Distributivity, retaining only upward
monotonicity for the two modalities. There is still a notion of generalized
bisimulation, however, whose topological version will return in Sec. 1.4.
Finally, as to the balance, the complexity of satisfiability in neighborhood
semantics goes down from PSPACE-complete to NP. The latter is not true,
however, for the topological interpretation, as it retains the Distribution
Axiom, and its minimal modal logic S4 is still PSPACE-complete.
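
The loss of Distributivity and the survival of monotonicity can be observed on a three-world neighborhood model. In the Python sketch below the relation $RxY$ is represented as a dictionary from worlds to lists of neighbourhoods; this representation, the example model and all names are assumptions of the sketch.

```python
from itertools import chain, combinations


def box(A, N):
    """Worlds x that have some neighbourhood Y (a set with R x Y) contained in A."""
    return {x for x, neighbourhoods in N.items() if any(Y <= A for Y in neighbourhoods)}


def subsets(worlds):
    worlds = sorted(worlds)
    return [set(c) for c in chain.from_iterable(
        combinations(worlds, k) for k in range(len(worlds) + 1))]


# World 0 has two separate neighbourhoods {1} and {2}; worlds 1 and 2 have none.
N = {0: [{1}, {2}], 1: [], 2: []}
ALL = subsets(N)

# Upward monotonicity: A included in B implies box A included in box B.
monotone = all(box(A, N) <= box(B, N) for A in ALL for B in ALL if A <= B)
# Distributivity over conjunction: box(A and B) equals box A and box B.
distributive = all(box(A & B, N) == box(A, N) & box(B, N) for A in ALL for B in ALL)
```

In this model world 0 satisfies both boxed statements for the sets {1} and {2} separately, but no single neighbourhood fits inside their (empty) intersection, so `distributive` fails while `monotone` holds.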

All these different semantics are related. In particular, topological models 
are a special case of neighborhood models, and reflexive and transitive rela- 
tional models are a special case of topological models, as will be explained 
below. Neighborhood models are also related to algebraic ones, but we will 
forego such details in this chapter. Even so, these technical connections have 
their uses. For instance, topological semantics still includes binary relational 
semantics as the special case of ‘Alexandroff topologies’ (cf. Sec. 2.4.1). Thus, 
its generalizations of standard modal notions, such as bisimulations, may be 
viewed as a significant extension of the latter’s scope of applicability. Like- 
wise, we will see in Sec. 3.2 how a topological perspective actually clarifies 
issues in binary relational model theory, viz. the axiomatization problem for 
classes of products of modal frames. And finally, topological viewpoints have 
suggested new modal languages and structures such as the ‘Chu spaces’ of 
Pratt, 1999 (cf. van Benthem, 2000a on a first-order/modal style analysis of 
invariance and expressive power). 

We conclude with one illustration going the other way. Despite its immediate
spatial appeal, the topological semantics is also more complex than the binary
relational semantics. Instead of matching up one modal operator $\Box$ with one
quantifier, it matches it up with the $\exists\forall$ combination of two nested
quantifiers: ‘there exists an open set such that for all its elements...’. This
makes things less perspicuous, and it may in fact be the reason why the
topological interpretation, though historically first, was eventually supplanted
by the simpler Kanger-Hintikka-Kripke graph-based version. But this is not all
there is to be said. For one can analyze the above $\exists\forall$ in terms of
two consecutive modalities $\Diamond_{open}\Box_{element}$, where the first states
the existence of an open set, while the second accesses its elements. From this
point of view, the topological semantics lives inside a standard bimodal language
over two-sorted binary relational models having both points and sets as objects.
There are even mathematical reduction results showing precisely how far this
reduction goes. This amounts to a richer many-sorted view of space, where both
points and sets can be ‘objects’ on a par. This style of thinking, too, is in
line with the geometrical tradition, which has points, lines, and spaces as
objects on a par rather than ascending stages in some abstract set-theoretic
hierarchy. van Benthem, 1999 presents a defense of many-sorted reformulations of
complex modal semantics in temporal and spatial settings. The only framework that
we know of where this ‘unravelling’ into separate modal stages is taken seriously
in a spatial sense is the ‘topological logic of knowledge’ of Dabrowski et al.,
1996 (cf. also Ch. 6). The bulk of existing work, however, is squarely within the
standard topological framework, to which we now return.

















1.4 Modal logic and topology: first steps 


The topological interpretation explained above brings some interesting shifts 
in perspective. E.g., the crucial modal feature of locality in graph models 
now means that a formula is true at M, x iff it is true at x in any submodel 
whose domain is that of M restricted to some open neighborhood of x. Thus, 
regions are essential, and more generally, a modal approach provides a calculus 
of regions de-emphasizing constellations of points. As such, it is close to 
‘region versus points’ theories of time and space (Allen, 1983; Allen and Hayes, 
1985; van Benthem, 1983b; Randell et al., 1992). 

There are also subtle differences with modal logic that lie just below the 
surface. E.g., binary relational semantics validates unlimited distributivity: the 
modal box distributes over arbitrary infinite conjunctions of formulas. This is 
not so in topological semantics: 





Figure 5.3. Nested intervals refuting countable distributivity. 


Let the proposition letters $p_i$ be interpreted as the open intervals
$(-1/i, 1/i)$. Then the point 0 satisfies the countable conjunction of all
formulas $\Box p_i$. But 0 does not satisfy the related infinitary modal formula
with the box over the conjunction, since that intersection is just the set
$\{0\}$, whose topological interior is empty (see Fig. 5.3).


1.4.1 Expressive power: topo-bisimulation and topo-games. To 
understand the expressive power of a modal language, a suitable notion of 
bisimulation is needed. The following definition reflects the semantic definition 
of the modal operators and can be seen as composed of two sub-moves: one in 
which points are linked, and one in which containing opens are matched. 


DEFINITION 5.2 (TOPO-BISIMULATION) A topological bisimulation or simply a
topo-bisimulation between two topo-models $M = \langle X, \tau, \nu\rangle$ and
$M' = \langle X', \tau', \nu'\rangle$ is a non-empty relation
$T \subseteq X \times X'$ such that if $xTx'$ then:

1 $x \in \nu(p) \Leftrightarrow x' \in \nu'(p)$ for each $p \in P$

2 (forth): $x \in U \in \tau \to \exists U' \in \tau' : x' \in U'$ and $\forall y' \in U'\ \exists y \in U : yTy'$

3 (back): $x' \in U' \in \tau' \to \exists U \in \tau : x \in U$ and $\forall y \in U\ \exists y' \in U' : yTy'$.


A topo-bisimulation is total if its domain is X and its range is X’. If only the 
atomic clause (1) and the forth condition (2) hold, we say that the second model 
simulates the first. 
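For finite topo-models the three clauses of Definition 5.2 can be checked mechanically. The following is a minimal sketch under stated assumptions: the representation of a model as a triple `(points, opens, valuation)` and the function name `is_topo_bisimulation` are illustrative choices, not notation from this chapter.

```python
def is_topo_bisimulation(T, model1, model2, letters):
    """Check clauses (1)-(3) of Definition 5.2 on finite topo-models.

    Each model is a triple (points, opens, valuation): `opens` is a set of
    frozensets and `valuation` maps a letter to the set of points where it holds.
    """
    _, tau1, v1 = model1
    _, tau2, v2 = model2
    if not T:
        return False  # a topo-bisimulation is non-empty by definition
    for x, x2 in T:
        # (1) linked points agree on every proposition letter
        if any((x in v1[p]) != (x2 in v2[p]) for p in letters):
            return False
        # (2) forth: each open U around x is answered by an open U2 around x2
        # all of whose points are T-related to some point of U
        for U in tau1:
            if x in U and not any(
                x2 in U2 and all(any((y, y2) in T for y in U) for y2 in U2)
                for U2 in tau2
            ):
                return False
        # (3) back: the same with the roles of the two models swapped
        for U2 in tau2:
            if x2 in U2 and not any(
                x in U and all(any((y, y2) in T for y2 in U2) for y in U)
                for U in tau1
            ):
                return False
    return True


# Hypothetical example: a two-point trivial space versus a one-point space.
two = ({"a", "b"}, {frozenset(), frozenset({"a", "b"})}, {"p": {"a", "b"}})
one = ({"c"}, {frozenset(), frozenset({"c"})}, {"p": {"c"}})
print(is_topo_bisimulation({("a", "c"), ("b", "c")}, two, one, ["p"]))
```

The check is a brute-force search over all open sets, so it is only meant for small examples; it makes visible that the modal language cannot tell a trivial two-point space from a single point.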


REMARK 5.3 We point out that the forth condition in the definition of
topo-bisimulation is equivalent to the $T$-image of every open subset of
$\langle X, \tau\rangle$ being open, while the back condition is equivalent to
the $T^{-1}$-image of every open subset of $\langle X', \tau'\rangle$ being open.


Topo-bisimulation captures the adequate notion of ‘model equivalence’ for
the basic language $\mathcal{L}$ topologically interpreted. Evidence for this
comes from the following two results (cf. Aiello and van Benthem, 2002a).


THEOREM 5.4 Let $M = \langle X, \tau, \nu\rangle$ and
$M' = \langle X', \tau', \nu'\rangle$ be two topo-models, and $x \in X$ and
$x' \in X'$ be two topo-bisimilar points. Then for each modal formula $\varphi$
we have $M, x \models \varphi$ iff $M', x' \models \varphi$. That is, modal
formulas are invariant under topo-bisimulations.


THEOREM 5.5 Let $M$, $M'$ be two finite models, and $x \in X$, $x' \in X'$ be
such that for each $\varphi$ we have $M, x \models \varphi$ iff
$M', x' \models \varphi$. Then there exists a topo-bisimulation between $M$ and
$M'$ connecting $x$ and $x'$. That is, finite modally equivalent models are
topo-bisimilar.


Topo-bisimulations are coarsenings of the basic structural equivalence in 
topology: 
THEOREM 5.6 If two topological spaces $\langle X, \tau\rangle$ and
$\langle X', \tau'\rangle$ are homeomorphic, then for each valuation $\nu$ on
$\langle X, \tau\rangle$ there exists a valuation $\nu'$ on
$\langle X', \tau'\rangle$ such that the topo-models
$M = \langle X, \tau, \nu\rangle$ and $M' = \langle X', \tau', \nu'\rangle$ are
topo-bisimilar.


See Aiello and van Benthem, 2002a for details as well as connections with 
other topological notions of structural similarity such as homotopy. 

Topo-bisimulation is a standard model-theoretic tool for assessing expres- 
sivity of our language with respect to spatial patterns. Nevertheless, when 
comparing e.g. two image representations, it may still be too coarse. To refine 
the similarity matching, one can define a topological model comparison game 
TG(M, M',n) between two topo-models M, M’. The idea of the game is 
that two players challenge each other picking elements from the two models to 
compare. ‘Spoiler’ wins if he can show the models to be different, ‘Duplicator’ 
wins if he can show the models to be ‘similar’. Winning strategies for Dupli- 
cator in infinite games, requiring never-ending continued responses, match up 
precisely with topo-bisimulations. Furthermore, for finite-length games, games 
and modal formulas are connected by the Adequacy Theorem: 


THEOREM 5.7 (AIELLO AND VAN BENTHEM, 2002A) Duplicator has a winning strategy
in the topo-game $TG(M, M', n, x, x')$ iff $x$ and $x'$ satisfy the same
formulas of modal operator depth up to $n$ in their respective models $M$, $M'$.


Fig. 5.4 shows how many rounds Spoiler will need to distinguish positions on
cutlery. The number of rounds corresponds to the depth of a modal ‘difference
formula’ for the points under comparison. For instance, the single round in (a)
corresponds to $\Box p$ versus $\neg\Box p$, the two rounds in (b) correspond to
$\neg\Diamond\Box p$ versus $\Diamond\Box p$, while the three rounds in (c)
correspond to $\Diamond(p \wedge \neg\Diamond\Box p)$ versus
$\neg\Diamond(p \wedge \neg\Diamond\Box p)$. The formal definition of a game, and
an extensive discussion of plays and strategies are in Aiello and van Benthem,
2002a, while algorithms to compute winning strategies for comparing models are
illustrated in Aiello, 2002b.


Figure 5.4. Game rounds needed for distinguishing shapes: (a) 1 round, (b) 2 rounds, (c) 3 rounds.
Excursion: Our examples show how logical games match topological notions
very well. But there is a much earlier historical precedent. It is shown in van
Dalen, 2005 how Brouwer defined the crucial topological notion of dimension
in terms of the following game:

Player 1 chooses two disjoint closed subsets $A_1$, $B_1$ of the space. Player 2
then chooses a closed separating set $S_1$. Player 1 now chooses two disjoint
closed subsets of $S_1$. Player 2 then chooses a closed separating set $S_2$
inside $S_1$, etc.

Player 2 wins if a separating set $S_n$ is reached after $n$ rounds which is
totally disconnected. According to Brouwer, the dimension of a space is the
lowest natural number $n$ for which Player 2 has a winning strategy in the
$n$-round game.


1.4.2 Deductive power: topo-logics. Now consider logical validity 
and hence the general calculus for spatial reasoning in this language. The 
logic S4 is defined by the KT4 axioms and the rules of Modus Ponens and 
Necessitation (see Sec. 2.2 below). In the topological setting these principles 
translate into the following ones, with an informal explanation added: 





















































$\Box\top$   (N)   the whole space is open
$(\Box p \wedge \Box q) \leftrightarrow \Box(p \wedge q)$   (R)   open sets are closed under finite intersections
$\Box p \to \Box\Box p$   (4)   the interior operator is idempotent
$\Box p \to p$   (T)   the interior of any set is contained in the set.





Then the universally valid formulas topologically interpreted are precisely the 
theorems of S4. But McKinsey and Tarski, 1944 proved a much more striking 
result. 


THEOREM 5.8 S4 is complete for any dense-in-itself separable metric space.


Thus, S4 is also the logic of any Euclidean space $\mathbb{R}^n$ with the
standard topology. Mints, 1998 proved completeness of S4 for the Cantor space in
a particularly elegant manner.

More restricted spatial structures generate stronger modal logics on top of S4. 
Take, for instance, the serial subsets of the real line, being the finite unions of 
convex intervals (Aiello et al., 2003). These have been used to model life-spans 
of ‘events’ in linguistics and computer science. Now consider the following 
additional axioms: 





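(BD2) $(\neg p \wedge \Diamond p) \to \Diamond\Box p$
(BW2) $\neg(p \wedge q \wedge \Diamond(p \wedge \neg q) \wedge \Diamond(\neg p \wedge q) \wedge \Diamond(\neg p \wedge \neg q))$.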











These are complete for the serial sets. To give an impression of what is going
on, look at Fig. 5.5, with a serial set denoted by $p$, and take the axiom BD2.

Figure 5.5. A serial set of $\mathbb{R}$ and the subformulas defined by the axiom BD2.

In relational semantics, this axiom bounds the depth of the model by 2. In
topological semantics, it states that the points that are both in the complement
$\neg p$ of a region and in its closure $\Diamond p$ must be in the regular
closed portion $\Diamond\Box p$ of the region itself.

Similarly, one can look at interesting 2-dimensional topological spaces. Here 
is a modal axiom 












































(BD3) $\Diamond(\Box p_3 \wedge \Diamond(\Box p_2 \wedge \Diamond\Box p_1 \wedge \neg p_1) \wedge \neg p_2) \to p_3$


valid in the ‘rectangular serial’ sets of the plane $\mathbb{R}^2$. These special
structures are investigated in Aiello et al., 2003 and van Benthem et al., 2003.
The latter provides an axiomatization for logics of this sort for Euclidean
spaces of any dimension (see Sec. 2.6 below).


1.5 Modal logics of other spatial structures 


Our account so far may have suggested that modal logic of space must be 
about topology. But this is not the case at all. Moving on from topology to 
more ‘rigid’ spatial structures, modal logic returns just as well, though in new 
guises. For instance, consider affine geometry, where the major notion is a
ternary notion of betweenness $\beta(xyz)$, which says that point $y$ lies in
between points $x$ and $z$, allowing $y$ to be one of these endpoints. Now define
a binary betweenness modality <B>:


$M, x \models \langle B\rangle(\varphi, \psi)$ iff $\exists y, z : \beta(yxz)$ and $M, y \models \varphi$ and $M, z \models \psi$.


Again, this leads to very concrete spatial pictures. This time standard ge- 
ometrical figures can be described by modal means. Consider Fig. 5.6. Let 
the proposition letter p denote the set of three points on the left forming the 
vertices of a triangle. The next two phases of the picture show how the formula 
<B>(p, p) holds on the sides of the triangle, while the whole triangle, including 
its interior, is defined by the modal formula <B>(<B>(p, p), p). 
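The triangle example can be replayed on a small finite model. The sketch below is an illustration under stated assumptions: the model consists of the points of a hypothetical 5 x 5 integer grid rather than the real plane, and the helper names `between` and `B` are not from this chapter. On this grid the three vertices generate first the grid points on the sides and then all grid points of the triangle.

```python
from itertools import product


def between(y, x, z):
    """beta(y x z): x lies on the closed segment from y to z (integer points)."""
    cross = (x[0] - y[0]) * (z[1] - y[1]) - (x[1] - y[1]) * (z[0] - y[0])
    dot = (x[0] - y[0]) * (x[0] - z[0]) + (x[1] - y[1]) * (x[1] - z[1])
    # collinear with y and z, and not strictly outside the segment
    return cross == 0 and dot <= 0


def B(domain, phi, psi):
    """Extension of <B>(phi, psi): points between some phi-point and some psi-point."""
    return {x for x in domain if any(between(y, x, z) for y in phi for z in psi)}


domain = set(product(range(5), repeat=2))  # hypothetical grid model
p = {(0, 0), (4, 0), (0, 4)}               # the three vertices
sides = B(domain, p, p)                    # <B>(p, p)
triangle = B(domain, sides, p)             # <B>(<B>(p, p), p)
print(len(sides), len(triangle))
```

In the real plane every point of the triangle is reached in the second step; on a grid this depends on the chosen vertices, since the witnessing point on the opposite side must itself be a grid point.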

Figure 5.6. The formulas p, <B>(p, p) and <B>(<B>(p, p), p).

Clearly all of the earlier technical modal notions make sense once more.
For instance, we can study modal bisimulation between geometric figures, or
modal deduction about triangles or convex figures. We will study such issues
for geometrical modal logics later in Sec. 4. For the moment, we just note
how this formalism can express significant geometric facts in surprising ways.
Consider the following basic geometrical principle, known as ‘Pasch’s Axiom’
(see Fig. 5.7), written as follows in first-order notation:





$\forall t x y z u\, (\beta(xtu) \wedge \beta(yuz) \to \exists v : \beta(xvy) \wedge \beta(vtz))$





Figure 5.7. Pasch’s property. 


It says that any line drawn through a vertex of a triangle and continuing 
into its interior must cross the opposite side to that vertex at some point. This 
does not look modal at all, but in fact it is! Consider the following axiom of 
associativity for the betweenness modality: 


$\langle B\rangle(p, \langle B\rangle(q, r)) \to \langle B\rangle(\langle B\rangle(p, q), r)$.
We will see in Sec. 4 that: 
FACT 5.9 Modal Associativity corresponds to Pasch’s Axiom. 


At this stage a useful exercise for the reader would be to check how, unpacking 
the nested modalities <B> in the antecedent, Pasch’s Axiom is in fact precisely 
what is needed to see the validity of Associativity. In Sec. 4 and 5 we will 
develop these ideas further to also include metric geometry and eventually 
even linear algebra. 


1.6 Logical analysis of space once more 


Let us summarize the methodology of this chapter once more. Topologists
or geometers do not worry about formal systems: they just state what they
see in whatever formalism is at hand. Logicians, however, propose a trade-off:
specify a formal language restricting the notions one can talk about, and then 
see what complete logic comes out, perhaps even in the form of a decidable 
calculus. Tarski’s elementary geometry is still a paradigm for this approach, 
and so are other logics discussed in Ch. 2 of this handbook on first-order theories 
of polygons and of mereotopology. Incidentally, the spatial perspective also 
highlights quite different uses of a logical formalism. One is its descriptive role 
in defining spatial patterns, allowing us to describe these, check whether they 
hold in given situations, and compare different properties of spatial structures. 
Another is its deductive role as a calculus of reasoning about space, which 
is associated with other tasks, such as mathematical theorizing, information 
extraction from spatial databases, or reasoning by a robot trying to plan actions 
in a partially unknown environment. 

Used in either mode, modal languages are fragments of first-order ones, re- 
stricting expressive power even further, but promising better complexity. The 
very multiplicity of modal languages is an advantage here, as we can work at dif- 
ferent levels of structure, measured by different notions of invariance, whether 
topo-bisimulation, or some logical or geometrical strengthening thereof. This 
fits with the mathematical idea that space can be studied legitimately at various 
levels of detail. Finally, consider the issue of the modal balance between expres- 
sive power and computational complexity. Indeed, there are low-complexity 
modal logics for some spatial structures. But there are also some phenomena
showing that things can be complicated. For instance, in affine geometry,
while the minimal modal logic of our binary betweenness modality
$\langle B\rangle(\varphi, \psi)$ is decidable, this same language becomes
undecidable over the special class of associative relations that we just
associated with Pasch’s Axiom. The reason is that it can then straightforwardly
encode the word problem for semigroups.

Moreover, our two guiding examples from Tarski’s work do not point in the 
same direction in terms of the sources of their decidability. Modal Topology is 
indeed decidable for reasons of modal parsimony by abstract general methods 
having little to do with peculiarities of topological spaces. But Elementary 
Geometry is decidable not because its language has judiciously toned-down 
expressive power, but because its intended model of Euclidean Space is so rich 
that it happens to support a procedure of quantifier elimination, providing us 
with the decision algorithm, which is very special for this geometric setting. 
We refer to Ch. 9 for more accumulated evidence on complexity of logics for 
spatial reasoning. 


1.7 Contents of this chapter 


This concludes our introduction to modal languages of spatial structures. The 
rest of this chapter is organized as follows. Sec. 2 contains a more extensive 
formal treatment of topological models, leading up to the general completeness
theorem for topological models, and from there, to the landmark completeness
theorem for the reals using modern techniques. Following that, we
also discuss richer modal logics for more special topological structures, includ- 
ing placing restrictions on sets that can serve as values for propositions. Next, 
Sec. 3 surveys a number of more recent special topics in this area. These in- 
clude (a) alternative interpretations of the modalities in terms of the topological 
derivative, (b) combining modal logics for describing products of topological 
spaces, (c) language extensions that can express further topological structure, 
and finally (d) topological models for epistemic logic, with an excursion into 
fixed-point extensions of the language that can define various notions of com- 
mon knowledge. Sec. 4 is a discussion of modal languages for geometric struc- 
tures, starting with affine cases, and then moving to modal languages for metric 
relations of relative nearness. We also provide a comparison with first-order 
languages for these structures. Finally, Sec. 5 looks at modal logics for math- 
ematical morphology, which basically amounts to analyzing certain subsets of 
vector spaces. This topic also involves connections with modal ‘arrow logics’ 
for analyzing structures in relational algebra. Sec. 6 contains our conclusions. 


2. Modal logic and topology: basic results 
2.1 Topological preliminaries 


We start by surveying briefly the basic topological concepts that will be used 
throughout this chapter. They can be found in any textbook on general topology 
(see, e.g., Engelking, 1989; Kelley, 1975; Kuratowski, 1966). 


DEFINITION 5.10 A topological space is a pair
$\mathcal{X} = \langle X, \tau\rangle$, where $X$ is a nonempty set and $\tau$ is
a collection of subsets of $X$ satisfying the following three conditions:


1 $\emptyset, X \in \tau$;
2 If $U, V \in \tau$, then $U \cap V \in \tau$;
3 If $\{U_i\}_{i \in I} \subseteq \tau$, then $\bigcup_{i \in I} U_i \in \tau$.


The elements of $\tau$ are called open sets. The complements of open sets are
called closed sets. An open set containing $x \in X$ is called an open
neighborhood of $x$.

A family $\mathcal{B} \subseteq \tau$ is called a basis for the topology if every
open set can be represented as the union of elements of a subfamily of
$\mathcal{B}$. It is well known that a family $\mathcal{B}$ of subsets of $X$ is
a basis for some topology on $X$ iff (i) for each $x \in X$ there exists
$U \in \mathcal{B}$ such that $x \in U$, and (ii) for each $U, V \in \mathcal{B}$,
if $x \in U \cap V$, then there exists $W \in \mathcal{B}$ such that
$x \in W \subseteq U \cap V$.

For $A \subseteq X$, a point $x \in X$ is called an interior point of $A$ if
there is an open neighborhood $U$ of $x$ such that $U \subseteq A$. Let
$\mathrm{Int}(A)$ denote the set of interior points of $A$. Then it is easy to
see that $\mathrm{Int}(A)$ is the greatest open set contained in $A$, called the
interior of $A$. A point $x \in X$ is called a limit point of $A \subseteq X$ if
for each open neighborhood $U$ of $x$, the set $A \cap (U - \{x\})$ is nonempty.
The set of limit points of $A$ is called the derivative of $A$ and is denoted by
$d(A)$. Let $\mathrm{Cl}(A) = A \cup d(A)$. Then it is easy to see that
$x \in \mathrm{Cl}(A)$ iff $U \cap A$ is nonempty for each open neighborhood $U$
of $x$, and that $\mathrm{Cl}(A)$ is the least closed set containing $A$, called
the closure of $A$.

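Let Int and Cl denote the interior and closure operators of $\mathcal{X}$,
respectively. Then it is well known that the following are satisfied for each
$A, B \subseteq X$:


$\mathrm{Int}(X) = X$   $\mathrm{Cl}(\emptyset) = \emptyset$
$\mathrm{Int}(A \cap B) = \mathrm{Int}(A) \cap \mathrm{Int}(B)$   $\mathrm{Cl}(A \cup B) = \mathrm{Cl}(A) \cup \mathrm{Cl}(B)$
$\mathrm{Int}(A) \subseteq A$   $A \subseteq \mathrm{Cl}(A)$
$\mathrm{Int}(A) \subseteq \mathrm{Int}(\mathrm{Int}(A))$   $\mathrm{Cl}(\mathrm{Cl}(A)) \subseteq \mathrm{Cl}(A)$.


Moreover, there is a duality $\mathrm{Int}(A) = X - \mathrm{Cl}(X - A)$, and a
topological space can also be defined in terms of an interior operator or a
closure operator satisfying the above four conditions.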
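On a finite space the interior and closure operators, and the four conditions listed for them, can be checked by direct computation. The following is a minimal sketch; the three-point topology and the function names `interior` and `closure` are hypothetical choices made for illustration.

```python
from itertools import combinations


def interior(A, opens):
    """Union of all open sets contained in A, i.e. the greatest open subset of A."""
    result = set()
    for U in opens:
        if U <= A:
            result |= U
    return frozenset(result)


def closure(A, X, opens):
    """Cl(A) computed through the duality Cl(A) = X - Int(X - A)."""
    return X - interior(X - A, opens)


X = frozenset({1, 2, 3})
opens = {frozenset(), frozenset({1}), frozenset({1, 2}), X}  # a chain of opens
subsets = [frozenset(c) for r in range(len(X) + 1) for c in combinations(sorted(X), r)]

for A in subsets:
    # Int(A) is contained in A, and Int is idempotent
    assert interior(A, opens) <= A
    assert interior(interior(A, opens), opens) == interior(A, opens)
    for B in subsets:
        # Int distributes over finite intersections, Cl over finite unions
        assert interior(A & B, opens) == interior(A, opens) & interior(B, opens)
        assert closure(A | B, X, opens) == closure(A, X, opens) | closure(B, X, opens)
```

Read modally, these are exactly the S4 principles (N), (R), (T) and (4) for the interior operator, verified here on one small space.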

We also let $t(A)$ denote $X - d(X - A)$. Then $x \in t(A)$ iff there exists
an open neighborhood $U$ of $x$ such that $U \subseteq A \cup \{x\}$. We call
$t(A)$ the co-derivative of $A$. Let $d$ and $t$ denote the derivative and
co-derivative operators of $\mathcal{X}$, respectively. Then it is well known
that the following are satisfied for each $A, B \subseteq X$:


$d(A \cup B) = d(A) \cup d(B)$   $t(A \cap B) = t(A) \cap t(B)$
$d(d(A)) \subseteq A \cup d(A)$   $A \cap t(A) \subseteq t(t(A))$.


DEFINITION 5.11 Let $\mathcal{X}$ be a topological space and $A$ be a subset of $X$.
1 $A$ is called clopen if it is both closed and open.
2 $A$ is called dense if $\mathrm{Cl}(A) = X$.
3 $A$ is called boundary if $\mathrm{Int}(A) = \emptyset$.
4 $A$ is called dense-in-itself if $A \subseteq d(A)$.


For a topological space $\mathcal{X}$, the family
$\{U_i\}_{i \in I} \subseteq \tau$ is called an open cover of $X$ if
$\bigcup_{i \in I} U_i = X$.


DEFINITION 5.12 Let $\mathcal{X}$ be a topological space.
1 $\mathcal{X}$ is called discrete if every subset of $X$ is open.
2 $\mathcal{X}$ is called trivial if $\emptyset$ and $X$ are the only open subsets of $X$.
3 $\mathcal{X}$ is called dense-in-itself if $d(X) = X$.
4 $\mathcal{X}$ is called separable if there exists a countable dense subset of $X$.
5 $\mathcal{X}$ is called compact if every open cover of $X$ has a finite subcover.
6 $\mathcal{X}$ is called connected if $\emptyset$ and $X$ are the only clopen subsets of $X$.
7 $\mathcal{X}$ is called 0-dimensional if clopen subsets of $X$ form a basis for the
topology.
8 $\mathcal{X}$ is called extremally disconnected if the closure of each open subset of
$X$ is clopen.

In the next definition we recall the separation axioms $T_0$, $T_d$, $T_1$, and $T_2$.

DEFINITION 5.13 Let $\mathcal{X}$ be a topological space.


1 $\mathcal{X}$ is called a $T_0$-space if for each pair of different points there
exists an open set containing one and not containing the other.


2 $\mathcal{X}$ is called a $T_d$-space if for each $x \in X$ there exists an open
neighborhood $U$ of $x$ such that $\{x\}$ is closed in $U$. Equivalently,
$\mathcal{X}$ is a $T_d$-space iff $dd(A) \subseteq d(A)$.


3 $\mathcal{X}$ is called a $T_1$-space if for each pair $x, y \in X$ of different
points there exists an open set containing $x$ but not $y$. Equivalently,
$\mathcal{X}$ is a $T_1$-space iff each $\{x\}$ is closed in $X$.


4 $\mathcal{X}$ is called a $T_2$-space or a Hausdorff space if for each pair
$x, y \in X$ of different points there exist disjoint open neighborhoods of $x$
and $y$.


It is well known that every $T_2$-space is a $T_1$-space, that every $T_1$-space
is a $T_d$-space, and that every $T_d$-space is a $T_0$-space, but not vice versa.
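The difference between the separation axioms already shows up on two-point spaces. The sketch below, with hypothetical function names `is_T0`, `is_T1` and `is_T2`, tests three of the axioms on finite spaces; the two-point space with opens $\emptyset$, $\{1\}$, $\{0, 1\}$ separates the points in one direction only.

```python
def is_T0(X, opens):
    """Some open set contains exactly one of any two different points."""
    return all(any((x in U) != (y in U) for U in opens)
               for x in X for y in X if x != y)


def is_T1(X, opens):
    """For every ordered pair x != y some open set contains x but not y."""
    return all(any(x in U and y not in U for U in opens)
               for x in X for y in X if x != y)


def is_T2(X, opens):
    """Any two different points have disjoint open neighborhoods."""
    return all(any(x in U and y in V and not (U & V) for U in opens for V in opens)
               for x in X for y in X if x != y)


X = {0, 1}
one_sided = {frozenset(), frozenset({1}), frozenset({0, 1})}
discrete = {frozenset(), frozenset({0}), frozenset({1}), frozenset({0, 1})}
print(is_T0(X, one_sided), is_T1(X, one_sided), is_T2(X, discrete))
```

The quantification over ordered pairs in `is_T1` is what distinguishes it from `is_T0`: both directions of separation are required.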

Let $\mathcal{X}$ and $\mathcal{Y}$ be topological spaces. We call $\mathcal{Y}$ a
subspace of $\mathcal{X}$ if $Y \subseteq X$ and $U$ is an open subset of $Y$ iff
there exists an open subset $V$ of $X$ such that $U = V \cap Y$.


DEFINITION 5.14 Let $\mathcal{X}$ and $\mathcal{Y}$ be topological spaces and
$f : X \to Y$ be a map.

1 $f$ is called continuous if $U$ open in $Y$ implies $f^{-1}(U)$ is open in $X$.

2 $f$ is called open if $U$ open in $X$ implies $f(U)$ is open in $Y$.

3 $f$ is called interior if it is both continuous and open.


We call $\mathcal{Y}$ a continuous image of $\mathcal{X}$ if there exists a
continuous map from $X$ onto $Y$. Open and interior images of $\mathcal{X}$ are
defined analogously.
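For finite spaces the three kinds of maps in Definition 5.14 can be tested by enumerating open sets. This is a small sketch under assumptions: maps are represented as Python dictionaries, and the spaces and the names `is_continuous` and `is_open_map` are hypothetical.

```python
def preimage(f, U, X):
    """Inverse image of U under the map f, given as a dict on X."""
    return frozenset(x for x in X if f[x] in U)


def image(f, U):
    """Direct image of U under f."""
    return frozenset(f[x] for x in U)


def is_continuous(f, X, opens_X, opens_Y):
    return all(preimage(f, U, X) in opens_X for U in opens_Y)


def is_open_map(f, opens_X, opens_Y):
    return all(image(f, U) in opens_Y for U in opens_X)


X = frozenset({1, 2, 3})
opens_X = {frozenset(), frozenset({1}), frozenset({1, 2}), X}
Y = frozenset({"a", "b"})
opens_Y = {frozenset(), frozenset({"a"}), Y}

f = {1: "a", 2: "b", 3: "b"}  # both continuous and open, hence interior
g = {1: "b", 2: "a", 3: "a"}  # neither continuous nor open
print(is_continuous(f, X, opens_X, opens_Y) and is_open_map(f, opens_X, opens_Y))
```

Since `f` is also onto, the two-point space here is an interior image of the three-point one in the sense just defined.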

Let $\{\mathcal{X}_i\}_{i \in I}$ be a family of pairwise disjoint topological
spaces. We define the topological sum of $\{\mathcal{X}_i\}_{i \in I}$ as the pair
$\bigoplus_{i \in I} \mathcal{X}_i = \langle \bigcup_{i \in I} X_i, \tau\rangle$,
where $U \in \tau$ iff $U \cap X_i \in \tau_i$ for each $i \in I$. If the members
of the family $\{\mathcal{X}_i\}_{i \in I}$ are not pairwise disjoint, then the
topological sum is defined using disjoint union instead of set-theoretic union.


2.2 Relational semantics and some modal logics 
Before exploring topological interpretations in more depth, we recall some
basic facts from relational semantics.


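2.2.1 The uni-modal case. We recall that a frame is a relational structure
$\mathfrak{F} = \langle W, R\rangle$ such that $W$ is a nonempty set and $R$ is a
binary relation on $W$. A valuation of the basic modal language $\mathcal{L}$ in
$\mathfrak{F}$ is a function $\nu$ from the set $P$ of propositional letters of
$\mathcal{L}$ to the powerset of $W$. A pair $M = \langle \mathfrak{F}, \nu\rangle$
is called a model (based on $\mathfrak{F}$). Given a model $M$, we define when a
formula $\varphi$ is true at a point $w \in W$ by induction on the length of
$\varphi$:


• $w \models p$ iff $w \in \nu(p)$;
• $w \models \neg\varphi$ iff not $w \models \varphi$;
• $w \models \varphi \wedge \psi$ iff $w \models \varphi$ and $w \models \psi$;
• $w \models \Box\varphi$ iff $(\forall v \in W)(wRv \to v \models \varphi)$;

and hence also

• $w \models \Diamond\varphi$ iff $(\exists v \in W)(wRv \ \&\ v \models \varphi)$.


We say that $\varphi$ is true in $M$ if $\varphi$ is true at every point in $W$,
and that $\varphi$ is valid in $\mathfrak{F}$ if $\varphi$ is true in every model
$M$ based on $\mathfrak{F}$. Finally, we say that $\varphi$ is valid in a class of
frames if $\varphi$ is valid in every member of the class.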
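The truth clauses translate directly into a recursive evaluator for finite models. The sketch below is illustrative only: formulas are encoded as nested tuples such as `('box', 'p')`, an encoding chosen here and not used in the chapter, and the three-point model is hypothetical.

```python
def holds(model, w, phi):
    """Truth of phi at point w in a finite relational model (W, R, v)."""
    W, R, v = model
    if isinstance(phi, str):          # proposition letter
        return w in v[phi]
    op = phi[0]
    if op == "not":
        return not holds(model, w, phi[1])
    if op == "and":
        return holds(model, w, phi[1]) and holds(model, w, phi[2])
    if op == "box":                   # true at all R-successors
        return all(holds(model, u, phi[1]) for u in W if (w, u) in R)
    if op == "dia":                   # true at some R-successor
        return any(holds(model, u, phi[1]) for u in W if (w, u) in R)
    raise ValueError(f"unknown connective: {op!r}")


# A hypothetical chain 0 -> 1 -> 2 that is neither reflexive nor transitive.
M = ({0, 1, 2}, {(0, 1), (1, 2)}, {"p": {1}})
print(holds(M, 0, ("box", "p")), holds(M, 0, ("box", ("box", "p"))))
```

In this model the box of p holds at 0 while its double box fails there, which is possible only because the underlying frame is not transitive.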

Below we list several standard modal logics and their axiomatizations. 


DEFINITION 5.15


1 The basic logic K of all frames is axiomatized by the axiom:

(K) $\Box(p \to q) \to (\Box p \to \Box q)$,

with Modus Ponens and Necessitation as the only rules of inference:

(MP) $\dfrac{\varphi \quad \varphi \to \psi}{\psi}$   (N) $\dfrac{\varphi}{\Box\varphi}$


2 The logic T of reflexive frames is axiomatized by adding to K the axiom:

(T) $\Box p \to p$


3 The logic K4 of transitive frames is axiomatized by adding to K the axiom:

(4) $\Box p \to \Box\Box p$


4 The logic S4 of reflexive and transitive frames is axiomatized by adding
to K the axioms (T) and (4).


5 The logic S5 of reflexive, transitive, and symmetric frames is axiomatized
by adding to S4 the axiom:

(B) $p \to \Box\Diamond p$











Each logic listed above is complete with respect to its relational semantics. 
In fact, each of these logics is complete with respect to its finite frames, and 
therefore has the finite model property (see, e.g., Blackburn et al., 2001). 


2.2.2 Multi-modal cases. Multi-modal languages are conspicuous in 
modern applications of modal logic, which often call for combining operators. 
This happens in a spatial setting, e.g., when describing different topologies 
at the same time. Such combinations arise by performing certain operations 
on component logics. Here we recall several basic facts about ‘fusion’ and 
‘product’ of uni-modal logics. Most of this material can be found in the textbook 
Gabbay et al., 2003. 














The fusion: Let $\mathcal{L}_{\Box_1\Box_2}$ be a bimodal language with modal
operators $\Box_1$ and $\Box_2$.
































DEFINITION 5.16 The fusion of K with itself, denoted by
$\mathbf{K} \oplus \mathbf{K}$, is the least set of formulas of
$\mathcal{L}_{\Box_1\Box_2}$ containing the axiom (K) for both $\Box_1$ and
$\Box_2$, and closed under Modus Ponens, $\Box_1$-Necessitation, and
$\Box_2$-Necessitation.






























































The $\mathbf{K} \oplus \mathbf{K}$-frames are triples
$\mathfrak{F} = \langle W, R_1, R_2\rangle$, where $W$ is a nonempty set and
$R_1$ and $R_2$ are binary relations on $W$. It is known that
$\mathbf{K} \oplus \mathbf{K}$ is complete with respect to this semantics; in
fact, it has the finite model property.

We are interested in the fusion of S4 with itself, which we denote by
$\mathbf{S4} \oplus \mathbf{S4}$. It is defined similarly to the fusion of K with
itself. The $\mathbf{S4} \oplus \mathbf{S4}$-frames are triples
$\mathfrak{F} = \langle W, R_1, R_2\rangle$, where $W$ is a nonempty set and
$R_1$ and $R_2$ are reflexive and transitive. We call such a frame rooted if
there is a $w \in W$ such that for all $v \in W$ it holds that
$w (R_1 \cup R_2)^* v$, where $(R_1 \cup R_2)^*$ is the transitive closure of
$R_1 \cup R_2$. It is known that $\mathbf{S4} \oplus \mathbf{S4}$ is complete
with respect to this semantics; in fact, $\mathbf{S4} \oplus \mathbf{S4}$ is
complete with respect to finite rooted $\mathbf{S4} \oplus \mathbf{S4}$-frames.

Let $T_{2,2}$ denote the full infinite quaternary tree each of whose nodes is
$R_1$-related to two of its four immediate successors and $R_2$-related to the
other two (see Fig. 5.8). We will make use of the next proposition in
Sec. 3.2.2.
Figure 5.8. $T_{2,2}$. The solid lines represent $R_1$ and the dashed lines
represent $R_2$. The dotted lines at the final nodes indicate that the pattern
repeats infinitely.


PROPOSITION 5.17 (VAN BENTHEM ET AL., 2005) $\mathbf{S4} \oplus \mathbf{S4}$ is
complete with respect to $T_{2,2}$.


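The product: For two K-frames $\mathfrak{F} = \langle W, S\rangle$ and
$\mathfrak{G} = \langle V, T\rangle$, define the product frame
$\mathfrak{F} \times \mathfrak{G}$ to be the frame
$\langle W \times V, R_1, R_2\rangle$, where for $w, w' \in W$ and $v, v' \in V$:

$(w, v) R_1 (w', v')$ iff $wSw'$ and $v = v'$

$(w, v) R_2 (w', v')$ iff $w = w'$ and $vTv'$.


The frame $\mathfrak{F} \times \mathfrak{G}$ can be viewed as a
$\mathbf{K} \oplus \mathbf{K}$-frame by interpreting the modalities $\Box_1$ and
$\Box_2$ of $\mathcal{L}_{\Box_1\Box_2}$ as follows.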























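$(w, v) \models \Box_1\varphi$ iff $\forall (w', v')$, if $(w, v) R_1 (w', v')$ then $(w', v') \models \varphi$
$(w, v) \models \Box_2\varphi$ iff $\forall (w', v')$, if $(w, v) R_2 (w', v')$ then $(w', v') \models \varphi$.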























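Let $\mathbf{K} \times \mathbf{K}$ denote the logic of products of K-frames. It
is well known that $\mathbf{K} \times \mathbf{K}$ is axiomatized by adding the
following two axioms to the fusion $\mathbf{K} \oplus \mathbf{K}$:

com = $\Box_1\Box_2 p \leftrightarrow \Box_2\Box_1 p$

chr = $\Diamond_1\Box_2 p \to \Box_2\Diamond_1 p$.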





In a similar fashion we define the product of two S4-frames. Let
$\mathbf{S4} \times \mathbf{S4}$ denote the logic of products of S4-frames. As
with $\mathbf{K} \times \mathbf{K}$, the product logic
$\mathbf{S4} \times \mathbf{S4}$ is axiomatized by adding com and chr to the
fusion $\mathbf{S4} \oplus \mathbf{S4}$.
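That com and chr hold on product frames, but may fail on arbitrary bimodal frames, can be confirmed by brute force on small examples. The sketch below is illustrative and rests on stated assumptions: it works with extensions of formulas (sets of points) instead of syntax, it tries every valuation of the single letter p, and the function names and the sample frames are hypothetical.

```python
from itertools import combinations, product


def product_frame(F, G):
    """Product of frames F = (W, S) and G = (V, T) as (points, R1, R2)."""
    (W, S), (V, T) = F, G
    points = set(product(W, V))
    R1 = {((w, v), (w2, v)) for (w, w2) in S for v in V}   # move in the first coordinate
    R2 = {((w, v), (w, v2)) for w in W for (v, v2) in T}   # move in the second coordinate
    return points, R1, R2


def box(points, R, A):
    return {x for x in points if all(y in A for (x2, y) in R if x2 == x)}


def dia(points, R, A):
    return {x for x in points if any(y in A for (x2, y) in R if x2 == x)}


def com_and_chr_valid(points, R1, R2):
    """Test com and chr for every possible extension A of the letter p."""
    pts = sorted(points)
    for r in range(len(pts) + 1):
        for A in map(set, combinations(pts, r)):
            if box(points, R1, box(points, R2, A)) != box(points, R2, box(points, R1, A)):
                return False  # com fails
            if not dia(points, R1, box(points, R2, A)) <= box(points, R2, dia(points, R1, A)):
                return False  # chr fails
    return True


F = ({0, 1}, {(0, 1)})
G = ({0, 1}, {(0, 0), (0, 1)})
print(com_and_chr_valid(*product_frame(F, G)))
```

A frame such as the one with points 0, 1, 2 and relations `{(0, 1)}` and `{(1, 2)}` is a fusion frame but not a product, and the same test rejects it.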


2.3 Interpreting $\Box$ as interior and $\Diamond$ as closure


Let $M = \langle \mathcal{X}, \nu\rangle$ be a topo-model, where
$\mathcal{X} = \langle X, \tau\rangle$ is a topological space and
$\nu : P \to \mathcal{P}(X)$ is a valuation. We gave the inductive definition of
when a formula $\varphi$ is true at a point $x$ of the model $M$ in Definition
5.1 of Sec. 1.1. The $\Box$ and $\Diamond$ clauses of Definition 5.1 imply that
if $\varphi$ is interpreted as a subset $A$ of a topological space $\mathcal{X}$,
then $\Box\varphi$ stands for $\mathrm{Int}(A)$ and $\Diamond\varphi$ for
$\mathrm{Cl}(A)$. Either notion can be used as a primitive. In what follows we
emphasize one or the other, depending on the ease of exposition.
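Reading the box as interior and the diamond as closure gives a direct way to compute the set of points where a formula holds in a finite topo-model. The following sketch makes that concrete; the tuple encoding of formulas, the function name `extension`, and the three-point space are assumptions made for illustration.

```python
def extension(phi, X, opens, v):
    """Set of points of a finite topo-model at which phi is true."""
    if isinstance(phi, str):                      # proposition letter
        return frozenset(v[phi])
    op = phi[0]
    if op == "not":
        return X - extension(phi[1], X, opens, v)
    if op == "and":
        return extension(phi[1], X, opens, v) & extension(phi[2], X, opens, v)
    if op == "box":                               # interior of the extension
        A = extension(phi[1], X, opens, v)
        return frozenset().union(*(U for U in opens if U <= A))
    if op == "dia":                               # closure, via duality with box
        return X - extension(("box", ("not", phi[1])), X, opens, v)
    raise ValueError(f"unknown connective: {op!r}")


X = frozenset({1, 2, 3})
opens = {frozenset(), frozenset({1}), frozenset({1, 2}), X}
v = {"p": {1, 3}}
print(sorted(extension(("box", "p"), X, opens, v)))
print(sorted(extension(("dia", "p"), X, opens, v)))
```

Here the diamond is taken as the defined operator, which is one of the two equally good choices of primitive mentioned in the text.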


DEFINITION 5.18 We say that $\varphi$ is true in
$M = \langle \mathcal{X}, \nu\rangle$ if $\varphi$ is true at every $x \in X$. We
say that $\varphi$ is valid in $\mathcal{X}$ if $\varphi$ is true in every model
based on $\mathcal{X}$. Finally, we say that $\varphi$ is valid in a class of
topological spaces if $\varphi$ is valid in every member of the class.


EXAMPLE 5.19 Let Top denote the class of all topological spaces.


1 First we show that (T) is valid in Top. Let $\mathcal{X} \in$ Top,
$M = \langle \mathcal{X}, \nu\rangle$ be a topological model, and
$x \models \Box p$ for $x \in X$. Then there exists an open neighborhood $U$ of
$x$ such that $y \models p$ for each $y \in U$. In particular, since $x \in U$,
we obtain that $x \models p$.


2 Next we show that (4) is valid in Top. Let $\mathcal{X} \in$ Top,
$M = \langle \mathcal{X}, \nu\rangle$ be a topological model, and
$x \models \Box p$ for $x \in X$. Then there exists an open neighborhood $U$ of
$x$ such that $y \models p$ for each $y \in U$. But then $y \models \Box p$ for
each $y \in U$, since $U$ is also an open neighborhood of each of its points,
implying that $x \models \Box\Box p$.


3 Now we show that (K) is valid in Top. Let $\mathcal{X} \in$ Top,
$M = \langle \mathcal{X}, \nu\rangle$ be a topological model, and for $x \in X$
we have $x \models \Box(p \to q)$ and $x \models \Box p$. Then there exist open
neighborhoods $U$ and $V$ of $x$ such that $y \models p \to q$ for each $y \in U$
and $z \models p$ for each $z \in V$. Let $W = U \cap V$. Then $W$ is an open
neighborhood of $x$ and for each $w \in W$ we have $w \models p \to q$ and
$w \models p$. Therefore, $w \models q$ for each $w \in W$, implying that
$x \models \Box q$.


4 Finally, we show that the necessitation rule preserves validity. If
$\Box\varphi$ is not valid, then there exist a topological model
$M = \langle \mathcal{X}, \nu\rangle$ and $x \in X$ such that
$x \not\models \Box\varphi$. Since $X$ itself is an open neighborhood of $x$,
there exists $y \in X$ such that $y \not\models \varphi$, implying that $\varphi$
is not valid.

















Consequently, we obtain that the modal logic S4 is sound with respect to
interpreting $\Diamond$ as closure. In fact, as shown in McKinsey and Tarski,
1944, S4 is also complete with respect to this semantics.


2.4 Basic topo-completeness of S4 


As we already pointed out, S4 is sound with respect to interpreting $\Diamond$ as
the closure operator of a topological space. We are ready to show that S4 is in
fact complete with respect to this semantics. But first we discuss the well-known
connection between relational and topological semantics of S4 (Aiello et al.,
2003; Bezhanishvili and Gehrke, 2005).
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2.4.1 Connection with relational semantics of S4. 


DEFINITION 5.20 A topological space X is called an Alexandroff space if 
the intersection of any family of open subsets of X is again open. 


Equivalently, $\mathcal{X}$ is Alexandroff iff every $x \in X$ has a least open neighborhood.
There is a close connection between Alexandroff spaces and S4-frames.
Suppose $\mathfrak{F} = (X, R)$ is an S4-frame. A subset $A$ of $X$ is called an upset of $\mathfrak{F}$ if
$x \in A$ and $xRy$ imply $y \in A$. Dually, $A$ is called a downset if $x \in A$ and $yRx$
imply $y \in A$.

For a given S4-frame $\mathfrak{F} = (X, R)$, we define the topology $\tau_R$ on $X$ by
declaring the upsets of $\mathfrak{F}$ to be open. Then the downsets of $\mathfrak{F}$ turn out to be
closed, and it is routine to verify that the obtained space is Alexandroff, that a
least neighborhood of $x \in X$ is $R(x) = \{y \in X : xRy\}$, that the closure of a
set $A \subseteq X$ is

$R^{-1}(A) = \{x \in X : \exists y \in A \text{ with } xRy\}$,

and that the interior of $A$ is

$X - R^{-1}(X - A) = \{x \in X : (\forall y \in X)(xRy \to y \in A)\}$.
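
For a finite frame these formulas are directly computable. The Python sketch below is an illustration under stated assumptions: the 2-fork frame used as data and all helper names are chosen here for the example and are not taken from the text. It computes $R(x)$, the closure $R^{-1}(A)$, the interior, and the family of upsets.

```python
from itertools import combinations

def reflexive_transitive_closure(points, pairs):
    """Smallest preorder on `points` containing `pairs`, so that (points, R) is an S4-frame."""
    rel = set(pairs) | {(x, x) for x in points}
    changed = True
    while changed:
        changed = False
        for (x, y) in list(rel):
            for (u, z) in list(rel):
                if y == u and (x, z) not in rel:
                    rel.add((x, z))
                    changed = True
    return rel

def successors(x, rel):
    """R(x) = {y : x R y}, the least open neighborhood of x."""
    return frozenset(y for (u, y) in rel if u == x)

def closure(a, points, rel):
    """R^{-1}(A) = {x : x R y for some y in A}."""
    return frozenset(x for x in points if successors(x, rel) & a)

def interior(a, points, rel):
    """{x : every R-successor of x lies in A}."""
    return frozenset(x for x in points if successors(x, rel) <= a)

def upsets(points, rel):
    """All R-upward closed subsets, i.e. the open sets of the topology tau_R."""
    pts = sorted(points)
    family = []
    for r in range(len(pts) + 1):
        for c in combinations(pts, r):
            s = frozenset(c)
            if all(successors(x, rel) <= s for x in s):
                family.append(s)
    return family

# Hypothetical example: a root "r" below two incomparable points "a" and "b".
POINTS = {"r", "a", "b"}
R = reflexive_transitive_closure(POINTS, {("r", "a"), ("r", "b")})
OPENS = upsets(POINTS, R)
```

In this example the closure of {a} is {r, a}, since r sees a, while the interior of {r, a} is only {a}, since r also sees b.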


Conversely, for a topological space $\mathcal{X} = (X, \tau)$ we define the specialization order on $X$
by $x R_\tau y$ iff $x \in \mathrm{Cl}(y)$. Then it is routine to check that the specialization order
is reflexive and transitive, and that it is a partial order iff $\mathcal{X}$ is $T_0$. Moreover,
one can easily check that $R = R_{\tau_R}$, that $\tau \subseteq \tau_{R_\tau}$, and that $\tau = \tau_{R_\tau}$ iff $\mathcal{X}$ is
Alexandroff.
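
The round trip from a finite topology to its specialization order and back can be sketched as follows. This is a minimal Python illustration, with the Sierpinski space as hypothetical test data and helper names invented for the example; since every finite space is Alexandroff, the topology is expected to be recovered exactly.

```python
from itertools import combinations

def closure_of_point(y, points, opens):
    """Cl({y}): the points all of whose open neighborhoods contain y."""
    return frozenset(x for x in points if all(y in u for u in opens if x in u))

def specialization_order(points, opens):
    """The relation x R y iff x lies in Cl({y})."""
    return {(x, y) for y in points for x in closure_of_point(y, points, opens)}

def upset_topology(points, rel):
    """The topology whose opens are exactly the upsets of `rel`."""
    pts = sorted(points)
    family = set()
    for r in range(len(pts) + 1):
        for c in combinations(pts, r):
            s = frozenset(c)
            if all(y in s for (x, y) in rel if x in s):
                family.add(s)
    return family

# Hypothetical example: the Sierpinski space on {0, 1} with {1} open.
SIERPINSKI_POINTS = {0, 1}
SIERPINSKI_OPENS = {frozenset(), frozenset({1}), frozenset({0, 1})}
R_TAU = specialization_order(SIERPINSKI_POINTS, SIERPINSKI_OPENS)
ROUND_TRIP = upset_topology(SIERPINSKI_POINTS, R_TAU)
```

Here the order relates 0 to 1 but not conversely, so it is a partial order, in line with the space being $T_0$.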

These observations immediately imply that there is a 1-1 correspondence between
Alexandroff spaces and S4-frames, and between Alexandroff $T_0$-spaces
and partially ordered S4-frames. Since every finite topological space is an
Alexandroff space, this gives a 1-1 correspondence between finite topological
spaces and finite S4-frames, and between finite $T_0$-spaces and finite partially
ordered S4-frames. It is straightforward to see that this also implies a 1-1
correspondence between continuous maps and order-preserving maps, as well
as between interior maps and p-morphisms. As a consequence of all this, we
obtain the following:


COROLLARY 5.21 Every normal extension of S4 that is complete with re- 
spect to relational semantics is also complete with respect to topological 
semantics. 


2.4.2 Canonical topo-model of S4. Corollary 5.21 says that standard 
modal models are a particular case of general topological semantics. Hence, 
the known completeness of S4 plus the topological soundness of its axioms 
immediately give us general topological completeness. Even so, we now give 
a direct model-theoretic proof of this result, taken from Aiello et al., 2003. 
It is closely related to the standard modal Henkin construction and is much
like completeness proofs for neighborhood semantics (Chellas, 1980), but with
some nice topological twists. 


DEFINITION 5.22


1 Call a set $\Gamma$ of formulas of $\mathcal{L}$ (S4-)consistent if for no finite set
$\{\varphi_1, \ldots, \varphi_n\} \subseteq \Gamma$ we have that S4 $\vdash \neg(\varphi_1 \wedge \cdots \wedge \varphi_n)$.


2 A consistent set of formulas $\Gamma$ is called maximally consistent if there is
no consistent set of formulas properly containing $\Gamma$.


It is well known that $\Gamma$ is maximally consistent iff, for each formula $\varphi$ of $\mathcal{L}$,
either $\varphi \in \Gamma$ or $\neg\varphi \in \Gamma$, but not both. Now we define a topological space out
of maximally consistent sets of formulas.


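DEFINITION 5.23 (CANONICAL TOPOLOGICAL SPACE) The canonical topological
space is the pair $\mathcal{X}^{\mathcal{L}} = (X^{\mathcal{L}}, \tau^{\mathcal{L}})$ where:


■ $X^{\mathcal{L}}$ is the set of all maximally consistent sets;


■ $\tau^{\mathcal{L}}$ is the set generated by arbitrary unions of the following basic sets
$B^{\mathcal{L}} = \{\widehat{\Box\varphi} : \varphi \text{ is any formula}\}$, where $\widehat{\varphi} =_{\mathrm{def}} \{x \in X^{\mathcal{L}} : \varphi \in x\}$. In
other words, basic sets are the families of the form $U_\varphi = \{x \in X^{\mathcal{L}} : \Box\varphi \in x\}$.


We first check that $\mathcal{X}^{\mathcal{L}}$ is indeed a topological space.


LEMMA 5.24 $B^{\mathcal{L}}$ forms a basis for the topology.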


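Proof We only need to show the following two properties:


■ For each $U_\varphi, U_\psi \in B^{\mathcal{L}}$ and each $x \in U_\varphi \cap U_\psi$, there is $U_\chi \in B^{\mathcal{L}}$ such
that $x \in U_\chi \subseteq U_\varphi \cap U_\psi$;


■ For each $x \in X^{\mathcal{L}}$, there is $U_\varphi \in B^{\mathcal{L}}$ such that $x \in U_\varphi$.


The necessitation rule implies that $\Box\top \in x$ for each $x$. Hence, $X^{\mathcal{L}} = \widehat{\Box\top}$,
and so the second item is satisfied. As for the first item, thanks to the axiom
(K), one can easily check that $\widehat{\Box(\varphi \wedge \psi)} = \widehat{\Box\varphi} \cap \widehat{\Box\psi}$. Hence, $U_\varphi \cap U_\psi \in B^{\mathcal{L}}$,
and so $B^{\mathcal{L}}$ is closed under finite intersections; whence, the first property is
satisfied. QED

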
Next we define the canonical topo-model. 


DEFINITION 5.25 (CANONICAL TOPO-MODEL) The canonical topo-model
is the pair $M^{\mathcal{L}} = (\mathcal{X}^{\mathcal{L}}, \nu^{\mathcal{L}})$ where:


■ $\mathcal{X}^{\mathcal{L}}$ is the canonical topological space;

■ $\nu^{\mathcal{L}}(p) = \{x \in X^{\mathcal{L}} : p \in x\}$.


The valuation $\nu^{\mathcal{L}}$ equates truth of a propositional letter at a maximally
consistent set with its membership in that set. We now show this harmony between
the two viewpoints lifts to all formulas.


LEMMA 5.26 (TRUTH LEMMA) For all modal formulas $\varphi$,
$M^{\mathcal{L}}, x \models \varphi$ iff $x \in \widehat{\varphi}$.


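Proof Induction on the complexity of $\varphi$. The base case follows from the
definition above. The case of the booleans follows from the following well-known
identities for maximally consistent sets:


■ $\widehat{\neg\varphi} = X^{\mathcal{L}} - \widehat{\varphi}$;

■ $\widehat{\varphi \wedge \psi} = \widehat{\varphi} \cap \widehat{\psi}$.


The interesting case is that of the modal operator $\Box$. We do the two relevant
implications separately, starting with the easy one.

$\Leftarrow$ 'From membership to truth.' Suppose $x \in \widehat{\Box\varphi}$. By definition, $\widehat{\Box\varphi}$ is a
basic set, hence open. Moreover, thanks to the axiom (T), we have $\widehat{\Box\varphi} \subseteq \widehat{\varphi}$.
Therefore, there exists an open neighborhood $U = \widehat{\Box\varphi}$ of $x$ such that $y \in \widehat{\varphi}$
for any $y \in U$, and by the induction hypothesis, $M^{\mathcal{L}}, y \models \varphi$. Thus
$M^{\mathcal{L}}, x \models \Box\varphi$.

$\Rightarrow$ 'From truth to membership.' Suppose $M^{\mathcal{L}}, x \models \Box\varphi$. Then there exists
a basic set $\widehat{\Box\psi} \in B^{\mathcal{L}}$ such that $x \in \widehat{\Box\psi}$ and $M^{\mathcal{L}}, y \models \varphi$ for all $y \in \widehat{\Box\psi}$. By
the induction hypothesis, $y \in \widehat{\varphi}$ for all $y \in \widehat{\Box\psi}$. Therefore, $\widehat{\Box\psi} \subseteq \widehat{\varphi}$. This
implies that S4 can prove the implication $\Box\psi \to \varphi$. But then S4 can prove
$\Box\Box\psi \to \Box\varphi$, and hence, using the axiom (4), it can also prove $\Box\psi \to \Box\varphi$. It
follows that $\widehat{\Box\psi} \subseteq \widehat{\Box\varphi}$, and so the world $x$ belongs to $\widehat{\Box\varphi}$. QED

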
Now we can clinch the proof of our main result. 


THEOREM 5.27 (COMPLETENESS) For any set of formulas $\Gamma$,
if $\Gamma \models \varphi$ then $\Gamma \vdash_{S4} \varphi$.


Proof Suppose that $\Gamma \nvdash_{S4} \varphi$. Then $\Gamma \cup \{\neg\varphi\}$ is consistent, and by the standard
Lindenbaum lemma argument it can be extended to a maximally consistent set
$x$. By the truth lemma, $M^{\mathcal{L}}, x \models \neg\varphi$, whence $M^{\mathcal{L}}, x \not\models \varphi$, and we have
constructed the required counter-model. QED


COROLLARY 5.28 S4 is the logic of the class of all topological spaces.


We note that the whole construction in the completeness proof above would
also work if we restricted attention to the finite language consisting of the 
initial formula and all its subformulas. This means that we only get finitely 
many maximally consistent sets, and so non-provable formulas can be refuted 
on finite models, whose size is effectively computable from the formula itself. 
This is usually referred to as the effective finite model property. 


COROLLARY 5.29


1 S4 is the logic of the class of all finite topological spaces.


2 S4 has the effective finite model property with respect to the class of 
topological spaces. 


Incidentally, this also shows that validity in S4 is decidable, but we forego 
such complexity issues in this chapter. 

Comparing our construction with the standard modal Henkin model for S4 on the
same set $X^{\mathcal{L}}$, with canonical relation $R^{\mathcal{L}}$, the basic sets of our topology $\tau^{\mathcal{L}}$
are $R^{\mathcal{L}}$-upward closed. Hence, every open set of $\mathcal{X}^{\mathcal{L}}$ is $R^{\mathcal{L}}$-upward closed, and
$\tau^{\mathcal{L}}$ is weaker than the topology $\tau_{R^{\mathcal{L}}}$ corresponding to $R^{\mathcal{L}}$. In particular, our
canonical topological space is not an Alexandroff space. In fact, as shown in
Gabelaia, 2001, Theorem 3.2.3, $\tau_{R^{\mathcal{L}}}$ is the least Alexandroff topology containing
$\tau^{\mathcal{L}}$. Further discussion of this and related topics can be found in Aiello et al.,
2003, Sec. 3 and Gabelaia, 2001.


2.5 Completeness in special spaces 


We have already seen that S4 is the logic of all topological spaces. But there
are classical results with much more mathematical content, such as McKinsey
and Tarski's beautiful theorem that S4 is also the logic of any dense-in-itself
metric separable space. Here we concentrate on three spaces that play an
important role in mathematics (the Cantor space $\mathcal{C}$, the rational line $\mathbb{Q}$, and
the real line $\mathbb{R}$) and sketch three proofs, taken respectively from Aiello et al.,
2003, van Benthem et al., 2005 and Bezhanishvili and Gehrke, 2005, that S4 is
the logic of each of these spaces.


2.5.1 Completeness w.r.t. $\mathcal{C}$. We first show that S4 is the logic of the
Cantor space $\mathcal{C}$. Our exposition is rather sketchy. For full details we refer the
reader to Aiello et al., 2003, Sec. 4.1.

Suppose an S4-frame $\mathfrak{F} = (W, R)$ is given. We recall that $\mathfrak{F}$ is rooted if
there exists $r \in W$, called a root of $\mathfrak{F}$, such that $rRw$ for each $w \in W$. We
call $C \subseteq W$ a cluster if for all $w, v \in C$ we have $wRv$ and $vRw$. A cluster $C$
is called simple if it consists of a single point, and proper if it consists of more
than one point. The next theorem will aid in proving that S4 is the logic of $\mathcal{C}$.


THEOREM 5.30 (AIELLO ET AL., 2003) S4 is complete with respect to finite
rooted S4-frames whose every cluster is proper.


Now let a formula $\varphi$ be not provable in S4. By Theorem 5.30, $\varphi$ can be
refuted in a finite rooted S4-model $M = (W, R, \nu)$, with a root $r$, whose every
cluster is proper. We transform the latter into a counterexample on the Cantor
space $\mathcal{C}$. Our technique is selective unravelling, a refinement of the technique
of unravelling in modal logic (see, e.g., Blackburn et al., 2001). We select those
infinite paths of $M$ that are in a 1-1 correspondence with infinite paths of the
full infinite binary tree $T_2$ (see Fig. 5.9).





Figure 5.9. $T_2$.


We start with a root $r$ and announce $(r)$ as a selective path. Then if $(w_1, \ldots, w_k)$
is already a selective path, we introduce a left move by announcing
$(w_1, \ldots, w_k, w_k)$ as a selective path; and we introduce a right move by announcing
$(w_1, \ldots, w_k, w_{k+1})$ as a selective path if $w_k R w_{k+1}$ and $w_k \neq w_{k+1}$. (Since
we assumed that every cluster of $W$ is proper, such $w_{k+1}$ exists for every $w_k$.)
We call an infinite path $\sigma$ of $W$ selective if every initial segment of $\sigma$ is a finite
selective path of $W$. We denote by $\Sigma$ the set of all infinite selective paths of
$W$. For a finite selective path $(w_1, \ldots, w_k)$, let

$B_{(w_1, \ldots, w_k)} = \{\sigma \in \Sigma : \sigma \text{ has an initial segment } (w_1, \ldots, w_k)\}$.

Define a topology $\tau_\Sigma$ on $\Sigma$ by introducing

$B_\Sigma = \{B_{(w_1, \ldots, w_k)} : (w_1, \ldots, w_k) \text{ is a finite selective path of } W\}$

as a basis.

To see that $B_\Sigma$ is a basis, observe that $B_{(r)} = \Sigma$, and that

$B_{(w_1, \ldots, w_k)} \cap B_{(v_1, \ldots, v_m)} = \begin{cases}
B_{(w_1, \ldots, w_k)} & \text{if } (v_1, \ldots, v_m) \text{ is an initial segment of } (w_1, \ldots, w_k), \\
B_{(v_1, \ldots, v_m)} & \text{if } (w_1, \ldots, w_k) \text{ is an initial segment of } (v_1, \ldots, v_m), \\
\emptyset & \text{otherwise.}
\end{cases}$

In order to define $\nu_\Sigma$, note that every infinite selective path $\sigma$ of $W$ either
gets stable or keeps cycling. In other words, either $\sigma = (w_1, \ldots, w_k, w_k, \ldots)$
or $\sigma = (w_1, \ldots, w_n, w_{n+1}, \ldots)$ where $w_i$ belongs to some cluster $C \subseteq W$ for
$i > n$. In the former case we say that $w_k$ stabilizes $\sigma$, and in the latter that $\sigma$
keeps cycling in $C$. Now define $\nu_\Sigma$ on $\Sigma$ by

$\sigma \models p$ iff $\begin{cases}
w_k \models p & \text{if } w_k \text{ stabilizes } \sigma, \\
\rho(C) \models p & \text{if } \sigma \text{ keeps cycling in } C \subseteq W,
\end{cases}$

where $\rho(C)$ is some arbitrarily chosen representative of $C$.


All we need to show is that $(\Sigma, \tau_\Sigma)$ is homeomorphic to the Cantor space,
and that $M_\Sigma = (\Sigma, \tau_\Sigma, \nu_\Sigma)$ is topo-bisimilar to the initial $M$. In order to
show the first claim, let us recall that the Cantor space is homeomorphic to
the countable topological product of the two element set $2 = \{0, 1\}$ with the
discrete topology. To picture the Cantor space, one can think of the full infinite
binary tree $T_2$; starting at the root, one associates 0 to every left-son of a node
and 1 to every right-son. Then points of the Cantor space are infinite paths of $T_2$.
This together with the construction of $\Sigma$ immediately gives us that $(\Sigma, \tau_\Sigma)$ is
homeomorphic to $\mathcal{C}$.

Finally, we show that $M_\Sigma$ is topo-bisimilar to $M$. Define $F : \Sigma \to W$ by

$F(\sigma) = \begin{cases}
w_k & \text{if } w_k \text{ stabilizes } \sigma, \\
\rho(C) & \text{if } \sigma \text{ keeps cycling in } C.
\end{cases}$

Obviously $F$ is well-defined, and is actually surjective. (For any $w_k \in W$, we
have $F(\sigma_0, w_k, w_k, \ldots) = w_k$, where $\sigma_0$ is a selective path from $w_1$ to $w_k$.)


PROPOSITION 5.31 $F$ is a total topo-bisimulation between $M_\Sigma = (\Sigma, \tau_\Sigma, \nu_\Sigma)$
and $M = (W, R, \nu)$.


Proof (Sketch) With $(W, R)$ we can associate the finite topological space
$(W, \tau_R)$. The set $\{R(v) : v \in W\}$ forms a basis for $\tau_R$. Now the function
$F : (\Sigma, \tau_\Sigma) \to (W, \tau_R)$ is continuous because

$F^{-1}(R(v)) = \bigcup \{B_{(w_1, \ldots, w_k)} : v R w_k\}$

for each $v \in W$, and $F$ is open because

$F(B_{(w_1, \ldots, w_k)}) = R(w_k)$

for each basic open $B_{(w_1, \ldots, w_k)}$ of $(\Sigma, \tau_\Sigma)$. Therefore, $F$ is an interior map.
Moreover, as follows from the definition of $\nu_\Sigma$,

$\sigma \in \nu_\Sigma(p)$ iff $F(\sigma) \in \nu(p)$.

Since every interior map satisfying this condition is a topo-bisimulation, so is
our $F$. QED


THEOREM 5.32 S4 is the logic of $\mathcal{C}$.


Proof Suppose S4 $\nvdash \varphi$. Then there is a finite rooted model $M$ such that every
cluster of $M$ is proper and $M$ refutes $\varphi$. Since $\mathcal{C}$ is homeomorphic to $(\Sigma, \tau_\Sigma)$,
by Proposition 5.31, there exists a valuation $\nu_{\mathcal{C}}$ on $\mathcal{C}$ such that $(\mathcal{C}, \nu_{\mathcal{C}})$ is
topo-bisimilar to $M$. Hence, $\varphi$ is refuted on $\mathcal{C}$. QED


2.5.2 Completeness w.r.t. $\mathbb{Q}$. Now we show that S4 is also the logic
of the rational line $\mathbb{Q}$. Our proof is taken from van Benthem et al., 2005. Again
we use the infinite binary tree $T_2$, but this time we work with its nodes rather
than infinite paths. We start by recalling the following two well-known results.


THEOREM 5.33 (VAN BENTHEM-GABBAY) S4 is complete with respect
to $T_2$.


Proof For a proof see, e.g., Goldblatt, 1980, Theorem 1 and the subsequent 
discussion. QED 


THEOREM 5.34 (CANTOR) Every countable dense linear ordering without
endpoints is isomorphic to $\mathbb{Q}$.


Proof For a proof see, e.g., Kuratowski and Mostowski, 1976, p. 217, 
Theorem 2. QED 


REMARK 5.35 We recall that if $(X, <)$ is a linearly ordered set and $x, y \in X$
with $x < y$, then the open interval $(x, y)$ is the set $\{z \in X : x < z < y\}$. If we
view linearly ordered sets as topological spaces using the set of open intervals
as a basis for the topology, then it follows from Cantor's theorem that every
countable dense linear ordering without endpoints is (as a topological space)
homeomorphic to $\mathbb{Q}$.


We are now ready to proceed with the proof.


THEOREM 5.36 S4 is complete with respect to $\mathbb{Q}$.


Proof Our strategy is as follows. We use completeness of S4 with respect to
$T_2$, view $T_2$ as an Alexandroff space, define a dense subset $X$ of $\mathbb{Q}$ without
endpoints, and establish a topo-bisimulation between $X$ and $T_2$. This will
allow us to transfer counterexamples from $T_2$ to $X$, which by Cantor's theorem
is order-isomorphic, and hence homeomorphic, to $\mathbb{Q}$.

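Let $X = \bigcup_{n \in \omega} X_n$, where $X_0 = \{0\}$ and

$X_{n+1} = X_n \cup \{x - \frac{1}{3^n},\ x + \frac{1}{3^n} \mid x \in X_n\}$.


CLAIM 5.37 For $n > 0$ and $x, y \in X_n$, $x \neq y$ implies $|x - y| \geq \frac{1}{3^{n-1}}$.


Proof By induction on $n$. If $n = 1$, then $X_1 = \{0, 1, -1\}$, and so $x \neq y$
implies $|x - y| \geq 1$. That the claim holds for $n = k + 1$ is also not hard to
see. Note that if $u, v \in X_{n-1}$ with $u \neq v$, then, by the induction hypothesis,
$|u - v| \geq \frac{1}{3^{n-2}}$, and hence $|(u + \frac{1}{3^{n-1}}) - (v - \frac{1}{3^{n-1}})| \geq \frac{1}{3^{n-1}}$. QED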
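
The first few stages of this construction are small enough to compute exactly. The following Python sketch, using exact rational arithmetic, is an illustration only; `build_levels` and `min_gap` are hypothetical helper names. It builds $X_0, \ldots, X_5$ and lets one compare the smallest gap in $X_n$ with the bound $\frac{1}{3^{n-1}}$ of Claim 5.37.

```python
from fractions import Fraction

def build_levels(max_n):
    """Return [X_0, ..., X_max_n] with X_0 = {0} and X_{n+1} = X_n plus {x - 3^-n, x + 3^-n : x in X_n}."""
    levels = [{Fraction(0)}]
    for n in range(max_n):
        step = Fraction(1, 3 ** n)
        prev = levels[-1]
        levels.append(prev | {x - step for x in prev} | {x + step for x in prev})
    return levels

def min_gap(points):
    """Smallest distance between two distinct points of a finite set of rationals."""
    pts = sorted(points)
    return min(b - a for a, b in zip(pts, pts[1:]))

LEVELS = build_levels(5)

# Compare the smallest gap in X_n with the bound 1/3^(n-1) from Claim 5.37.
GAPS_RESPECT_BOUND = all(
    min_gap(LEVELS[n]) >= Fraction(1, 3 ** (n - 1)) for n in range(1, 6)
)
```

A finite computation like this does not replace the induction, but it makes the shape of the sets $X_n$ concrete: each stage triples the number of points.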





It follows from Claim 5.37 that $(X, <)$ is a countable dense linear ordering
without endpoints, thus order-isomorphic, and hence homeomorphic, to $\mathbb{Q}$. It
also follows that for each $x \in X$ with $x \neq 0$ there exists $n_x$ with $x \in X_{n_x}$
and $x \notin X_{n_x - 1}$, and that there is a unique $y \in X_{n_x - 1}$ with $x = y - \frac{1}{3^{n_x - 1}}$ or
$x = y + \frac{1}{3^{n_x - 1}}$. Therefore, the open $X$-intervals $(x - \frac{1}{3^n}, x + \frac{1}{3^n})$ with $n \geq n_x$ form a
basis for the order-topology on $X$.

Now we define $f$ from $X$ onto $T_2$ by recursion (see Fig. 5.10): If $x = 0$ then
we let $f(0)$ be the root $r$ of $T_2$; if $x \neq 0$ then $x \in X_{n_x} - X_{n_x - 1}$ and we let

$f(x) = \begin{cases}
\text{the immediate left successor of } f(y) & \text{if } x = y - \frac{1}{3^{n_x - 1}}, \\
\text{the immediate right successor of } f(y) & \text{if } x = y + \frac{1}{3^{n_x - 1}}.
\end{cases}$
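
The recursion defining $f$ can be traced mechanically. In the Python sketch below, which is an illustration under assumptions made here, nodes of $T_2$ are encoded as strings over '0' (left) and '1' (right) with the empty string as the root; this encoding and the name `label_points` are not from the text.

```python
from fractions import Fraction

def label_points(max_n):
    """Label every point of X_max_n by a node of T_2, encoded as a 0/1 string (root = '')."""
    labels = {Fraction(0): ""}
    for n in range(max_n):
        step = Fraction(1, 3 ** n)
        # Points present at stage n receive a left and a right child at distance 3^-n.
        for y, node in list(labels.items()):
            labels.setdefault(y - step, node + "0")
            labels.setdefault(y + step, node + "1")
    return labels

LABELS = label_points(3)
```

Note that the labelling is far from injective: both $1$ and $\frac{1}{3}$ are sent to the right successor of the root, because $0$ acquires a new right child at every stage.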


Figure 5.10. The first stages of the labelling in the completeness proof for S4: 0 is labelled 
by the root r of T2, —1 is labelled by the immediate left successor of r, 1 is labelled by the 
immediate right successor of r, and so on. 


CLAIM 5.38 $f$ is an interior map.


Proof (Sketch) We recall that a basis for the Alexandroff topology on $T_2$ is
$B = \{B_t\}_{t \in T_2}$, where $B_t = \{s \in T_2 : tRs\}$. Now $f$ is open because for a basic
open $X$-interval $(x - \frac{1}{3^n}, x + \frac{1}{3^n})$, we have $f(x - \frac{1}{3^n}, x + \frac{1}{3^n}) = B_{f(x)}$.
Also $f$ is continuous because for each $t \in T_2$, the $f$-inverse image of $B_t$ is
open. Indeed, if $x \in f^{-1}(B_t)$, then $f(x - \frac{1}{3^n}, x + \frac{1}{3^n}) = B_{f(x)} \subseteq B_t$,
implying that there exists an open interval $I = (x - \frac{1}{3^n}, x + \frac{1}{3^n})$ of $x$ such
that $I \subseteq f^{-1}(B_t)$. Thus, $f$ is interior. QED


To complete the proof, if S4 $\nvdash \varphi$, then by Theorem 5.33, there is a valuation $\nu$
on $T_2$ such that $(T_2, \nu), r \not\models \varphi$. Define a valuation $\xi$ on $X$ by $\xi(p) = f^{-1}(\nu(p))$.
Since $f$ is an interior map, and $f(0) = r$, we have that $0$ and $r$ are topo-bisimilar.
Therefore, $(X, \xi), 0 \not\models \varphi$. Now since $X$ is homeomorphic to $\mathbb{Q}$, we obtain
that $\varphi$ is also refutable on $\mathbb{Q}$. QED


2.5.3 Completeness w.r.t. $\mathbb{R}$. Finally, we show that S4 is also the
logic of the real line $\mathbb{R}$. There are at least three different proofs of this result.
The original one is a particular case of a more general theorem (McKinsey and
Tarski, 1944) that S4 is the logic of any dense-in-itself metric separable space
(see also Rasiowa and Sikorski, 1963). The other two can be found in Aiello et
al., 2003 and Bezhanishvili and Gehrke, 2005. Here we sketch the proof given
in Bezhanishvili and Gehrke, 2005, where the construction of the Cantor set on
any bounded interval of $\mathbb{R}$ is used to show that every finite rooted S4-frame is
an interior image of $\mathbb{R}$.

Suppose $a, b \in \mathbb{R}$, $a < b$, and $I = (a, b)$. We recall that the Cantor set $\mathcal{C}$ is
constructed inside $I$ by taking out open intervals from $I$ infinitely many times.
More precisely, in step 1 of the construction the open interval

$I_1^1 = \left(a + \frac{b - a}{3},\ a + \frac{2(b - a)}{3}\right)$

is taken out. We denote the remaining closed intervals by $J_1^1$ and $J_2^1$. In step 2
the open intervals

$I_1^2 = \left(a + \frac{b - a}{3^2},\ a + \frac{2(b - a)}{3^2}\right)$ and $I_2^2 = \left(a + \frac{7(b - a)}{3^2},\ a + \frac{8(b - a)}{3^2}\right)$

are taken out. We denote the remaining closed intervals by $J_1^2$, $J_2^2$, $J_3^2$, and $J_4^2$.
In general, in step $m$ the open intervals $I_1^m, \ldots, I_{2^{m-1}}^m$ are taken out, and the
closed intervals $J_1^m, \ldots, J_{2^m}^m$ remain.
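
The bookkeeping of removed and remaining intervals can be reproduced with exact fractions. The sketch below is an illustrative Python rendering of the step-by-step construction just described; the function names are hypothetical, and intervals are represented simply by their endpoint pairs without tracking openness.

```python
from fractions import Fraction

def cantor_step(remaining):
    """Remove the middle third of each interval; return (removed, kept) in left-to-right order."""
    removed, kept = [], []
    for lo, hi in remaining:
        third = (hi - lo) / 3
        kept.append((lo, lo + third))
        removed.append((lo + third, hi - third))
        kept.append((hi - third, hi))
    return removed, kept

def cantor_construction(a, b, steps):
    """Map each step m to the pair (removed intervals I^m_p, remaining intervals J^m_k) inside (a, b)."""
    remaining = [(Fraction(a), Fraction(b))]
    history = {}
    for m in range(1, steps + 1):
        removed, remaining = cantor_step(remaining)
        history[m] = (removed, remaining)
    return history

HISTORY = cantor_construction(0, 1, 3)
```

For $I = (0, 1)$ the second step removes $(\frac{1}{9}, \frac{2}{9})$ and $(\frac{7}{9}, \frac{8}{9})$, matching the displayed formulas with $a = 0$ and $b = 1$.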

Our immediate goal is to show that every finite tree is an interior image of $I$.
We first show that the tree $T$ of depth 2 and branching $n$ shown in Fig. 5.11 is
an interior image of $I$, and then extend this result to any finite tree by induction
on the depth of the tree.


Figure 5.11. The tree $T$ of depth 2 and branching $n$, with root $r$ and immediate successors $t_1, \ldots, t_n$.


LEMMA 5.39 T is an interior image of I. 


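Proof Define $f_I^T : I \to T$ by

$f_I^T(x) = \begin{cases}
t_k & \text{if } x \in \bigcup_{m \equiv k \ (\mathrm{mod}\ n)} \bigcup_{p=1}^{2^{m-1}} I_p^m, \\
r & \text{otherwise.}
\end{cases}$

Obviously, $f_I^T$ is a well-defined onto map. Moreover,

$(f_I^T)^{-1}(t_k) = \bigcup_{m \equiv k \ (\mathrm{mod}\ n)} \bigcup_{p=1}^{2^{m-1}} I_p^m$ and $(f_I^T)^{-1}(r) = \mathcal{C}$.

Since $\{\emptyset, \{t_1\}, \ldots, \{t_n\}, T\}$ is a family of basic open subsets of $T$, it is obvious
that $f_I^T$ is continuous. Suppose $U$ is an open interval of $I$. If $U \cap \mathcal{C} = \emptyset$, then
$f_I^T(U) \subseteq \{t_1, \ldots, t_n\}$, and so $f_I^T(U)$ is open. If $U \cap \mathcal{C} \neq \emptyset$, then there exists
$c \in U \cap \mathcal{C}$. Since $c \in \mathcal{C}$, we have $f_I^T(c) = r$. From $c \in U$ it follows that
there is $\varepsilon > 0$ such that $(c - \varepsilon, c + \varepsilon) \subseteq U$. We pick $m$ so that $\frac{b - a}{3^m} < \varepsilon$. As
$c \in \mathcal{C}$, there is $k \in \{1, \ldots, 2^m\}$ such that $c \in J_k^m$. Moreover, since the length
of $J_k^m$ is equal to $\frac{b - a}{3^m}$, we have that $J_k^m \subseteq U$. Therefore, $U$ contains the points
removed from $J_k^m$ in the subsequent iterations in the construction of $\mathcal{C}$. Thus,
$f_I^T(U) \supseteq \{t_1, \ldots, t_n\}$ and $f_I^T(U) = T$. Hence, $f_I^T(U)$ is open for any open
interval $U$ of $I$. It follows that $f_I^T$ is an onto interior map. QED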
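
For rational arguments the map $f_I^T$ can be evaluated exactly. The Python sketch below is an illustration with assumptions made here: it handles only rational $x$ strictly between $a$ and $b$, encodes the tree nodes as the strings "r" and "t1", ..., "tn", and uses the hypothetical helper names `removal_step` and `tree_map`. A rational point lies in the Cantor set exactly when its orbit under the rescaling maps cycles without ever entering a middle third, which is detected by remembering visited values.

```python
from fractions import Fraction

def removal_step(x, a, b):
    """Step m at which the rational x in (a, b) is taken out, or None if x stays in the Cantor set."""
    y = (Fraction(x) - a) / (Fraction(b) - a)   # rescale (a, b) to (0, 1)
    seen = set()
    m = 1
    while y not in seen:
        seen.add(y)
        if Fraction(1, 3) < y < Fraction(2, 3):
            return m                             # y lies in the open middle third removed at step m
        # Zoom into the left or right remaining third.
        y = 3 * y if y <= Fraction(1, 3) else 3 * y - 2
        m += 1
    return None                                  # the orbit cycles, so x is never removed

def tree_map(x, a, b, n):
    """Value of the map at rational x: 't<k>' with k congruent to the removal step mod n, else 'r'."""
    m = removal_step(x, a, b)
    if m is None:
        return "r"
    k = m % n or n                               # representative of m mod n in {1, ..., n}
    return f"t{k}"
```

For instance, with $I = (0, 1)$ and $n = 3$, the point $\frac{1}{2}$ is removed at step 1 and goes to $t_1$, while $\frac{1}{4}$ is never removed and goes to $r$.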


THEOREM 5.40 Every finite tree of branching n > 1 is an interior 
image of I. 


Proof Suppose $T$ is a finite tree of branching $n > 1$. Without loss of generality
we may assume that the depth of $T$ is $d + 1$, where $d \geq 2$. Then we can represent
$T$ as shown in Fig. 5.12, where $t_1, \ldots, t_{n^{d-1}}$ are the elements of $T$ of depth 2,
and $T_d$ is the subtree of $T$ of all elements of $T$ of depth $\geq 2$. We note that
for each $k \in \{1, \ldots, n^{d-1}\}$ the upset $R(t_k)$ is isomorphic to the tree of depth 2
and branching $n$, and that $T_d$ is the tree of depth $d$ and branching $n$. So by the
induction hypothesis, there is an onto interior map from $I$ onto $T_d$. Also, by
Lemma 5.39, there exists an onto interior map from $I$ onto each $R(t_k)$. Now
putting these maps together produces an onto interior map from $I$ onto $T$. For
the details we refer to Bezhanishvili and Gehrke, 2005, Theorem 8. QED


Figure 5.12. $T$ and $T_d$.


The following useful assertion is now an easy consequence. 


COROLLARY 5.41 Every finite rooted partially ordered S4-frame is an interior
image of $\mathbb{R}$.


Proof Since every finite rooted partially ordered S4-frame is a p-morphic
image of some finite tree of branching $n > 1$ (Bezhanishvili and Gehrke,
2005, Lemma 4), it follows from Theorem 5.40 that every finite rooted partially
ordered S4-frame is an interior image of any bounded open interval $I \subseteq \mathbb{R}$.
Since $I$ is homeomorphic to $\mathbb{R}$, the corollary follows. QED


We now extend Corollary 5.41 and show that all finite rooted S4-frames
are interior images of $\mathbb{R}$. Suppose $\mathfrak{F}$ is an S4-frame. We define an equivalence
relation $\sim$ on $\mathfrak{F}$ by $w \sim v$ iff $w, v$ belong to the same cluster. Let $\mathfrak{F}/{\sim}$ denote
the skeleton of $\mathfrak{F}$. That is, $\mathfrak{F}/{\sim}$ is the quotient of $\mathfrak{F}$ by $\sim$, and $R_\sim$ is defined on
$W/{\sim}$ componentwise.


DEFINITION 5.42 We call an S4-frame $\mathfrak{F}$ a quasi-tree if $\mathfrak{F}/{\sim}$ is a tree.


Suppose $Q$ is a quasi-tree. We say that the swelling of $Q$ is $q$ if every cluster
of $Q$ consists of exactly $q$ elements. Again, our immediate goal is to show that
every finite quasi-tree is an interior image of $I$. This we show by first obtaining
the quasi-tree $Q$ of depth 2, branching $n$, and swelling $q$, shown in Fig. 5.13, as
an interior image of $I$, and then extending this result to any finite quasi-tree by
induction on the depth of the quasi-tree. For this we use the following lemma
taken from Bezhanishvili and Gehrke, 2005, Lemma 11.


Figure 5.13. Quasi-tree $Q$ of depth 2, branching $n$, and swelling $q$.


LEMMA 5.43 If $X$ has a countable basis and every countable subset of $X$
is boundary, then for each natural number $n$ there exist disjoint dense and
boundary subsets $A_1, \ldots, A_n$ of $X$ such that $X = \bigcup_{i=1}^{n} A_i$.


LEMMA 5.44 Q is an interior image of I. 


Proof We denote the least cluster of $Q$ by $r$ and its elements by $r_1, \ldots, r_q$.
Also for $1 \leq i \leq n$ we denote the $i$th maximal cluster of $Q$ by $t^i$ and its elements
by $t^i_1, \ldots, t^i_q$. Since the Cantor set $\mathcal{C}$ satisfies the conditions of Lemma 5.43, it
can be divided into $q$-many disjoint dense and boundary subsets $\mathcal{C}_1, \ldots, \mathcal{C}_q$.
Also each $I_p^m$ ($1 \leq p \leq 2^{m-1}$, $m \in \omega$) satisfies the conditions of Lemma 5.43,
and so each $I_p^m$ can be divided into $q$-many disjoint dense and boundary subsets
$(I_p^m)^1, \ldots, (I_p^m)^q$. Suppose $1 \leq k \leq q$. We define $f_I^Q : I \to Q$ by

$f_I^Q(x) = \begin{cases}
t^i_k & \text{if } x \in \bigcup_{m \equiv i \ (\mathrm{mod}\ n)} \bigcup_{p=1}^{2^{m-1}} (I_p^m)^k, \\
r_k & \text{if } x \in \mathcal{C}_k.
\end{cases}$

It is clear that $f_I^Q$ is a well-defined onto map. As with Lemma 5.39, we have

$(f_I^Q)^{-1}(t^i) = \bigcup_{m \equiv i \ (\mathrm{mod}\ n)} \bigcup_{p=1}^{2^{m-1}} I_p^m$ and $(f_I^Q)^{-1}(r) = \mathcal{C}$.

Hence, $f_I^Q$ is continuous. To show that $f_I^Q$ is open let $U$ be an open interval in
$I$. If $U \cap \mathcal{C} = \emptyset$, then $f_I^Q(U) \subseteq \bigcup_{i=1}^{n} t^i$. Moreover, since $(I_p^m)^1, \ldots, (I_p^m)^q$
partition $I_p^m$ into $q$-many disjoint dense and boundary subsets, $U \cap I_p^m \neq \emptyset$
implies $U \cap (I_p^m)^k \neq \emptyset$ for every $k \in \{1, \ldots, q\}$. Hence, if $f_I^Q(U)$ contains
an element of a cluster $t^i$, it contains the whole cluster. Thus, $f_I^Q(U)$ is open.
Now suppose $U \cap \mathcal{C} \neq \emptyset$. Since $\mathcal{C}_1, \ldots, \mathcal{C}_q$ partition $\mathcal{C}$ into $q$-many disjoint
dense and boundary subsets, $U \cap \mathcal{C}_k \neq \emptyset$ for every $k \in \{1, \ldots, q\}$. Hence,
$r \subseteq f_I^Q(U)$. Moreover, the same argument as in the proof of Lemma 5.39
guarantees that every point greater than points in $r$ also belongs to $f_I^Q(U)$.
Thus $f_I^Q(U) = Q$, implying that $f_I^Q$ is an onto interior map. QED


THEOREM 5.45 Every finite quasi-tree of branching n and swelling q is an 
interior image of I. 


Proof This follows along the same lines as the proof of Theorem 5.40 but is 
based on Lemma 5.44 instead of Lemma 5.39. QED 


COROLLARY 5.46 Every finite rooted S4-frame is an interior image of $\mathbb{R}$.


Proof This follows along the same lines as the proof of Corollary 5.41 but
is based on the fact that every finite rooted S4-frame is a p-morphic image of
some finite quasi-tree of branching $n$ and swelling $q$ (Bezhanishvili and Gehrke,
2005, Lemma 5) and Theorem 5.45. QED


THEOREM 5.47 S4 is complete with respect to $\mathbb{R}$.


Proof If S4 $\nvdash \varphi$, then there exists a finite rooted S4-model $M = (W, R, \nu)$,
with a root $r$, such that $M, r \not\models \varphi$. By Corollary 5.46, there exists an onto
interior map $f : \mathbb{R} \to W$. Define a valuation $\xi$ on $\mathbb{R}$ by $\xi(p) = f^{-1}(\nu(p))$.
Then $f$ is a total topo-bisimulation between $(\mathbb{R}, \xi)$ and $M$. Thus, there exists
a point $x \in \mathbb{R}$ such that $x \not\models \varphi$. QED


We recall that a subset $A$ of $\mathbb{R}$ is convex if $x, y \in A$ and $x \leq z \leq y$ imply
that $z \in A$.


COROLLARY 5.48 S4 is complete with respect to boolean combinations of
countable unions of convex subsets of $\mathbb{R}$.


Proof Let $f : \mathbb{R} \to W$ be the onto interior map from the proof of Theorem
5.47. Observe that for each $w \in W$ we have that $f^{-1}(w)$ is a boolean combination
of countable unions of convex subsets of $\mathbb{R}$ (Bezhanishvili and Gehrke,
2005, Theorem 15). The result follows. QED


2.6 The landscape of spatial logics over S4 


As we have already seen, S4 is the logic of all topological spaces when
interpreting $\Diamond$ as closure. In addition, S4 turned out to be the logic of the Cantor
space $\mathcal{C}$, the rational line $\mathbb{Q}$, the real line $\mathbb{R}$, or more generally, any
dense-in-itself metric separable space. These results, although rich in mathematical
content, also indicate serious limitations of the basic modal language
in expressing various topological properties. For example, the completeness
of S4 with respect to any dense-in-itself metric separable space already implies
that such topological properties as being dense-in-itself, metric, or separable
are not definable in the basic modal language. Here we address the topological
definability issue, as well as review several normal extensions of S4 that are
complete with respect to interesting classes of topological spaces.


Topological definability and undefinability: Suppose a class $K$ of topological
spaces is given. We say that $K$ is topologically definable or simply
topo-definable if there exists a set of modal formulas $\Gamma$ such that for each topological
space $\mathcal{X}$ we have $\mathcal{X} \in K$ iff $\mathcal{X} \models \Gamma$. Topological completeness of S4 tells
us that the class Top of all topological spaces is topo-definable (by the formula
$\top$ over S4). However, as we will see below, many important classes of
topological spaces such as the classes of compact or connected spaces are not
topo-definable. For this we will need the following theorem, first established
in Gabelaia, 2001 and van Benthem et al., 2003.


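THEOREM 5.49 Suppose $\varphi$ is an arbitrary modal formula.

1 If $\mathcal{Y}$ is an interior image of $\mathcal{X}$, then $\mathcal{X} \models \varphi$ implies $\mathcal{Y} \models \varphi$.

2 If $\mathcal{Y}$ is an open subspace of $\mathcal{X}$, then $\mathcal{X} \models \varphi$ implies $\mathcal{Y} \models \varphi$.

3 If $\mathcal{X}$ is the topological sum of $\{\mathcal{X}_i\}_{i \in I}$, then $\mathcal{X} \models \varphi$ iff $\mathcal{X}_i \models \varphi$ for each
$i \in I$.


Now we are ready to show that compactness, connectedness, and the separation
axioms $T_0$, $T_D$, $T_1$, and $T_2$ are not topo-definable.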


PROPOSITION 5.50 (GABELAIA, 2001)

1 Neither compactness nor connectedness is topo-definable.

2 None of the separation axioms $T_0$, $T_D$, $T_1$, and $T_2$ is topo-definable.


Proof (1) Let $\mathcal{X} = (\{x\}, \tau)$ be a singleton set with the discrete topology.
Then obviously $\mathcal{X}$ is both compact and connected. On the other hand, any
infinite topological sum of $\mathcal{X}$ is neither compact nor connected. Now apply
Theorem 5.49(3).

(2) Let $\mathcal{X} = (\{x, y\}, \tau)$ be a two point set with the trivial topology. Then
obviously $\mathcal{X}$ does not satisfy any of the four separation axioms. Define
$f : \mathbb{R} \to \{x, y\}$ by

$f(r) = \begin{cases}
x & \text{if } r \in \mathbb{Q}, \\
y & \text{otherwise.}
\end{cases}$

Then it is easy to see that $f$ is an onto interior map. Now observe that $\mathbb{R}$ satisfies
all the four separation axioms and apply Theorem 5.49(1). QED


REMARK 5.51 $\mathbb{R}$ also satisfies stronger separation axioms such as $T_3$ (regularity),
$T_{3\frac{1}{2}}$ (complete regularity), $T_4$ (normality), $T_5$, and $T_6$. Therefore,
Proposition 5.50 also implies that none of $T_3$, $T_{3\frac{1}{2}}$, $T_4$, $T_5$, and $T_6$ is topo-definable.
2 


Proposition 5.50 indicates that the basic modal language $\mathcal{L}$ is not expressive
enough for topological purposes. In Sec. 3 we will consider several enrichments
of $\mathcal{L}$ and show that some topological properties not expressible in $\mathcal{L}$ can be
expressed in its various enrichments. Nevertheless, it seems to be a natural
question to characterize those classes of topological spaces that can be defined in
$\mathcal{L}$. An answer to this question was given in Gabelaia, 2001, where a topological
analogue of the Goldblatt-Thomason theorem was established.


DEFINITION 5.52 (GABELAIA, 2001) Let $\mathcal{X} = (X, \tau)$ be a topological
space. We let $uf(X)$ denote the set of ultrafilters of the powerset $P(X)$ and
define $R$ on $uf(X)$ by

$w R u$ iff $A \in u$ implies $\mathrm{Cl}(A) \in w$

for each $A \subseteq X$. It is easy to verify that $R$ is reflexive and transitive on $uf(X)$.
Let $\tau_R$ denote the Alexandroff topology on $uf(X)$ generated by $R$. We call
$ae(\mathcal{X}) = (uf(X), \tau_R)$ the Alexandroff extension of $\mathcal{X}$.


We note that if the original topology on $X$ is Alexandroff, then the
Alexandroff extension of $\mathcal{X}$ can be obtained by first taking the ultrafilter
extension of $\mathcal{X}$ (van Benthem, 1983a) and then taking the corresponding Alexandroff
space. Alexandroff extensions of topological spaces turn out to be crucial for
obtaining a topological version of the Goldblatt-Thomason theorem.


DEFINITION 5.53 We say that a class $K$ of topological spaces reflects
Alexandroff extensions if for each topological space $\mathcal{X}$ we have $ae(\mathcal{X}) \in K$
implies $\mathcal{X} \in K$.


THEOREM 5.54 (GABELAIA, 2001) Let K be a class of topological spaces 
closed under formation of Alexandroff extensions. Then K is modally definable 
iff it is closed under taking open subspaces, interior images, topological sums, 
and it reflects Alexandroff extensions. 


Several refinements of Theorem 5.54 and extensions to richer languages can
be found in Gabelaia and Sustretov, 2005 and ten Cate et al., 2006. Below
we present a number of topo-definable classes of spaces, as well as normal
extensions of S4 that are complete with respect to topologically interesting classes
of spaces.


The logic of discrete spaces: It is rather easy to see that $\mathcal{X} \models p \to \Box p$ iff
every subset of $X$ is open iff $\mathcal{X}$ is discrete. Therefore, $p \to \Box p$ (or equivalently
$\Diamond p \to p$) topo-defines the class of discrete spaces.


S5 and clopens: We observe that

$\mathcal{X} \models p \to \Box\Diamond p$ iff $A \subseteq \mathrm{Int}(\mathrm{Cl}(A))$ for each $A \subseteq X$
iff $\mathrm{Cl}(A) \subseteq \mathrm{Int}(\mathrm{Cl}(A))$ for each $A \subseteq X$
iff every closed subset of $X$ is open.

Therefore, $p \to \Box\Diamond p$ (or equivalently $\Diamond p \to \Box\Diamond p$) topo-defines the class of
topological spaces in which every closed subset is open.
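
Both correspondences can be tested exhaustively on small carriers. The Python sketch below is only an illustration; the helper names are hypothetical and a three-point carrier is an arbitrary choice. For every topology on the carrier it compares validity of $p \to \Box p$ with discreteness and validity of $p \to \Box\Diamond p$ with the property that every closed set is open.

```python
from itertools import combinations

def powerset(points):
    """All subsets of `points` as frozensets."""
    pts = sorted(points)
    return [frozenset(c) for r in range(len(pts) + 1) for c in combinations(pts, r)]

def all_topologies(points):
    """Brute-force enumeration of the topologies on a small finite set."""
    subsets, full = powerset(points), frozenset(points)
    for mask in range(2 ** len(subsets)):
        opens = {s for i, s in enumerate(subsets) if (mask >> i) & 1}
        if frozenset() in opens and full in opens and all(
            (u | v) in opens and (u & v) in opens for u in opens for v in opens
        ):
            yield opens

def box(a, opens):
    """Interior: the union of the open sets contained in a."""
    result = frozenset()
    for u in opens:
        if u <= a:
            result |= u
    return result

def dia(a, opens, full):
    """Closure, as the complement of the interior of the complement."""
    return full - box(full - a, opens)

def compare(points):
    """For each topology, record whether the two correspondences agree."""
    full, subsets = frozenset(points), powerset(points)
    outcomes = []
    for opens in all_topologies(points):
        valid_discrete_axiom = all(a <= box(a, opens) for a in subsets)
        valid_b_axiom = all(a <= box(dia(a, opens, full), opens) for a in subsets)
        is_discrete = all(s in opens for s in subsets)
        closed_are_open = all((full - u) in opens for u in opens)
        outcomes.append((valid_discrete_axiom == is_discrete,
                         valid_b_axiom == closed_are_open))
    return outcomes

OUTCOMES = compare({0, 1, 2})
```

Every entry of `OUTCOMES` should be a pair of `True` values if the two characterizations hold on this carrier.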


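S4.2 and extremally disconnected spaces: We recall that 

S4.2 = S4 + ($\Diamond\Box p \to \Box\Diamond p$). 

Now observe that 

$\mathcal{X} \models \Diamond\Box p \to \Box\Diamond p$ iff $\mathrm{Cl}(\mathrm{Int}(A)) \subseteq \mathrm{Int}(\mathrm{Cl}(A))$ for each $A \subseteq X$ 
iff $\mathrm{Cl}(\mathrm{Int}(A)) = \mathrm{Int}(\mathrm{Cl}(\mathrm{Int}(A)))$ for each $A \subseteq X$ 
iff the closure of every open subset of X is open 
iff $\mathcal{X}$ is extremally disconnected. 

Therefore, $\Diamond\Box p \to \Box\Diamond p$ topo-defines the class of extremally disconnected 
spaces. 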
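On a finite space the set-theoretic conditions above can be checked by brute force. The following Python sketch is only an illustration: the helper names and the two example spaces (the Sierpinski space and the 2-fork viewed as an Alexandroff space) are assumptions chosen here, not taken from the text. It tests the condition Cl(Int(A)) ⊆ Int(Cl(A)) for every A and compares it with extremal disconnectedness.

```python
from itertools import chain, combinations

def powerset(points):
    """All subsets of a finite set, as frozensets."""
    pts = list(points)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

def interior(A, opens):
    """Int(A): the union of all open sets contained in A."""
    return frozenset().union(*(U for U in opens if U <= A))

def closure(A, X, opens):
    """Cl(A) = X - Int(X - A)."""
    return X - interior(X - A, opens)

def satisfies_s42_condition(X, opens):
    """Cl(Int(A)) is contained in Int(Cl(A)) for every subset A of X."""
    return all(closure(interior(A, opens), X, opens)
               <= interior(closure(A, X, opens), opens)
               for A in powerset(X))

def extremally_disconnected(X, opens):
    """The closure of every open set is open."""
    return all(closure(U, X, opens) in opens for U in opens)

# Sierpinski space (illustrative): points 0, 1 with open sets {}, {1}, {0, 1}.
SIERPINSKI = frozenset({0, 1})
SIERPINSKI_OPENS = {frozenset(), frozenset({1}), SIERPINSKI}

# The 2-fork (root r below a and b) as an Alexandroff space:
# the open sets are the upward closed sets.
FORK = frozenset({"r", "a", "b"})
FORK_OPENS = {frozenset(), frozenset({"a"}), frozenset({"b"}),
              frozenset({"a", "b"}), FORK}
```

In this sketch the Sierpinski space passes both tests, while the 2-fork fails both: the closure of the open set {a} is {r, a}, which is not open.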


S4.1 and filters of dense sets: We recall that 

S4.1 = S4 + ($\Box\Diamond p \to \Diamond\Box p$). 

For a topological space $\mathcal{X}$ let D(X) be the set of all dense subsets of X. Now 
$\mathcal{X} \models \Box\Diamond p \to \Diamond\Box p$ iff $\mathrm{Int}(\mathrm{Cl}(A)) \subseteq \mathrm{Cl}(\mathrm{Int}(A))$ for each $A \subseteq X$. As shown 
in Bezhanishvili et al., 2003, p. 293, Proposition 2.1, the last condition is 
equivalent to D(X) being a filter. So, $\Box\Diamond p \to \Diamond\Box p$ topo-defines the class of 
topological spaces in which D(X) is a filter. 



S4.Grz and hereditarily irresolvable spaces: We recall that 

S4.Grz = S4 + $\Box(\Box(p \to \Box p) \to p) \to \Box p$. 

We also recall that a space $\mathcal{X}$ is resolvable if it can be represented as the union 
of two disjoint dense subsets, that it is irresolvable if it is not resolvable, that it 
is hereditarily irresolvable if every subspace of $\mathcal{X}$ is irresolvable, and that it is 
scattered if every subspace of $\mathcal{X}$ has an isolated point. 

For each $A \subseteq X$ let $\rho(A) = A \cap \mathrm{Cl}(\mathrm{Cl}(A) - A)$. Esakia, 1981 observed 
that $\mathcal{X} \models \Box(\Box(p \to \Box p) \to p) \to \Box p$ iff $A \subseteq \mathrm{Cl}(A - \rho(A))$ for each 
$A \subseteq X$. As shown in Bezhanishvili et al., 2003, p. 295, Theorem 2.4, the 
last condition is equivalent to $\mathcal{X}$ being hereditarily irresolvable. Therefore, 
$\Box(\Box(p \to \Box p) \to p) \to \Box p$ topo-defines the class of hereditarily irresolvable 
spaces. Now since S4.Grz is complete with respect to its relational semantics 
and since for an Alexandroff space $\mathcal{X}_R = (X, \tau_R)$ the notions of hereditarily 
irresolvable and scattered coincide with each other and with the notion of (X, R) 
having no infinite ascending chains (Gabelaia, 1999), we obtain that S4.Grz 
is the logic of hereditarily irresolvable spaces, and also the logic of scattered 
spaces. As scattered spaces are a proper subclass of hereditarily irresolvable 
spaces (Bezhanishvili et al., 2003), they are not topo-definable. Moreover, since 
S4.Grz is also the logic of ordinals (Abashidze and Esakia, 1987), ordinals are 
not topo-definable either. 


Figure 5.14. The 2-fork frame. 


Euclidean hierarchy: Corollary 5.48 shows that the logic of boolean combinations 
of countable unions of convex subsets of $\mathbb{R}$ is already S4. The logic 
becomes much stronger, however, if we restrict our attention to finite unions of 
convex subsets of $\mathbb{R}$. 

We call a subset of $\mathbb{R}$ serial if it is a finite union of convex subsets of $\mathbb{R}$. Let 
$S(\mathbb{R})$ denote the family of serial subsets of $\mathbb{R}$. Unlike the countable unions of 
convex subsets of $\mathbb{R}$, $S(\mathbb{R})$ does form a boolean algebra. We call a valuation v 
on $\mathbb{R}$ serial if $v(p) \in S(\mathbb{R})$ for each propositional letter p. We call a formula 
$\varphi$ s-true if it is true in $\mathbb{R}$ under a serial valuation, and we call $\varphi$ s-valid if $\varphi$ is 
s-true for each serial valuation. Let $L(S) = \{\varphi : \varphi \text{ is s-valid}\}$. It is easy to see 
that L(S) is a normal extension of S4, which we refer to as the logic of serial subsets 
of $\mathbb{R}$. The following theorem was first established in Aiello et al., 2003 (see 
also van Benthem et al., 2003): 


THEOREM 5.55 L(S) is the logic of the 2-fork frame $\mathfrak{F}$ shown in Fig. 5.14. 


For $n \geq 2$, we call $X \subseteq \mathbb{R}^n$ hyper-rectangular convex if $X = X_1 \times \cdots \times X_n$, 
where all the $X_i$'s are convex subsets of $\mathbb{R}$. We also call $X \subseteq \mathbb{R}^n$ n-chequered 
if it is a finite union of hyper-rectangular convex subsets of $\mathbb{R}^n$. Let $CH(\mathbb{R}^n)$ 
denote the set of all n-chequered subsets of $\mathbb{R}^n$. As with $S(\mathbb{R})$, we have that 
$CH(\mathbb{R}^n)$ forms a boolean algebra. We call a valuation v on $\mathbb{R}^n$ n-chequered 
if $v(p) \in CH(\mathbb{R}^n)$ for each propositional letter p. We call a formula $\varphi$ n-true 
if it is true in $\mathbb{R}^n$ under an n-chequered valuation, and we call $\varphi$ n-valid if $\varphi$ is 
n-true for each n-chequered valuation. Let $L_n = \{\varphi : \varphi \text{ is n-valid}\}$. As with 
L(S), we have that $L_n$ is a normal extension of S4, which we refer to as the logic of 
n-chequered subsets of $\mathbb{R}^n$. Moreover, the logics form a decreasing chain: 

$L(S) = L_1 \supset L_2 \supset L_3 \supset \cdots \supset L_n \supset \cdots$ 

Let $\mathfrak{F}^n$ denote the Cartesian product of the 2-fork frame $\mathfrak{F}$ with itself n times. 
The following theorem was proved in van Benthem et al., 2003: 


THEOREM 5.56 For $n \geq 2$ we have that $L_n$ is the logic of $\mathfrak{F}^n$. 


In particular, the logic of chequered subsets of the real plane coincides with 
the logic of $\mathfrak{F}^2$. An illustration of $\mathfrak{F}^2$ is given in Fig. 5.15. 


Figure 5.15. $\mathfrak{F}^2$. 


We call $X \subseteq \mathbb{R}^\infty$ $\infty$-rectangular convex if $X = \prod_i X_i$, where each $X_i$ 
is a convex subset of $\mathbb{R}$, and all but finitely many of the $X_i$'s are equal to either $\mathbb{R}$ 
or $\emptyset$. We call $X \subseteq \mathbb{R}^\infty$ $\infty$-chequered if it is a finite union of $\infty$-rectangular 
convex subsets of $\mathbb{R}^\infty$. Let $CH(\mathbb{R}^\infty)$ denote the set of $\infty$-chequered subsets 
of $\mathbb{R}^\infty$. As with each $CH(\mathbb{R}^n)$, we have that $CH(\mathbb{R}^\infty)$ forms a boolean 
algebra. We call a valuation v on $\mathbb{R}^\infty$ $\infty$-chequered if $v(p) \in CH(\mathbb{R}^\infty)$ for 
each propositional letter p. We call a formula $\varphi$ $\infty$-true if it is true in $\mathbb{R}^\infty$ 
under an $\infty$-chequered valuation, and we call $\varphi$ $\infty$-valid if $\varphi$ is $\infty$-true for each 
$\infty$-chequered valuation. Let $L_\infty = \{\varphi : \varphi \text{ is } \infty\text{-valid}\}$. It is easy to see that 
$L_\infty$ is a normal extension of S4, which we refer to as the logic of $\infty$-chequered subsets 
of $\mathbb{R}^\infty$. The following theorem can be found in van Benthem et al., 2003: 


THEOREM 5.57 $L_\infty = \bigcap_n L_n$. 


To summarize, we obtained the logic L(S) of serial subsets of the real line 
$\mathbb{R}$, as well as its natural generalizations, the logics $L_n$ of sufficiently 
well-behaved n-chequered subsets of n-dimensional Euclidean spaces $\mathbb{R}^n$. Unlike 
the full modal logic of each Euclidean space $\mathbb{R}^n$, which coincides with S4, 
all logics $L_n$ are different, forming a decreasing chain converging to the logic 
$L_\infty$ of $\infty$-chequered subsets of $\mathbb{R}^\infty$. This provides us with a sort of Euclidean 
hierarchy in modal logic. It has been suggested by Litak, 2004 that $L_\infty$ may 
be closely related to the quite differently motivated ‘Logic of Finite Problems’ 
first defined by Medvedev. Recently it was shown in Fontaine, 2006 that the 
Medvedev logic is not finitely axiomatizable over $L_\infty$. 


REMARK 5.58 As was pointed out to us by David Gabelaia, L(S) coincides 
with the logic of the digital line (also known as the Khalimsky line), while 
$L_2$ coincides with the logic of the digital plane (also known as the Khalimsky 
plane). Whether this correspondence extends to higher dimensions remains an 
open problem. For the definition and basic results on digital topologies we refer 
to Ch. 12. 


3. Modal logic and topology: further directions 


In Sec. 2 we were chiefly concerned with the interpretation of $\Diamond$ as closure, 
the resulting logic S4, and the landscape of spatial logics over S4. But this 
classical picture is not all there is to modal logic and topology. We already 
noticed that many important topological properties are not expressible in the 
basic modal language $\mathcal{L}$. In this section we discuss several different ways of 
increasing the expressive power of $\mathcal{L}$, viz. modalizing the derivative operator, 
product constructions on topological spaces, extended modal languages, and 
epistemic interpretations of topological models. There is no single story-line 
here, but together these topics show the liveliness of current research. 


3.1 $\Diamond$ as derivative 


There are at least two natural ways to increase the expressive power of $\mathcal{L}$. 
One is to add new modal operators to $\mathcal{L}$, and the other is to interpret the modal $\Diamond$ 
as a topological operator that is more expressive than the closure operator. We 
consider adding new modal operators to $\mathcal{L}$ in Sec. 3.2 and 3.3. Here we outline 
some of the consequences of interpreting $\Diamond$ as derivative (first suggested by 
McKinsey and Tarski, 1944). Since $\mathrm{Cl}(A) = A \cup d(A)$ for each $A \subseteq X$, the 
derivative operator is more expressive than the closure operator. We recall that 
$x \in d(A)$ iff $A \cap (U - \{x\}) \neq \emptyset$ for each open neighborhood U of x, that the 
co-derivative of A is $t(A) = X - d(X - A)$, and that $x \in t(A)$ iff there exists 
an open neighborhood U of x such that $U \subseteq A \cup \{x\}$. 
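For finite spaces, the derivative and co-derivative just recalled can be computed directly from the definitions. The Python sketch below is an illustration under assumptions made here: the function names and the three-point example space are hypothetical. It checks Cl(A) = A ∪ d(A) and the neighborhood characterization of t(A) for every subset A.

```python
from itertools import chain, combinations

def powerset(points):
    """All subsets of a finite set, as frozensets."""
    pts = list(points)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

def derivative(A, X, opens):
    """d(A): x is in d(A) iff every open neighbourhood U of x meets A - {x}."""
    return frozenset(x for x in X
                     if all((U - {x}) & A for U in opens if x in U))

def co_derivative(A, X, opens):
    """t(A) = X - d(X - A)."""
    return X - derivative(X - A, X, opens)

def closure(A, X, opens):
    """Cl(A): x is in Cl(A) iff every open neighbourhood of x meets A."""
    return frozenset(x for x in X if all(U & A for U in opens if x in U))

def t_by_neighbourhoods(A, X, opens):
    """x is in t(A) iff some open neighbourhood U of x lies inside A + {x}."""
    return frozenset(x for x in X
                     if any(x in U and U <= A | {x} for U in opens))

# Illustrative space: points 0 and 1 are topologically indistinguishable.
X = frozenset({0, 1, 2})
OPENS = [frozenset(), frozenset({0, 1}), X]

closure_is_A_union_dA = all(
    closure(A, X, OPENS) == A | derivative(A, X, OPENS) for A in powerset(X))
t_characterisation_holds = all(
    co_derivative(A, X, OPENS) == t_by_neighbourhoods(A, X, OPENS)
    for A in powerset(X))
```

In this example d({0}) = {1, 2}: the point 0 is not a limit point of itself, but it is a limit point for both 1 and 2.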

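Let $M = (\mathcal{X}, v)$ be a topo-model. We define when a formula $\varphi$ is d-true at 
a point $x \in X$ by induction on the length of $\varphi$: 

• $x \models_d p$ iff $x \in v(p)$; 

• $x \models_d \neg\varphi$ iff not $x \models_d \varphi$; 

• $x \models_d \varphi \wedge \psi$ iff $x \models_d \varphi$ and $x \models_d \psi$; 

• $x \models_d \Box\varphi$ iff $\exists U \in \tau\ (x \in U \ \&\ \forall y \in U - \{x\} : y \models_d \varphi)$; 

and hence also 

• $x \models_d \Diamond\varphi$ iff $\forall U \in \tau\ (x \in U \to \exists y \in U - \{x\} : y \models_d \varphi)$. 


We say that $\varphi$ is d-true in $M = (\mathcal{X}, v)$ if $\varphi$ is d-true at every $x \in X$. We 
say that $\varphi$ is d-valid in $\mathcal{X}$ if $\varphi$ is d-true in every model based on $\mathcal{X}$. Finally, 
we say that $\varphi$ is d-valid in a class of topological spaces if $\varphi$ is d-valid in every 
member of the class. 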


EXAMPLE 5.59 

1 We show that $(p \wedge \Box p) \to \Box\Box p$ is d-valid in Top. Let $\mathcal{X} \in$ Top, 
$M = (\mathcal{X}, v)$ be a topo-model, and $x \models_d p \wedge \Box p$ for $x \in X$. Then 
$x \models_d p$ and there exists an open neighborhood U of x such that $y \models_d p$ 
for each $y \in U - \{x\}$. Therefore, $y \models_d p$ for each $y \in U$. But then 
$y \models_d \Box p$ for each $y \in U$, implying that $x \models_d \Box\Box p$. 

2 That $\Box(p \to q) \to (\Box p \to \Box q)$ is d-valid in Top and that the necessitation 
rule preserves d-validity can be proved similarly to Example 5.19. 























DEFINITION 5.60 Let wK4 denote the modal logic K + $(p \wedge \Box p) \to \Box\Box p$. 
Obviously wK4 is weaker than K4, and we call wK4 weak K4. 
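The gap between wK4 and K4 is already visible on a two-point space. Under d-semantics the box is interpreted as the co-derivative t, so the wK4 axiom corresponds to A ∩ t(A) ⊆ t(t(A)) and the K4 axiom to t(A) ⊆ t(t(A)). The Python sketch below is an illustration only; the function names and example spaces are assumptions introduced here.

```python
from itertools import chain, combinations

def powerset(points):
    """All subsets of a finite set, as frozensets."""
    pts = list(points)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

def co_derivative(A, X, opens):
    """t(A): points having an open neighbourhood U with U inside A + {x}."""
    return frozenset(x for x in X
                     if any(x in U and U <= A | {x} for U in opens))

def d_validates_wk4_axiom(X, opens):
    """(p and box p) -> box box p, i.e. A & t(A) <= t(t(A)) for all A."""
    return all(A & co_derivative(A, X, opens)
               <= co_derivative(co_derivative(A, X, opens), X, opens)
               for A in powerset(X))

def d_validates_k4_axiom(X, opens):
    """box p -> box box p, i.e. t(A) <= t(t(A)) for all A."""
    return all(co_derivative(A, X, opens)
               <= co_derivative(co_derivative(A, X, opens), X, opens)
               for A in powerset(X))

TWO_POINTS = frozenset({0, 1})
INDISCRETE_OPENS = [frozenset(), TWO_POINTS]
SIERPINSKI_OPENS = [frozenset(), frozenset({1}), TWO_POINTS]
```

On the indiscrete two-point space t({0}) = {1} and t({1}) = {0}, so the K4 axiom fails for A = {0} while the wK4 axiom still holds; on the Sierpinski space both hold.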


It follows that the modal logic wK4 is sound with respect to d-semantics. 
In fact, as shown in Esakia, 2001, wK4 is also complete with respect to d- 
semantics. Below we discuss the connection between relational semantics of 
wK4 and d-semantics; then we show that wK4 is the d-logic of all topological 
spaces; after that we determine the connection between S4 and wK4; finally, 
we discuss stronger spatial logics over wK4. Most results in this section are 
taken from Esakia, 2001, Esakia, 2004, Bezhanishvili et al., 2005, Shehtman, 
1990 and Shehtman, 2006. 


3.1.1 Weak K4. 


DEFINITION 5.61 Let $\mathfrak{F} = (W, R)$ be a frame. We call $\mathfrak{F}$ weakly transitive if 
$\forall w, v, u \in W\ (wRv \ \&\ vRu \ \&\ w \neq u \to wRu)$. 


It is known that wK4 is sound and complete with respect to the class of all 
weakly transitive frames. In fact, wK4 has the finite model property (Esakia, 
2001). Because of this, we will sometimes refer to weakly transitive frames 
as wK4-frames. We call a weakly transitive frame $\mathfrak{F} = (W, R)$ rooted if there 
exists $r \in W$, called a root of $\mathfrak{F}$, such that rRw for every $w \neq r$. 


THEOREM 5.62 (ESAKIA, 2001) wK4 is complete with respect to finite 
rooted irreflexive wK4-frames. 


Proof (Sketch) If wK4 $\nvdash \varphi$, then there exists a finite wK4-model $M = (\mathfrak{F}, v)$ 
refuting $\varphi$. Now we construct a finite irreflexive wK4-frame $\mathfrak{G}$ such that $\mathfrak{F}$ is a 
p-morphic image of $\mathfrak{G}$. For $\mathfrak{F} = (X, R)$ let Y be obtained from X by replacing 
every reflexive point x of X by a two point set $\{x_1, x_2\}$ disjoint from X. 
Define S on Y by $x_1 S x_2$ and $x_2 S x_1$, $y S x_1$ and $y S x_2$ for each yRx, and $x_1 S z$ 
and $x_2 S z$ for each xRz. It is easy to see that $\mathfrak{G} = (Y, S)$ is a finite irreflexive weakly 
transitive frame. Define $f : Y \to X$ by $f(x_1) = f(x_2) = x$ if $x \in X$ is 
reflexive, and $f(x) = x$ if $x \in X$ is irreflexive. Then it is routine to check that 
f is an onto p-morphism. Now define $\xi$ on $\mathfrak{G}$ by $\xi(p) = f^{-1}(v(p))$. Then the 
model $N = (\mathfrak{G}, \xi)$ is bisimilar to M. Therefore, N also refutes $\varphi$. QED 
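The doubling of reflexive points in this proof sketch can be tried out on small frames. The following Python sketch is a hedged illustration: the encoding of copies as pairs, the function names, and the compact definition of the lifted relation (two distinct copies are related iff their originals are R-related) are assumptions made here to summarize the construction, not the authors' formulation.

```python
from itertools import product

def is_weakly_transitive(rel):
    """wRv and vRu and w != u imply wRu."""
    return all((w, u) in rel
               for (w, v) in rel for (v2, u) in rel
               if v == v2 and w != u)

def double_reflexive_points(points, rel):
    """Replace each reflexive x by two copies (x, 1), (x, 2); keep (x, 0) otherwise.

    Two distinct copies are S-related iff their originals are R-related.
    """
    copies = []
    for x in points:
        copies += [(x, 1), (x, 2)] if (x, x) in rel else [(x, 0)]
    lifted = {(a, b) for a, b in product(copies, repeat=2)
              if a != b and (a[0], b[0]) in rel}
    return copies, lifted

def is_onto_p_morphism(copies, lifted, points, rel):
    """Check that f(x, i) = x is onto and satisfies the forth and back conditions."""
    f = lambda a: a[0]
    onto = {f(a) for a in copies} == set(points)
    forth = all((f(a), f(b)) in rel for a, b in lifted)
    back = all(any(f(b) == w and (a, b) in lifted for b in copies)
               for a in copies for w in points if (f(a), w) in rel)
    return onto and forth and back

# Illustrative frame: a reflexive point 0 that sees an irreflexive point 1.
POINTS = [0, 1]
REL = {(0, 0), (0, 1)}
COPIES, LIFTED = double_reflexive_points(POINTS, REL)
```

For the example frame the result has three points, and the two copies of 0 see each other and the single copy of 1.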


DEFINITION 5.63 Let $\mathfrak{F} = (X, R)$ be a wK4-frame. We denote by $\overline{\mathfrak{F}} = 
(X, \overline{R})$ the reflexive closure of $\mathfrak{F}$ (that is, $\overline{R}$ is obtained from R by adding all 
reflexive loops), and by $\underline{\mathfrak{F}} = (X, \underline{R})$ the irreflexive fragment of $\mathfrak{F}$ (that is, $\underline{R}$ is 
obtained from R by deleting all reflexive loops). 


For a wK4-frame $\mathfrak{F}$, it is obvious that $\overline{\mathfrak{F}}$ is an S4-frame, and that $\underline{\mathfrak{F}}$ is an 
irreflexive wK4-frame. Moreover, every wK4-frame is obtained either from 
an S4-frame by deleting some reflexive loops or from an irreflexive wK4-frame 
by adding some reflexive loops. 

Given a wK4-frame $\mathfrak{F}$, we view $\overline{\mathfrak{F}}$ as an Alexandroff space. For $A \subseteq X$ we 
denote the derivative of A in $\overline{\mathfrak{F}}$ by $d_R(A)$. The next series of results is taken 
from Esakia, 2001. 


LEMMA 5.64 Let $\mathfrak{F}$ be a wK4-frame and $A \subseteq X$. In $\overline{\mathfrak{F}}$ we have $d_R(A) = 
\underline{R}^{-1}(A)$. 

Proof We observe that 
$x \in d_R(A)$ iff for each open neighborhood U of x we have $U \cap (A - \{x\}) \neq \emptyset$ 
iff $\overline{R}(x) \cap (A - \{x\}) \neq \emptyset$ 
iff $\underline{R}(x) \cap A \neq \emptyset$ 
iff $x \in \underline{R}^{-1}(A)$. 
The result follows. QED 
Now suppose $\mathcal{X}$ is a topological space. We define $R_d$ on X by $x R_d y$ iff 
$x \in d(y)$. 


LEMMA 5.65 $(X, R_d)$ is an irreflexive wK4-frame. 


Proof That $R_d$ is irreflexive follows from $x \notin d(x)$. To see that $R_d$ is weakly 
transitive suppose $x R_d y$, $y R_d z$, and $x \neq z$. Then $x \in d(y)$, $y \in d(z)$, and 
$x \neq z$. From $x \in d(y)$ it follows that for each open neighborhood U of x we 
have $y \in U - \{x\}$; from $y \in d(z)$ it follows that for each open neighborhood 
V of y we have $z \in V - \{y\}$; and from $x \neq z$ it follows that for each open 
neighborhood U of x we have $z \in U - \{x, y\} \subseteq U - \{x\}$. Thus, $x \in d(z)$, 
and so $x R_d z$. QED 


LEMMA 5.66 

1 If $\mathfrak{F}$ is a wK4-frame, then $R_{d_R} \subseteq R$. 

2 If $\mathfrak{F}$ is an irreflexive wK4-frame, then $R_{d_R} = R$. 

3 If $\mathcal{X}$ is a topological space, then $R_d^{-1}(A) \subseteq d(A)$. 

4 If $\mathcal{X}$ is an Alexandroff space, then $R_d^{-1}(A) = d(A)$. 


Proof (1) In a wK4-frame $\mathfrak{F}$, $x R_{d_R} y \Rightarrow x \in d_R(y) \Rightarrow x \in \underline{R}^{-1}(y) \Rightarrow xRy$. 

(2) Suppose $\mathfrak{F}$ is an irreflexive wK4-frame. Then $x R_{d_R} y \Leftrightarrow x \in d_R(y) \Leftrightarrow 
x \in \underline{R}^{-1}(y) \Leftrightarrow x \in R^{-1}(y) \Leftrightarrow xRy$. 

(3) Suppose $\mathcal{X}$ is a topological space. Then $x \in R_d^{-1}(A) \Rightarrow (\exists y)(x R_d y \ \&\ 
y \in A) \Rightarrow (\exists y)(x \in d(y) \ \&\ d(y) \subseteq d(A)) \Rightarrow x \in d(A)$. 

(4) Suppose $\mathcal{X}$ is an Alexandroff space. Then $x \in d(A) \Rightarrow R_d(x) \cap (A - 
\{x\}) \neq \emptyset \Rightarrow R_d(x) \cap A \neq \emptyset \Rightarrow x \in R_d^{-1}(A)$. QED 
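Lemmas 5.65 and 5.66(4) can be checked mechanically on a finite space, since every finite space is Alexandroff. The Python sketch below is an illustration: the function names and the three-point example are assumptions made here. It computes $R_d$ from the derivative and verifies irreflexivity, weak transitivity, and $R_d^{-1}(A) = d(A)$ for all A.

```python
from itertools import chain, combinations

def powerset(points):
    """All subsets of a finite set, as frozensets."""
    pts = list(points)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

def derivative(A, X, opens):
    """d(A): x is in d(A) iff every open neighbourhood U of x meets A - {x}."""
    return frozenset(x for x in X
                     if all((U - {x}) & A for U in opens if x in U))

def derived_relation(X, opens):
    """R_d: x R_d y iff x is in d({y})."""
    return {(x, y) for y in X for x in derivative(frozenset({y}), X, opens)}

def is_weakly_transitive(rel):
    """wRv and vRu and w != u imply wRu."""
    return all((w, u) in rel
               for (w, v) in rel for (v2, u) in rel
               if v == v2 and w != u)

def preimage(rel, A):
    """R^{-1}(A): points related to some member of A."""
    return frozenset(x for (x, y) in rel if y in A)

# Illustrative space: 0 and 1 are indistinguishable, 2 lies in their closure.
X = frozenset({0, 1, 2})
OPENS = [frozenset(), frozenset({0, 1}), X]
R_D = derived_relation(X, OPENS)
```

Here $R_d$ = {(1, 0), (0, 1), (2, 0), (2, 1)}: it is weakly transitive but not transitive, because 0 and 1 see each other without seeing themselves.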
COROLLARY 5.67 For a nonempty set X, there is a 1-1 correspondence 
between: 


(i) Alexandroff topologies on X; 
(ii) Reflexive and transitive relations on X; 
(iii) Irreflexive and weakly transitive relations on X. 


It follows that there is a 1-1 correspondence between Alexandroff spaces, 
S4-frames, and irreflexive wK4-frames. Now we are in a position to show that 
wK4 is the d-logic of all topological spaces. 


THEOREM 5.68 
1 wK4 is the d-logic of all topological spaces. 
2 wK4 is the d-logic of all finite topological spaces. 


3 wK4 has the effective finite model property with respect to the class of 
topological spaces. 


Proof Obviously both (1) and (3) follow from (2). To see (2), Theorem 5.62 
implies that wK4 is complete with respect to finite irreflexive wK4-frames. By 
Corollary 5.67, irreflexive wK4-frames correspond to topological spaces. The 
result follows. QED 


3.1.2 Connections between S4 and wK4. There is a close connection 
between S4 and wK4. For the set F of formulas of $\mathcal{L}$, we define a translation 
$tr : F \to F$ by induction (Boolos, 1993): 

• $tr(p) = p$; 

• $tr(\varphi \wedge \psi) = tr(\varphi) \wedge tr(\psi)$; 

• $tr(\neg\varphi) = \neg tr(\varphi)$; 

• $tr(\Box\varphi) = tr(\varphi) \wedge \Box tr(\varphi)$. 


DEFINITION 5.69 

1 Let L be a normal extension of wK4 and S be a normal extension of S4. 
We say that L and S are companions if $S \vdash \varphi$ iff $L \vdash tr(\varphi)$. 

2 For a normal extension L of wK4, let $T(L) = \{\varphi : L \vdash tr(\varphi)\}$. 


LEMMA 5.70 

1 T(L) is a normal extension of S4. 

2 T(L) is a unique companion of L. 


Proof (1) It is easy to verify that 

wK4 $\vdash tr(\Box p \to p),\ tr(\Box p \to \Box\Box p),\ tr(\Box(p \to q) \to (\Box p \to \Box q))$, 

and that T(L) is closed under MP and N. So, T(L) is a normal extension of S4. 
(2) Suppose S is a companion of L. Then $S \vdash \varphi \Leftrightarrow L \vdash tr(\varphi) \Leftrightarrow T(L) \vdash \varphi$. 
Therefore, S = T(L). QED 


On the other hand, a given normal extension S of S4 may have many different 
companions. For example, both wK4 and K4 (and all the normal logics in 
between) are companions of S4. 

Speaking in terms of relational semantics, if a normal extension L of wK4 
is characterized by a class K of frames, then T(L) is characterized by the class 
$\overline{K} = \{\overline{\mathfrak{F}} : \mathfrak{F} \in K\}$, where $\overline{\mathfrak{F}}$ denotes the reflexive closure of $\mathfrak{F}$. 

Now we turn to the topological significance of tr. For a class K of topological 
spaces, let $L_d(K)$ denote the set of formulas of $\mathcal{L}$ that are d-valid in K. Since 
wK4 is sound with respect to d-semantics, it is obvious that $L_d(K)$ is a normal 
extension of wK4. We call $L_d(K)$ the d-logic of the class K. The next two 
facts are taken from Bezhanishvili et al., 2005, Lemma 2.1 and Theorem 2.2. 


LEMMA 5.71 Let K be a class of topological spaces and $\mathcal{X} \in K$. 

1 $\mathcal{X} \models \varphi$ iff $\mathcal{X} \models_d tr(\varphi)$. 

2 $K \models \varphi$ iff $K \models_d tr(\varphi)$. 


THEOREM 5.72 If L is a d-logic, then T(L) is topologically complete. 


We point out that T(L) may be topologically complete without L being a 
d-logic. Thus, the converse of Theorem 5.72 is not true in general. Next we 
indicate several examples of topologically complete normal extensions of S4 
and wK4 that are each other's companions. 
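The role of the translation in Lemma 5.71 can be illustrated on a finite space. The Python sketch below makes several assumptions of its own: formulas are encoded as nested tuples, the box clause of the translation is taken to be tr(□φ) = tr(φ) ∧ □tr(φ), and the helper names and example space are hypothetical. It checks a pointwise version of Lemma 5.71(1), which is slightly stronger than the validity statement: the extension of φ under the interior semantics equals the extension of tr(φ) under d-semantics.

```python
from itertools import chain, combinations

def powerset(points):
    """All subsets of a finite set, as frozensets."""
    pts = list(points)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

def interior(A, X, opens):
    """Box as interior: x has an open neighbourhood inside A."""
    return frozenset(x for x in X if any(x in U and U <= A for U in opens))

def co_derivative(A, X, opens):
    """Box as co-derivative: x has an open neighbourhood inside A + {x}."""
    return frozenset(x for x in X
                     if any(x in U and U <= A | {x} for U in opens))

def tr(phi):
    """Translation on tuple-encoded formulas: box phi becomes phi and box phi."""
    op = phi[0]
    if op == "var":
        return phi
    if op == "not":
        return ("not", tr(phi[1]))
    if op == "and":
        return ("and", tr(phi[1]), tr(phi[2]))
    if op == "box":
        inner = tr(phi[1])
        return ("and", inner, ("box", inner))
    raise ValueError(f"unknown connective: {op}")

def extension(phi, X, opens, val, box):
    """Set of points where phi holds, with the given reading of box."""
    op = phi[0]
    if op == "var":
        return val[phi[1]]
    if op == "not":
        return X - extension(phi[1], X, opens, val, box)
    if op == "and":
        return (extension(phi[1], X, opens, val, box)
                & extension(phi[2], X, opens, val, box))
    if op == "box":
        return box(extension(phi[1], X, opens, val, box), X, opens)
    raise ValueError(f"unknown connective: {op}")

def translation_agrees(phi, X, opens):
    """Compare phi (interior semantics) with tr(phi) (d-semantics) for all v(p)."""
    return all(extension(phi, X, opens, {"p": A}, interior)
               == extension(tr(phi), X, opens, {"p": A}, co_derivative)
               for A in powerset(X))

X = frozenset({0, 1, 2})
OPENS = [frozenset(), frozenset({0, 1}), X]
P = ("var", "p")
FORMULAS = [
    ("box", P),
    ("not", ("box", ("not", ("box", P)))),           # diamond box p
    ("box", ("not", ("and", P, ("not", ("box", P))))),  # box (p -> box p)
]
```

The check rests on the identity Int(A) = A ∩ t(A), which is exactly what the box clause of the translation encodes.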


3.1.3 The landscape of spatial logics over wK4. The landscape of 
spatial logics over wK4 has been investigated less vigorously than that of spa- 
tial logics over S4. Nevertheless, there are several interesting results in this 
direction that we list below. 

As we have already seen, wK4 is the logic of all topological spaces when 
interpreting $\Diamond$ as derivative. In addition, we will see that K4 is the logic of all 
$T_d$-spaces, that KD4 is the logic of the Cantor space $\mathbb{C}$, the rational line $\mathbb{Q}$, or 
more generally, any 0-dimensional dense-in-itself metric separable space, that 
the d-logic of $\mathbb{R}$ is KD4G$_2$, and that the d-logic of each $\mathbb{R}^n$, for $n \geq 2$, is 
KD4G$_1$. 

We say that a class K of topological spaces is d-definable if there exists a set 
of modal formulas $\Gamma$ such that for each topological space $\mathcal{X}$ we have $\mathcal{X} \in K$ 
iff $\mathcal{X} \models_d \Gamma$. Since the derivative operator of a topological space is more 
expressive than the closure operator, topo-definability results will automatically 
transfer into d-definability results. However, there are d-definable topological 
properties that are not topo-definable. For example, the class of all $T_d$-spaces is 
not topo-definable. On the other hand, $\Box p \to \Box\Box p$ d-defines it. Also, the class 
of dense-in-itself spaces is not topo-definable, but it is d-definable by $\Diamond\top$ (or 
equivalently by $\Box p \to \Diamond p$). Below we present several results in this direction. 





K4 and $T_d$-spaces: 

PROPOSITION 5.73 (ESAKIA, 2001) K4 is the d-logic of all $T_d$-spaces. 


Proof Since $\mathcal{X}$ is a $T_d$-space iff $dd(A) \subseteq d(A)$ for each $A \subseteq X$, we obtain 
that K4 is sound with respect to the class of $T_d$-spaces. To see completeness, 
recall that K4 is complete with respect to the class of all (not necessarily finite) 
irreflexive K4-frames (see, e.g., Chagrov and Zakharyaschev, 1997, p. 102, 
Exercise 3.11). Since each one of these corresponds to a $T_d$-space, the result 
follows. QED 


Another way to obtain d-completeness of K4 is to construct a canonical 
topological model for K4 similar to the one constructed in Sec. 2.4.2 (Steinsvold, 
2005). 


The d-logics of $\mathbb{C}$ and $\mathbb{Q}$: Let 

KD4 = K4 + $\Diamond\top$. 

For a topological space $\mathcal{X}$ we have $\mathcal{X} \models_d \Diamond\top$ iff $d(X) = X$ iff $\mathcal{X}$ is 
dense-in-itself. Consequently, since both $\mathbb{C}$ and $\mathbb{Q}$ are dense-in-itself $T_d$-spaces, we 
have that $\mathbb{C}, \mathbb{Q} \models_d$ KD4. Moreover, as shown in Shehtman, 1990, Theorem 
29, KD4 is the d-logic of any 0-dimensional dense-in-itself metric separable 
space. As an immediate consequence we obtain that KD4 = $L_d(\mathbb{C}) = L_d(\mathbb{Q})$. 


The d-logics of Euclidean spaces: The situation here is different from that of 
S4. Indeed, we have that the d-logic of $\mathbb{C}$ and $\mathbb{Q}$ is different from the d-logic 
of $\mathbb{R}$, and that the d-logic of $\mathbb{R}$ is different from the d-logic of $\mathbb{R}^n$ for each 
$n \geq 2$. Let $G_1$ denote the formula 


$(\Diamond p \wedge \Diamond\neg p) \to \Diamond((p \wedge \Diamond\neg p) \vee (\neg p \wedge \Diamond p))$, 


and KD4G$_1$ denote the logic obtained from KD4 by postulating the formula 
$G_1$. Also let $Q_1 = p_1 \wedge \neg p_2 \wedge \neg p_3$, $Q_2 = \neg p_1 \wedge p_2 \wedge \neg p_3$, $Q_3 = \neg p_1 \wedge \neg p_2 \wedge p_3$, 
and $G_2$ denote the formula 


$\Box((Q_1 \wedge \Box Q_1) \vee (Q_2 \wedge \Box Q_2) \vee (Q_3 \wedge \Box Q_3)) \to (\Box\neg Q_1 \vee \Box\neg Q_2 \vee \Box\neg Q_3)$. 


We let KD4G$_2$ denote the logic obtained from KD4 by postulating the formula 
$G_2$. Then it follows from Shehtman, 1990, Shehtman, 2006 that KD4G$_1$ = 
$L_d(\mathbb{R}^n)$ for each $n \geq 2$, and that KD4G$_2$ = $L_d(\mathbb{R})$. 


GL and scattered spaces: Recall that the Gödel-Löb provability logic GL is 
obtained by adding to K the following Löb formula $\Box(\Box p \to p) \to \Box p$. As 
we already saw in Sec. 2.6, the class of scattered spaces is not topo-definable. 
On the other hand, as shown in Esakia, 1981, $\mathcal{X} \models_d \Box(\Box p \to p) \to \Box p$ iff 
$\mathcal{X}$ is scattered. Therefore, $\Box(\Box p \to p) \to \Box p$ d-defines the class of scattered 
spaces. Now as GL is the d-logic of ordinal spaces (Abashidze, 1987; Blass, 
1990), we obtain that GL is the d-logic of both scattered spaces and ordinal 
spaces. Since the class of ordinal spaces is a proper subclass of the class of 
scattered spaces, it follows that the class of ordinal spaces is neither d-definable 
nor topo-definable. 
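Esakia's characterization can be confirmed exhaustively on very small spaces. The Python sketch below is illustrative only: the function names and the brute-force enumeration of all topologies on a three-point set are assumptions made here. It compares d-validity of the Löb formula, read as t((X − t(A)) ∪ A) ⊆ t(A) for all A, with scatteredness.

```python
from itertools import chain, combinations

def powerset(items):
    """All subsets of a finite collection, as frozensets."""
    pts = list(items)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

def all_topologies(X):
    """Every family of subsets of X that contains {} and X and is closed
    under pairwise unions and intersections (enough for finite X)."""
    for family in powerset(powerset(X)):
        if (frozenset() in family and X in family
                and all(U | V in family and U & V in family
                        for U in family for V in family)):
            yield family

def co_derivative(A, X, opens):
    """t(A): points having an open neighbourhood U with U inside A + {x}."""
    return frozenset(x for x in X
                     if any(x in U and U <= A | {x} for U in opens))

def d_validates_loeb(X, opens):
    """box(box p -> p) -> box p under d-semantics, for every value A of p."""
    return all(
        co_derivative((X - co_derivative(A, X, opens)) | A, X, opens)
        <= co_derivative(A, X, opens)
        for A in powerset(X))

def is_scattered(X, opens):
    """Every nonempty subspace S has a point isolated in S."""
    return all(any(any(U & S == frozenset({x}) for U in opens) for x in S)
               for S in powerset(X) if S)

X = frozenset({0, 1, 2})
TOPOLOGIES = list(all_topologies(X))
AGREEMENT = all(d_validates_loeb(X, T) == is_scattered(X, T) for T in TOPOLOGIES)
```

On three points there are 29 topologies, and in this sketch the two properties coincide on every one of them; for instance both fail on the indiscrete topology and both hold on the discrete one.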



K4.Grz and hereditarily irresolvable spaces: As we already saw in Sec. 2.6, 
$\Box(\Box(p \to \Box p) \to p) \to \Box p$ topo-defines the class of hereditarily irresolvable 
spaces. Consequently, the class of hereditarily irresolvable spaces is also 
d-definable. Interestingly enough, the same axiom d-defines the class of hereditarily 
irresolvable spaces (Esakia, 2002). Moreover, K4.Grz is the d-logic 
of hereditarily irresolvable spaces (Gabelaia, 2004). Both GL and K4.Grz 
are companions of S4.Grz, but as shown in Esakia, 2002, K4.Grz is the least 
companion of S4.Grz. 



Further results on d-definability and d-completeness can be found in 
Bezhanishvili et al., 2005. 


3.2 Product logics 


Throughout the 1990’s, combinations of logics and constructions merging 
their models have come up in practice, and also as a new theme for mathematical 
theory. In particular, products of relational models have been studied exten- 
sively in Gabbay and Shehtman, 1998 for their uses in combining information 
along different dimensions, and the behavior of the matching modal logics is 
well-known (Gabbay et al., 2003). This section is about products of topological 
spaces as a generalization of this methodology. In particular, we describe two 
‘horizontal’ and ‘vertical’ topologies along with the standard product topology. 
For each of the three topologies on the product we introduce a modal box in 
our language and give axiomatizations of the resulting logics. The material 
presented here is taken from van Benthem et al., 2005. 


3.2.1 Products of topologies. Let $\mathcal{X} = (X, \eta)$ and $\mathcal{Y} = (Y, \theta)$ be two 
topological spaces. Recall that the standard product topology $\tau$ on $X \times Y$ is 
defined by letting the sets $U \times V$ form a basis for $\tau$, where U is open in $\mathcal{X}$ 
and V is open in $\mathcal{Y}$. We define two additional one-dimensional topologies on 
$X \times Y$ by ‘lifting’ the topologies of the components. 


DEFINITION 5.74 Suppose $A \subseteq X \times Y$. We say that A is horizontally open 
(H-open) if for any $(x, y) \in A$ there exists $U \in \eta$ such that $x \in U$ and 
$U \times \{y\} \subseteq A$. Similarly, we say that A is vertically open (V-open) if for any 
$(x, y) \in A$ there exists $V \in \theta$ such that $y \in V$ and $\{x\} \times V \subseteq A$. If A is both 
H- and V-open, then we call it HV-open. 


The H-closed, V-closed and HV-closed sets are defined similarly. Let $\tau_1$ 
denote the set of all H-open subsets of $X \times Y$ and $\tau_2$ denote the set of all 
V-open subsets of $X \times Y$. It is easy to verify that both $\tau_1$ and $\tau_2$ form topologies 
on $X \times Y$. 


DEFINITION 5.75 We call $\tau_1$ the horizontal topology and $\tau_2$ the vertical 
topology. 


REMARK 5.76 A set open in the standard product topology is both horizontally 
and vertically open. That is, $\tau \subseteq \tau_1$ and $\tau \subseteq \tau_2$. However, the converse 
inclusions do not hold in general. 
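Remark 5.76 can be seen concretely on a product of two finite spaces. The Python sketch below is an illustration with assumptions made here: the function names are hypothetical, and both factors are taken to be the Sierpinski space. It enumerates the product-open, H-open and V-open subsets directly from the definitions.

```python
from itertools import chain, combinations, product

def powerset(points):
    """All subsets of a finite set, as frozensets."""
    pts = list(points)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

def is_h_open(A, eta):
    """Each (x, y) in A has some U in eta with x in U and U x {y} inside A."""
    return all(any(x in U and all((u, y) in A for u in U) for U in eta)
               for (x, y) in A)

def is_v_open(A, theta):
    """Each (x, y) in A has some V in theta with y in V and {x} x V inside A."""
    return all(any(y in V and all((x, w) in A for w in V) for V in theta)
               for (x, y) in A)

def is_product_open(A, eta, theta):
    """Each (x, y) in A lies in some basic box U x V contained in A."""
    return all(any(x in U and y in V
                   and all((u, w) in A for u in U for w in V)
                   for U in eta for V in theta)
               for (x, y) in A)

# Both factors: the Sierpinski space on {0, 1} with open sets {}, {1}, {0, 1}.
S = frozenset({0, 1})
OPENS = [frozenset(), frozenset({1}), S]
SUBSETS = powerset(product(S, S))

TAU = {A for A in SUBSETS if is_product_open(A, OPENS, OPENS)}
TAU1 = {A for A in SUBSETS if is_h_open(A, OPENS)}
TAU2 = {A for A in SUBSETS if is_v_open(A, OPENS)}
```

In this example there are 6 product-open sets but 9 H-open and 9 V-open sets. The singleton {(1, 0)} is H-open without being open in the product topology, so the inclusion of the product topology in the horizontal one is proper here.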



The interpretation of the modal operators $\Box_1$ and $\Box_2$ of $\mathcal{L}_{\Box_1\Box_2}$ in $(X \times 
Y, \tau_1, \tau_2)$ is as expected: 


$(x, y) \models \Box_1\varphi$ iff $(\exists U \in \tau_1)((x, y) \in U$ and $\forall (x', y') \in U\ (x', y') \models \varphi)$ 

$(x, y) \models \Box_2\varphi$ iff $(\exists V \in \tau_2)((x, y) \in V$ and $\forall (x', y') \in V\ (x', y') \models \varphi)$. 


The modalities $\Diamond_1$ and $\Diamond_2$ are defined dually. Furthermore, all the usual notions 
such as satisfiability and validity generalize naturally to this new language. 

There are some similarities and differences between products of frames and of 
topological spaces. To see the similarities, let $\mathfrak{F} = (W, R)$ and $\mathfrak{F}' = (W', R')$ 
be S4-frames, and let $\mathfrak{F} \times \mathfrak{F}' = (W \times W', R_1, R_2)$ be their product. Then 
$\tau_{R_1}$ and $\tau_{R_2}$ are precisely the horizontal and vertical topologies on the product 
space $W \times W'$. This shows that our topological product construction is a faithful 
generalization of the usual product construction for frames. Thus, whenever 
topological spaces $\mathcal{X}$ and $\mathcal{Y}$ are representable as S4-frames (are Alexandroff), 
then the horizontal and vertical topologies on their product $X \times Y$ can be defined 
from the horizontal and vertical relations on the product of these frames. In 
other words, our topological setting generalizes the case for products of frames. 
To see the differences, we point out that both com and chr, while valid on 
products of frames, can be refuted on topological products. Below we exhibit 
their failure on $\mathbb{R} \times \mathbb{R}$. 

(a) Failure of com: Let 

$v(p) = \Big(\bigcup_{x \in (-1,0)} \{x\} \times (x, -x)\Big) \cup \big(\{0\} \times (-1, 1)\big) \cup \Big(\bigcup_{x \in (0,1)} \{x\} \times (-x, x)\Big)$ 

(see Fig. 5.16a). Then there is a basic horizontal open $(-1, 1) \times \{0\}$ such that 
(0, 0) is in it and every point in $(-1, 1) \times \{0\}$ sits in a vertically open subset of 
v(p). Thus, $\Box_1\Box_2 p$ is true at (0, 0). On the other hand, there is no vertical open 
containing (0, 0) in which every point sits inside a horizontally open subset of 
v(p), implying that $\Box_2\Box_1 p$ is false at (0, 0). 

(b) Failure of chr: Let $v(p) = \bigcup\{\{\frac{1}{n}\} \times (-\frac{1}{n}, \frac{1}{n}) : n \geq 1\}$ (see Fig. 
5.16b). Then in any basic horizontal open around (0, 0) there is a point that sits 
in a basic vertical open in which p is true everywhere. Thus, $\Diamond_1\Box_2 p$ is true at 
(0, 0). On the other hand, since the horizontal closure of v(p) is $v(p) \cup \{(0, 0)\}$ 
and since the vertical interior of $v(p) \cup \{(0, 0)\}$ is v(p), we have that $\Box_2\Diamond_1 p$ 
is false at (0, 0). 

These counterexamples on JR x JR are not accidental. van Benthem et al., 
2005, Sec. 4 shows when they can be reproduced in products of arbitrary topo- 
logical spaces. 



Figure 5.16. Counterexamples of com and chr on $\mathbb{R} \times \mathbb{R}$. 


3.2.2 Completeness for products. Our main goal here is to show that 
the logic of all products of topological spaces is S4 $\oplus$ S4. In fact, we show 
that S4 $\oplus$ S4 is the logic of $\mathbb{Q} \times \mathbb{Q}$. 


THEOREM 5.77 S4 $\oplus$ S4 is the logic of $\mathbb{Q} \times \mathbb{Q}$. 


Proof As follows from Proposition 5.17, S4 $\oplus$ S4 is complete with respect to 
the infinite quaternary tree $T_{2,2} = (W, R_1, R_2)$. We view $T_{2,2}$ as equipped with 
two Alexandroff topologies defined from $R_1$ and $R_2$. To prove completeness 
of S4 $\oplus$ S4 with respect to $\mathbb{Q} \times \mathbb{Q}$ we take the X constructed in the proof of 
Theorem 5.36, define recursively an HV-open subspace Y of $X \times X$ and an 
interior map g from Y onto $T_{2,2}$ with respect to both topologies: this allows us 
to transfer counterexamples from $T_{2,2}$ to Y, then from Y to $X \times X$, and finally 
from $X \times X$ to $\mathbb{Q} \times \mathbb{Q}$. 

Let $Y = \bigcup_{n \in \omega} Y_n$ where $Y_0 = \{(0, 0)\}$ and 

$Y_{n+1} = Y_n \cup \{(x - \frac{1}{3^n}, y),\ (x + \frac{1}{3^n}, y),\ (x, y - \frac{1}{3^n}),\ (x, y + \frac{1}{3^n}) : (x, y) \in Y_n\}$. 


CLAIM 5.78 Y is an HV-open subspace of $X \times X$. 





Proof Let (x,y) € Y. Then x € (x — yz, + gaz) C X. Therefore, 
(x,y) € (£ — ste, £ + gaz) x {y} C Y. Thus, Y is an H-open subspace of 
X x X. That Y is a V-open subspace of X x X is proved symmetrically. QED 





A similar argument to that used in Theorem 5.36 shows that for each $(x, y) \in 
Y$ such that $(x, y) \neq (0, 0)$ there exists $n_{(x,y)}$ with $(x, y) \in Y_{n_{(x,y)}}$ and $(x, y) \notin 
Y_{n_{(x,y)}-1}$, and that there is a unique $(u, v) \in Y_{n_{(x,y)}-1}$ such that $(x, y) = 
(u \pm \frac{1}{3^{n_{(x,y)}-1}}, v)$ or $(x, y) = (u, v \pm \frac{1}{3^{n_{(x,y)}-1}})$. 

We define g from Y onto $T_{2,2}$ by recursion (cf. Fig. 5.17): If $(x, y) = (0, 0)$ 
then we let g(0, 0) be the root r of $T_{2,2}$; if $(x, y) \neq (0, 0)$ then $(x, y) = 
(u \pm \frac{1}{3^{n_{(x,y)}-1}}, v)$ or $(x, y) = (u, v \pm \frac{1}{3^{n_{(x,y)}-1}})$ for a unique $(u, v) \in Y_{n_{(x,y)}-1}$, 
and we let g(x, y) be 

• the immediate left $R_1$-successor of g(u, v) if $(x, y) = (u - \frac{1}{3^{n_{(x,y)}-1}}, v)$; 

• the immediate right $R_1$-successor of g(u, v) if $(x, y) = (u + \frac{1}{3^{n_{(x,y)}-1}}, v)$; 

• the immediate left $R_2$-successor of g(u, v) if $(x, y) = (u, v - \frac{1}{3^{n_{(x,y)}-1}})$; 

• the immediate right $R_2$-successor of g(u, v) if $(x, y) = (u, v + \frac{1}{3^{n_{(x,y)}-1}})$. 


Figure 5.17. The first stages of the labelling in the completeness proof for S4 $\oplus$ S4: (0, 0) is 
labelled by the root r of $T_{2,2}$, (−1, 0) is labelled by the immediate left $R_1$-successor of r, (1, 0) 
is labelled by the immediate right $R_1$-successor of r, (0, −1) is labelled by the immediate left 
$R_2$-successor of r, (0, 1) is labelled by the immediate right $R_2$-successor of r, and so on. 


CLAIM 5.79 g is an interior map with respect to both topologies. 


Proof Let $\tau_1$ and $\tau_2$ denote the restrictions of the horizontal and vertical 
topologies of $X \times X$ to Y, respectively. We prove that g is an interior map 
with respect to $\tau_1$. That g is interior with respect to $\tau_2$ is proved symmetrically. 
We observe that 


1 1 
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forms a basis for rı. We also recall that a basis for the Alexandroff topology 
on T> » defined from R; is By = {Bi hena where B} = {s € Tho : tRis}. 

To see that g is open, let (x SGD x24 Few ) x {y} be a basic open 
for mı. Then the same argument as that used in Claim 5.38 guarantees that 
g(x ea) £4 ew ) x {y} = Bii „y Thus, g is open. To see that g is 








continuous, it suffices to show that for each t € T> 9, the g-inverse image of B} 
belongs to 7. Let (x,y) € g-!(B}). Then tRi g(x, y). So g((£ — Paw £+ 
Pu) x {y}) = B C Bł}. Thus, there exists an open neighborhood 
U = (x en ac Pan ) x {y} of (x,y) such that U C g~!(B}), implying 
that g is continuous. QED 





To complete the proof, if S4 $\oplus$ S4 $\nvdash \varphi$, then there is a valuation v on $T_{2,2}$ 
such that $(T_{2,2}, v), r \not\models \varphi$. Define a valuation $\xi$ on Y by $\xi(p) = g^{-1}(v(p))$. 
Since g is an interior map with respect to both topologies and g(0, 0) = r, we 
have that $(Y, \xi), (0, 0) \not\models \varphi$. Now since Y is an HV-open subset of $X \times X$, 
we obtain that $\varphi$ is refutable on $X \times X$. Finally, Theorem 5.36 implies that X 
is homeomorphic to $\mathbb{Q}$. Therefore, $X \times X$ is both horizontally and vertically 
homeomorphic to $\mathbb{Q} \times \mathbb{Q}$, and hence $\varphi$ is also refutable on $\mathbb{Q} \times \mathbb{Q}$. QED 








COROLLARY 5.80 S4 $\oplus$ S4 is the logic of products of arbitrary topologies. 


It follows that the logic of products of arbitrary topologies is decidable and 
has a PSPACE-complete satisfiability problem (Spaan, 1993). This stands 
in contrast to the satisfiability problem for S4 $\times$ S4, which turned out to be 
undecidable (Gabelaia et al., 2005). 


3.2.3 Adding standard product interior. So far we only focused on the 
horizontal and vertical topologies on the product space, by analogy to products 
of relational structures. However, unlike products of relational structures, the 
standard product topology is not definable in terms of the horizontal and vertical 
topologies (van Benthem et al., 2005, Sec. 3). Therefore, it is only natural 
to add an extra modal operator $\Box$ to the language $\mathcal{L}_{\Box_1\Box_2}$, with the intended 
interpretation as the interior operator of the standard product topology. 

For two topological spaces $\mathcal{X} = (X, \eta)$ and $\mathcal{Y} = (Y, \theta)$, we consider the 
product $(X \times Y, \tau, \tau_1, \tau_2)$ with three topologies: the standard product topology 
$\tau$, the horizontal topology $\tau_1$, and the vertical topology $\tau_2$. Then $\Box$ is interpreted 
as: 

$(x, y) \models \Box\varphi$ iff $\exists U \in \eta$ and $\exists V \in \theta$ such that $x \in U$, $y \in V$, and $U \times V \models \varphi$. 











Since τ ⊆ τ₁ ∩ τ₂, the modal principle


□p → □₁p ∧ □₂p


is valid in product spaces. Let TPL denote the logic in the new language
L_{□,□₁,□₂} that contains all the axioms of S4 ⊕ S4 ⊕ S4 plus the axiom
□p → □₁p ∧ □₂p. We call TPL the topological product logic. The main significance
of TPL is that in the language L_{□,□₁,□₂} it is the logic of products of arbitrary
topologies. This can be proved by generalizing the completeness proof of S4 ⊕ S4
with respect to ℚ × ℚ to this new case. As a result, we obtain that TPL is
complete with respect to ℚ × ℚ, hence is the logic of products of arbitrary
topologies (with horizontal, vertical, and standard product topologies). For the
details of the proof see van Benthem et al., 2005, Sec. 6.
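As a hedged illustration of the three interior operators, the following Python sketch evaluates □, □₁ and □₂ on the product of two copies of the two-point Sierpiński space and checks the principle □p → □₁p ∧ □₂p for every valuation of p. The choice of space and the names `box`, `box1`, `box2` are assumptions made for this example only; nothing here is taken from the cited completeness proof.

```python
from itertools import chain, combinations, product

def powerset(points):
    """All subsets of a finite collection, as frozensets."""
    pts = list(points)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

# Assumed example: two copies of the Sierpinski space on {0, 1}.
X = Y = (0, 1)
eta = theta = [frozenset(), frozenset({1}), frozenset({0, 1})]
points = list(product(X, Y))

def box(A):
    """Standard product interior: an open rectangle around the point lies in A."""
    return frozenset((x, y) for (x, y) in points
                     if any(x in U and y in V and
                            all((u, v) in A for u in U for v in V)
                            for U in eta for V in theta))

def box1(A):
    """Horizontal interior: some U in eta with x in U and U x {y} inside A."""
    return frozenset((x, y) for (x, y) in points
                     if any(x in U and all((u, y) in A for u in U) for U in eta))

def box2(A):
    """Vertical interior: some V in theta with y in V and {x} x V inside A."""
    return frozenset((x, y) for (x, y) in points
                     if any(y in V and all((x, v) in A for v in V) for V in theta))

valuations = powerset(points)
# The principle: box(A) is always contained in box1(A) and box2(A).
principle_holds = all(box(A) <= (box1(A) & box2(A)) for A in valuations)
# The converse implication is not valid: some A separates the two sides.
converse_fails = any(not (box1(A) & box2(A)) <= box(A) for A in valuations)
```

In this small space the converse fails, for instance, for the L-shaped set {(0,0), (1,0), (0,1)}: the corner (0,0) has a horizontal and a vertical neighborhood inside the set, but no open rectangle.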


3.3 Extended modal languages 


In modern modal logic, one conspicuous trend is the design of languages whose
expressive power matches the intended application, rather than merely working
with some formalism ‘because our forefathers did it’. All this is subject to a
balance: expressive power must be matched with low complexity, preferably
decidability (de Rijke, 1993; van Benthem, 1991b). For instance, to make global
assertions about a model, one adds a ‘universal modality’ Uφ saying that φ is
true in all worlds. The same move makes sense for space. Topological relations
not captured by the basic modal language can be safely expressed by adding
appropriate new modal operators. We have entered the realm of extended or
‘hybrid’ modal languages (Areces and ten Cate, 2006).


3.3.1 Universal modalities and global properties. The basic language 
L interpreted on topological spaces has a ‘local’ view of the world. A global 
perspective comes from the addition of the universal modality that expresses 
accessibility to any point (Goranko and Passy, 1992). Universal modalities 
were brought to the spatial reasoning community in Bennett, 1995. For this 
purpose one adds: 


M, x ⊨ Eφ iff ∃y ∈ X : M, y ⊨ φ
M, x ⊨ Uφ iff ∀y ∈ X : M, y ⊨ φ.











More systematically, the relevant new valid principles are those of S5:


(Dual) Ep ↔ ¬U¬p
(K) U(p → q) → (Up → Uq)
(T) Up → p
(4) Up → UUp
(B) p → UEp.


In addition, the following ‘connecting’ principle is part of the axioms:


◇p → Ep.


Using these principles, the enriched language L_u allows a straightforward
normal form:


PROPOSITION 5.81 Every formula of L_u is equivalent to one without nested
occurrences of E, U.
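The S5 principles and the connecting principle listed above can be checked mechanically on a small model. The sketch below is only an illustration under stated assumptions: the three-point space, its topology `OPENS`, and the helper names are hypothetical choices, and formulas are represented by their truth sets.

```python
from itertools import chain, combinations

def powerset(points):
    pts = sorted(points)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

X = frozenset("abc")
# Assumed topology: b is a closed point lying between the open points a and c.
OPENS = [frozenset(), frozenset("a"), frozenset("c"), frozenset("ac"), X]

def interior(A):            # truth set of box-phi
    return frozenset(x for x in X if any(x in O and O <= A for O in OPENS))

def closure(A):             # truth set of diamond-phi
    return X - interior(X - A)

def U(A):                   # universal modality: true everywhere or nowhere
    return X if A == X else frozenset()

def E(A):                   # existential modality
    return X if A else frozenset()

def implies(A, B):
    return (X - A) | B

SETS = powerset(X)
checks = {
    "Dual": all(E(P) == X - U(X - P) for P in SETS),
    "K": all(implies(U(implies(P, Q)), implies(U(P), U(Q))) == X
             for P in SETS for Q in SETS),
    "T": all(implies(U(P), P) == X for P in SETS),
    "4": all(implies(U(P), U(U(P))) == X for P in SETS),
    "B": all(implies(P, U(E(P))) == X for P in SETS),
    "Connecting": all(implies(closure(P), E(P)) == X for P in SETS),
}
```

Each entry of `checks` quantifies over all valuations of the proposition letters, so a value of `True` means the principle is valid on this particular space, not merely satisfied by one valuation.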


The definition of topo-bisimulation extends straightforwardly. It merely 
demands that topo-bisimulations be total relations. 


THEOREM 5.82 (AIELLO AND VAN BENTHEM, 2002A)


• Extended modal formulas in L_u are invariant under total topo-bisimulations.


• Finite L_u-modally equivalent models are totally topo-bisimilar.


In the topological setting, fragments of this language can also be relevant.
E.g., a continuous map has only one of the zig-zag clauses of topo-bisimulation.
Now, consider ‘existential’ modal formulas constructed using only atomic
formulas and their negations, ∧, ∨, □, E, and U.


COROLLARY 5.83 (AIELLO AND VAN BENTHEM, 2002A) Let the simulation ⇀
run from M to M' with x ⇀ x'. Then, for any existential modal
formula φ, M, x ⊨ φ only if M', x' ⊨ φ. In words, existential modal formulas
are preserved under simulations.


The language L_u is more expressive than the basic modal language L. Indeed,
as we already saw in Sec. 2.6, connectedness of a topological space is
not expressible in L. However, as shown in Shehtman, 1999, it is expressible
in L_u by the formula:


(5.1) U(◇p → □p) → Up ∨ U¬p.


In fact, it was shown in Shehtman, 1999 that (5.1) axiomatizes any connected
dense-in-itself separable metric space, which is a generalization of the
McKinsey-Tarski theorem. Another generalization is in Bezhanishvili and
Gehrke, 2005, where it is shown that (5.1) axiomatizes the Boolean combinations
of countable unions of convex subsets of the real line.
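To see formula (5.1) at work, the following sketch tests it on two finite spaces, one connected and one not. It is a minimal illustration, assuming hypothetical three-point topologies; `validates_5_1` quantifies over all valuations of p, and `is_connected` uses the usual clopen-set criterion.

```python
from itertools import chain, combinations

def subsets(points):
    pts = sorted(points)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

def interior(A, X, opens):
    return frozenset(x for x in X if any(x in O and O <= A for O in opens))

def closure(A, X, opens):
    return X - interior(X - A, X, opens)

def validates_5_1(X, opens):
    """True iff U(dia p -> box p) -> Up v U(not p) holds for every valuation of p."""
    for P in subsets(X):
        antecedent = closure(P, X, opens) <= interior(P, X, opens)  # U(dia p -> box p)
        consequent = (P == X) or (not P)                            # Up v U(not p)
        if antecedent and not consequent:
            return False
    return True

def is_connected(X, opens):
    """No open set other than the empty set and X has an open complement."""
    return not any(O and O != X and (X - O) in opens for O in opens)

X = frozenset("abc")
# Assumed examples: one connected and one disconnected topology on {a, b, c}.
connected_space = [frozenset(), frozenset("a"), frozenset("c"), frozenset("ac"), X]
split_space = [frozenset(), frozenset("a"), frozenset("bc"), X]
```

On the disconnected example the valuation p = {a} refutes (5.1): {a} is clopen, so ◇p → □p holds everywhere, yet p is neither globally true nor globally false.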

By encoding a fragment of the Region Connection Calculus (RCC5) (Randell
et al., 1992) in the language L_u, Bennett showed the power of the language in
expressing spatial arrangements of regions. The relevant elementary relations
between regions that one can express are those of parthood and connectedness.
The encoding is reported in Fig. 5.18, which is the basis for the appropriate
calculus in computer science and AI. For details, cf. Ch. 3.

RCC5      | L_u                        | Description
DC(A, B)  | ¬E(A ∧ B)                  | A is disconnected from B
EC(A, B)  | E(◇A ∧ ◇B) ∧ ¬E(□A ∧ □B)   | A and B are externally connected
P(A, B)   | U(A → B)                   | A is part of B
EQ(A, B)  | U(A ↔ B)                   | A and B are equal


Figure 5.18. Expressing RCC5 relations via L_u.


We have just given the first steps here of a much longer ladder. Much stronger
hybrid languages, all the way up to first-order languages for topological models
from abstract model theory, are found in ten Cate et al., 2006. Still richer
descriptions arise in modal predicate logics, which quantify over individuals
and refer to arbitrary predicates. Cf. Rasiowa and Sikorski, 1963; Moerdijk,
1982; and Awodey and Kishida, 2006 for such logics, which are complete for
topological and sheaf semantics.
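The encodings of Fig. 5.18 can be tried out directly on a finite topological model. The sketch below is illustrative only: the three-point space and the regions are assumptions, and the four functions simply transcribe the L_u formulas of the figure, with regions given as truth sets.

```python
X = frozenset("abc")
# Assumed topology: b is a closed point between the open points a and c.
OPENS = [frozenset(), frozenset("a"), frozenset("c"), frozenset("ac"), X]

def interior(A):
    return frozenset(x for x in X if any(x in O and O <= A for O in OPENS))

def closure(A):
    return X - interior(X - A)

def E(A):
    """Existential modality as a truth value: A holds somewhere."""
    return bool(A)

def U(A):
    """Universal modality as a truth value: A holds everywhere."""
    return A == X

def DC(A, B):   # not E(A and B)
    return not E(A & B)

def EC(A, B):   # E(dia A and dia B) and not E(box A and box B)
    return E(closure(A) & closure(B)) and not E(interior(A) & interior(B))

def P(A, B):    # U(A -> B)
    return U((X - A) | B)

def EQ(A, B):   # U(A <-> B)
    return U(((X - A) | B) & ((X - B) | A))

left, right = frozenset("a"), frozenset("c")
```

Here `left` and `right` have disjoint interiors, but their closures meet in the point b, so `EC(left, right)` holds; `P(left, frozenset("ab"))` illustrates parthood.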


3.3.2 Temporal operators and boundaries. Another kind of extension
of the modal language of topology comes from temporal logic. Consider the
‘Until’ language of Kamp, 1968. Abstracting from linear temporal behavior
gives a natural notion of spatial ‘Until’, describing truth in a neighborhood up
to some ‘fence’ in topological models:


M, x ⊨ φ𝒰ψ iff there exists an open neighborhood U of x such that ∀y ∈ U
we have φ(y) and ∀z on the boundary of U we have ψ(z).


Here the boundary is definable in the earlier modal language:


boundary(U) = ◇U ∧ ◇¬U.


As in temporal logic, this operator can define various further notions of
interest. This richer language still has topo-bisimulations, in line with the
proposals in Kurtonina and de Rijke, 1997 for dealing with the ∃∀-complexity
of ‘Until’.

Borrowing from temporal logic is an interesting phenomenon per se. Many
temporal principles valid in ℝ survive the move to the spatial interpretation.
E.g., two key equivalences for obtaining temporal normal forms are


t𝒰(p ∨ q) ↔ (t𝒰p) ∨ (t𝒰q)
(p ∧ q)𝒰t ↔ (p𝒰t) ∧ (q𝒰t).


In a two-dimensional spatial setting, the first equivalence fails: Fig. 5.19a
refutes the implication →. But the other direction remains a valid principle of
monotonicity. The direction → of the second equivalence is a general
monotonicity principle again. Conversely, we even have a stronger valid law
(see Fig. 5.19b for an illustration):


p₁𝒰q ∧ p₂𝒰t → (p₁ ∧ p₂)𝒰(q ∨ t).


We mention that Aiello, 2002a contains a more sustained analysis of the spatial
content of the ℝ-complete Until logic of Burgess, 1984.
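The behavior of spatial Until described above can be explored on a finite space. The following is a sketch under assumptions: the three-point space and its topology are hypothetical, the boundary of an open set is computed as the intersection of its closure with the closure of its complement, and formulas are identified with their truth sets.

```python
from itertools import chain, combinations, product

X = frozenset("abc")
# Assumed topology: {a} is the only proper nonempty open set.
OPENS = [frozenset(), frozenset("a"), X]

def subsets(points):
    pts = sorted(points)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

def interior(A):
    return frozenset(x for x in X if any(x in O and O <= A for O in OPENS))

def closure(A):
    return X - interior(X - A)

# boundary(O) = dia O and dia not-O, precomputed for every open set.
BOUNDARY = {O: closure(O) & closure(X - O) for O in OPENS}

def until(phi, psi):
    """Points with an open neighborhood inside phi whose boundary lies inside psi."""
    return frozenset(x for x in X
                     if any(x in O and O <= phi and BOUNDARY[O] <= psi
                            for O in OPENS))

def implies(A, B):
    return (X - A) | B

ALL = subsets(X)
t, p, q = frozenset("a"), frozenset("b"), frozenset("c")

# The implication from t U (p v q) to (t U p) v (t U q) fails at the point a.
first_equivalence_fails = not until(t, p | q) <= (until(t, p) | until(t, q))
# The monotonicity directions and the stronger law hold for all valuations.
monotone_ok = all(implies(until(T, P) | until(T, Q), until(T, P | Q)) == X and
                  implies(until(P & Q, T), until(P, T) & until(Q, T)) == X
                  for T, P, Q in product(ALL, repeat=3))
strong_law_ok = all(implies(until(P1, Q) & until(P2, T),
                            until(P1 & P2, Q | T)) == X
                    for P1, P2, Q, T in product(ALL, repeat=4))
```

The counterexample works because the boundary of {a} in this space is {b, c}: the fence is covered by p ∨ q jointly, but by neither disjunct alone.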





Figure 5.19. Examples of Until models. 


3.3.3 Extended spatio-temporal formalisms. Another use for the pre- 
ceding ideas is in combined logics of space-time, treated extensively in Ch. 9. 
In particular, Shehtman, 1993 axiomatized the complete logic of the rationals 
in this language, while Gerhardt, 2004 added ‘Since’/‘Until’ using methods of 
de Jongh and Veltman, 1985 that go back to Burgess, 1979. But axiomatizing 
the complete logic of the reals remains open. We refer to Bezhanishvili and 
Kupke, 2006 for some interesting progress in tackling this problem using prod- 
ucts of modal logics, while also introducing the ‘Since’/‘Until’ operators for the 
temporal component. Also worth mentioning are combinations of topological 
modalities with one for a continuous map giving us the next step in the tem- 
poral evolution of some dynamical systems; Ch. 10 explains this approach and 
provides references to current work, including the recent undecidability results 
of Konev et al., 2004. 


3.4 Topological semantics for epistemic logic 


Spatial models can also serve other than geometrical purposes. As we noted 
in Sec. 1.3, the earliest topological semantics was actually proposed for mod- 
elling intuitionistic logic based on evidence and knowledge. Nowadays, how- 
ever, standard relational semantics holds sway in modelling intuitionistic logic 
or explicit knowledge-based epistemic logic in the tradition of Hintikka in phi- 
losophy (Hintikka, 1962), Aumann in economics (Binmore, 1994), or Halpern 
and Parikh in computer science (Fagin et al., 1995; Wooldridge, 2002).
Nevertheless, van Benthem and Sarenac, 2004 have shown recently how even
the more technical results obtained in the spatial tradition are illuminating for
knowledge once we switch to a topological interpretation. Ch. 8 is also highly
relevant in this connection, albeit more in the constructive logic tradition.

Our main interest in this chapter is space rather than knowledge. Never- 
theless, we sketch some main ideas from van Benthem and Sarenac, 2004, as 
they involve a further extension of modal languages for space, to also include 
fixed-point operators. 

The most-used relational models in epistemic logic have reflexive and
transitive accessibility relations, and the key semantic clause about an agent’s
knowledge of a proposition says that K_iφ holds at a world x iff φ is true in
all worlds y accessible for i from x. For an illustration of how this works cf.
Fig. 5.20, where we assume that the 1 and 2 relations are reflexive, transitive,
and symmetric.




Figure 5.20. In the black central world, 1 does not know if p, while 2 knows that p. Therefore, 
in the world to the left, 1 knows that 2 knows that p, but 2 does not know if 1 knows p. 











Thus, the epistemic knowledge modality is a modal box □_iφ, and the basic
logic is that of the spatial interpretation, viz. S4. In an epistemic setting, the
spatial modal axioms get a special flavor. E.g., the iteration axiom □_ip →
□_i□_ip now expresses ‘positive introspection’: agents who know something
know that they know it. More precisely, we have S4-axioms for each separate
agent, but no further ‘mixing axioms’ for iterated knowledge of agents such as
□₁□₂p → □₂□₁p. Indeed, the latter implication fails in the above example
because in the world on the left, 1 knows that 2 knows that p, but 2 does not
know if 1 knows p. Another way of describing the set of valid principles is
as the fusion of separate logics S4 for each agent. In what follows, we shall
mostly work with the two-agent group G = {1, 2}.
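The failure of the mixing axiom can be replayed on a small relational model. The model below is an assumption: it is one three-world structure consistent with the caption of Fig. 5.20, not a transcription of the figure itself, and the names `WORLDS`, `R` and `K` are hypothetical.

```python
WORLDS = {"left", "centre", "right"}
P_TRUE = {"left", "centre"}            # assumed valuation of p

def equivalence(classes):
    """Equivalence relation (as a set of pairs) with the given classes."""
    return {(u, v) for cls in classes for u in cls for v in cls}

# Agent 1 cannot tell centre from right; agent 2 cannot tell left from centre.
R = {1: equivalence([{"left"}, {"centre", "right"}]),
     2: equivalence([{"left", "centre"}, {"right"}])}

def K(agent, truth_set):
    """Worlds where the agent knows the proposition with the given truth set."""
    return {w for w in WORLDS
            if all(v in truth_set for (u, v) in R[agent] if u == w)}

def NOT(truth_set):
    return WORLDS - truth_set

k1p, k2p = K(1, P_TRUE), K(2, P_TRUE)
one_knows_two_knows = K(1, k2p)        # box_1 box_2 p
two_knows_one_knows = K(2, k1p)        # box_2 box_1 p
```

In the left world, `one_knows_two_knows` holds while `two_knows_one_knows` does not, so □₁□₂p → □₂□₁p is refuted there, matching the discussion above.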


3.4.1 Group knowledge: agents as relations. A striking discovery in 
an interactive epistemic setting has been various notions of what may be called 
group knowledge. Two well known examples are the following (Fagin et al., 
1995): 


1 E_Gφ: every agent in group G knows that φ,


2 C_Gφ: φ is common knowledge in the group G.


The latter notion has been proposed in the philosophical, economic, and
linguistic literature as a necessary precondition for coordinated behavior between
agents (cf. Lewis, 1969). The usual semantic definition of common knowledge
runs as follows:


M, x ⊨ C_{1,2}φ iff for all y with x(R₁ ∪ R₂)*y we have M, y ⊨ φ,


where x(R₁ ∪ R₂)*y if x is connected to y by a finite sequence of successive
steps from either of the two accessibility relations. This is the familiar
reflexive-transitive closure of the union of the relations for both agents. The
key valid principles for common knowledge are as follows:


(Equilibrium Axiom)   C_{1,2}φ ↔ (φ ∧ (□₁C_{1,2}φ ∧ □₂C_{1,2}φ))


(Induction Rule)      from ⊢ p → (□₁(p ∧ φ) ∧ □₂(p ∧ φ)) infer ⊢ p → C_{1,2}φ


This logic is known in the literature as S4₂^C. It has been shown to be complete
and decidable (Fagin et al., 1995).

A further interesting notion of knowledge for a group of agents is the
so-called implicit knowledge D_Gφ, which roughly describes what a group would
know if its members decided to merge their information:


M, x ⊨ D_{1,2}φ iff for all y with x(R₁ ∩ R₂)y we have M, y ⊨ φ,


where R₁ ∩ R₂ is the intersection of the accessibility relations for the separate
agents. Unlike universal and common knowledge, this notion is not invariant
under modal bisimulations. It also involves a new phenomenon of merging
information possessed by different agents. The latter topic will return below.

New notions of group knowledge introduce new agents. E.g., C_G defines a
new kind of S4-agent since (R₁ ∪ R₂)* is again reflexive and transitive. Note
that R₁ ∪ R₂ is not necessarily transitive, so the new ‘agent’ corresponding to
the fact that ‘everybody knows’ would have different epistemic properties. In
particular, it would lack positive introspection as to what it knows. In contrast,
the relation R₁ ∩ R₂ for D_G is again an S4-agent since Horn conditions like
reflexivity and transitivity are preserved under intersections of relations. Thus,
with two S4-agents 1, 2, two additional agents supervene, one weaker and one
stronger:


        (R₁ ∪ R₂)*
         /      \
       R₁        R₂
         \      /
         R₁ ∩ R₂
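The inclusion diagram can be verified on a concrete pair of relations. The sketch below is a minimal illustration with an assumed three-world model; it computes the relation for ‘everybody knows’ (the plain union), for common knowledge (the reflexive-transitive closure of the union), and for implicit knowledge (the intersection).

```python
from itertools import product

def equivalence(classes):
    return {(u, v) for cls in classes for u in cls for v in cls}

def reflexive_transitive_closure(rel, worlds):
    closure = set(rel) | {(w, w) for w in worlds}
    changed = True
    while changed:
        changed = False
        # Iterate over a snapshot so the set can grow inside the loop.
        for (a, b), (c, d) in product(list(closure), repeat=2):
            if b == c and (a, d) not in closure:
                closure.add((a, d))
                changed = True
    return closure

def is_reflexive(rel, worlds):
    return all((w, w) in rel for w in worlds)

def is_transitive(rel):
    return all((a, d) in rel for (a, b) in rel for (c, d) in rel if b == c)

# Assumed example model with two S4 (here even S5) agents.
WORLDS = {"x", "y", "z"}
R1 = equivalence([{"x"}, {"y", "z"}])
R2 = equivalence([{"x", "y"}, {"z"}])

everybody = R1 | R2                                      # relation for E_G
common = reflexive_transitive_closure(everybody, WORLDS)  # relation for C_G
implicit = R1 & R2                                        # relation for D_G
```

In this model the union is not transitive (x sees y, y sees z, but x does not see z), while both `common` and `implicit` are reflexive and transitive, as the text predicts.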


3.4.2 Alternative views of common knowledge. Despite the success 
of standard epistemic logic, there are still doubts about its expressive power 
and sensitivity. Notably, the well-known critical paper Barwise, 1988 claimed 
that a proper analysis of common knowledge must distinguish three different 
approaches: 


1 countably infinite iteration of individual knowledge modalities, 
2 the fixed-point view of common knowledge as ‘equilibrium’, 
3 agents’ having a shared epistemic situation. 


Barwise’s distinctions are hard to implement in standard relational semantics. 
But they make sense in topological semantics—where they suggest interesting 
language extensions. Here is some technical groundwork. 

The Equilibrium Axiom for the operator C_Gφ describes it as a fixed-point of
an epistemic operator λX. φ ∧ □₁X ∧ □₂X. In conjunction with the Induction
Rule, it may even be seen to be the greatest fixed-point, definable in the
standard modal μ-calculus as:


C_{1,2}φ := νp. φ ∧ □₁p ∧ □₂p.


The greatest fixed-point is computed as the first stabilization stage of a
descending approximation sequence for a monotonic set function through the
ordinals. We write [|φ|] for the truth set of φ in the relevant model where
evaluation takes place:


C^0_{1,2}φ = [|φ|],
C^{κ+1}_{1,2}φ = [|φ ∧ □₁(C^κ_{1,2}φ) ∧ □₂(C^κ_{1,2}φ)|],
C^λ_{1,2}φ = [|⋀_{κ<λ} C^κ_{1,2}φ|], for λ a limit ordinal.


Finally, we let C_{1,2}φ := C^κ_{1,2}φ where κ is the least ordinal for which the
approximation procedure halts; that is, C^{κ+1}_{1,2}φ = C^κ_{1,2}φ. In general,
reaching this stopping point may take any number of ordinal stages. E.g., the
least-fixed-point formula μp.□p computing the ‘well-founded part’ of the binary
accessibility relation may stabilize only at the cardinality of the model. But in
certain cases we can do much better, as the following well-known fact shows:


FACT 5.84 In every relational epistemic model, the approximation procedure
for the common knowledge modality stabilizes at κ ≤ ω.
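On a finite relational model the descending approximation sequence can be computed outright. The following sketch assumes a hypothetical four-world model and compares the greatest fixed-point obtained by iteration with the reachability definition of common knowledge; it illustrates the procedure and is not a proof of the fact.

```python
WORLDS = frozenset(range(4))

def equivalence(classes):
    return {(u, v) for cls in classes for u in cls for v in cls}

# Assumed model: agent 1 links 0-1 and 2-3, agent 2 links 1-2.
R1 = equivalence([{0, 1}, {2, 3}])
R2 = equivalence([{0}, {1, 2}, {3}])
PHI = frozenset({0, 1, 2})             # assumed truth set of phi

def box(rel, truth_set):
    return frozenset(w for w in WORLDS
                     if all(v in truth_set for (u, v) in rel if u == w))

def approximation_stages(phi):
    """Stages C^0, C^1, ... of nu p. phi & box_1 p & box_2 p, until stable."""
    stages = [phi]
    while True:
        nxt = phi & box(R1, stages[-1]) & box(R2, stages[-1])
        if nxt == stages[-1]:
            return stages
        stages.append(nxt)

def common_knowledge_by_reachability(phi):
    """Worlds from which every (R1 u R2)*-reachable world satisfies phi."""
    reach = {w: {w} for w in WORLDS}
    changed = True
    while changed:
        changed = False
        for w in WORLDS:
            for (u, v) in R1 | R2:
                if u in reach[w] and v not in reach[w]:
                    reach[w].add(v)
                    changed = True
    return frozenset(w for w in WORLDS if reach[w] <= phi)

stages = approximation_stages(PHI)
```

Each stage strips off the worlds from which a ¬φ-world is reachable in one more step, so the sequence stabilizes after finitely many rounds on this model and its last stage agrees with the reachability definition.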


This result shows that Barwise’s fixed-point and countable-iteration views
of common knowledge coincide in relational models. More precisely,
νp. φ ∧ □₁p ∧ □₂p is equivalent to


K_{1,2}φ = φ ∧ □₁φ ∧ □₂φ ∧ □₁□₂φ ∧ …


The simple stabilization behavior at ω is most easily understood by observing
that the knowledge modalities □_i distribute over any infinite conjunction.
Therefore, □_i(⋀_{n<ω} C^n_{1,2}φ) is simply ⋀_{n<ω} □_iC^n_{1,2}φ, which is
equivalent to ⋀_{n<ω} C^n_{1,2}φ. More generally, stabilization for the formula
νp.φ(p) is guaranteed by stage ω in any model in case the syntax defining the
monotone approximation operator is constrained to a ‘universal-conjunctive’
format (van Benthem, 1996).


3.4.3 Topological models for epistemic logic with fixed-points. The
language of epistemic logic can be interpreted just as well in topological models,
although the presence of many agents calls for an indexed family of topologies
on the base set of worlds, representing their individual information structures.
All the notions such as bisimulation, axiomatic systems, and the product
constructions of Sec. 3.2 also make sense epistemically. But these now acquire
a special flavor: putting together topological models into one product space
amounts to merging information spaces for different agents. The earlier
horizontal and vertical topologies on the products encode the agents’ original
individual spaces. Our earlier result that the modal logic of the product
construction is the fusion S4 ⊕ S4 then says epistemically that we have really
defined a good ‘conservative merge’ without side-effects.

Further topologies on the product space encode further emergent
group-oriented information structures. The earlier definitions of common knowledge
still make sense in topological models. As before, the countably infinite iteration
of all finite sequences of alternating knowledge modalities for the individual
agents 1, 2 is


K_{1,2}φ = ⋀_{n<ω} K^n_{1,2}φ,


with K^n_{1,2}φ defined inductively as follows:


K^0_{1,2}φ = φ
K^{n+1}_{1,2}φ = □₁(K^n_{1,2}φ) ∧ □₂(K^n_{1,2}φ).


The same is true for the fixed-point definition


C_{1,2}φ := νp. φ ∧ □₁p ∧ □₂p.


However, the definitions of common knowledge by fixed-points and by countably
infinite iteration will now diverge, because one can show that given an
interpretation of p, the interpretation of K_{1,2}p does not always define a
horizontally and vertically open set in the product model. Since the fixed-point
version of C_{1,2}p is always horizontally and vertically open, it follows that the
two are not the same. We refer to van Benthem and Sarenac, 2004 for the
details.

Returning to an earlier topic, we can now view product spaces as introducing
new ‘collective agents’ via new topologies. In particular, common knowledge
as a greatest fixed-point corresponds to the intersection τ₁ ∩ τ₂ of the horizontal
and vertical topologies on the product space. On the other hand, the topological
meaning of the implicit group knowledge D_G is the join τ₁ ∨ τ₂ of the horizontal
and vertical topologies. Its basis is the pairwise intersection of horizontal and
vertical opens. The latter topology need not always be of great interest. For
instance, τ₁ ∨ τ₂ is discrete on ℚ × ℚ. From an informational perspective, this
means that merging the information that we get about points in the horizontal
and vertical directions fixes their position uniquely. All this again yields an
inclusion diagram:


        τ₁ ∨ τ₂
        /     \
      τ₁       τ₂
        \     /
        τ₁ ∩ τ₂


Returning to the three distinctions made in Barwise, 1988, what about the
third view of having a ‘shared situation’? One good candidate for it would be
the standard product topology τ. The agent corresponding to this new group
concept τ only accepts very strong collective evidence for any proposition. And
we know the complete logic of adding this agent from the joint axiomatization
of horizontal, vertical, and standard product topologies from Sec. 3.2.3.


4. Modal logic and geometry 


This chapter has been mainly concerned with modal aspects of topology. 
But many mathematical theories of space exist beyond topology, such as affine 
and metric geometry, or newer theories like mathematical morphology, more
within linear algebra. And indeed, modal structures emerge in all of them.
Our final two sections provide a brief account of this, if only to put the modal 
topological approach in a broader perspective. Sec. 4 is about geometry proper, 
Sec. 5 will deal with vector spaces. For further details and alternative logical 
approaches, we refer to the chapters in this book on modal and first-order 
theories of geometry (Ch. 2; Ch. 7; Ch. 14). 

We start by recalling that affine geometry is given by the following three 
axioms involving points, lines, and an incidence relation (Blumenthal, 1961; 
Goldblatt, 1987; and Ch. 7): 


A1 Any two distinct points lie on exactly one line.
A2 There exist at least three non-collinear points. 


A3 Given a point a and a line L, there is exactly one line M that passes through 
a and is parallel to L. 


Affine spaces have a strong modal flavor (Balbiani et al., 1997; Balbiani, 
1998; Venema, 1999; Stebletsova, 2000). Approaches include two-sorted ver- 
sions with matching bimodal operators, and merging points and lines into one 
sort of pairs (point, line) equipped with two incidence relations. By contrast, 
the classical approach to affine structure is Tarski, 1959, which contains a
complete first-order axiomatization of elementary geometry in terms of a ternary
betweenness predicate β(xyz), as well as quaternary equidistance δ(xyzu),
interpreted as x is as distant from y as z is from u. Yet, Tarski’s beautiful
decidable axiomatization still leaves things to be desired. First, the system has 
high complexity, viz. exponential space (Ben-Or et al., 1986). And from an 
expressive viewpoint, the axioms mix betweenness and equidistance, whereas 
one would like to understand affine and metric structure separately. A complete 
axiomatization of pure affine first-order geometry was given in Szczerba and 
Tarski, 1965. 

We now turn to the modal view. There will be a ‘style break’ here as
compared to our account in this chapter so far. We presented topological models as
a generalization of relational semantics, where the topology need not be
generated by any ordering relation on the space. In visual terms, this reflects the
intuitively ‘malleable’ nature of open sets. However, moving from topology to
geometry, we encounter basic relations such as betweenness and equidistance.
And these lead to modal structures of a more classical relational kind, albeit with
ternary rather than the usual binary relations. This difference should not be
a problem, but rather an asset. Indeed, topological and relational views live
together harmoniously, say in a combined modal language for topology plus
betweenness.


4.1 Affine geometry in modal logic


4.1.1 Basic modal language and affine transformations. We start by
defining a binary betweenness modality <B>:


M, x ⊨ <B>(φ, ψ) iff ∃y, z : β(yxz) and M, y ⊨ φ and M, z ⊨ ψ.


Our language is a propositional language enriched with the betweenness modal
operator <B>. Models for this language are triples ⟨X, β, ν⟩, where X is a
nonempty set, β is a ternary betweenness relation on X, and ν is a valuation
function. In this setting, familiar modal notions sometimes acquire surprising
new flavors. For instance, we get a new sort of geometrical model
transformation appropriate to the modal view of affine structure. Affine
bisimulations relate points verifying the same proposition letters, while
maintaining the betweenness relation:





DEFINITION 5.85 (AFFINE BISIMULATION) Given two affine models ⟨X,
β, ν⟩ and ⟨X', β', ν'⟩, with x, y, z ranging over X and x', y', z' over X', an
affine bisimulation is a nonempty relation B ⊆ X × X' such that if xBx' then:


1 x and x' satisfy the same proposition letters


2 (forth condition): β(yxz) ⇒ ∃y'z' : β'(y'x'z') and yBy' and zBz'


3 (back condition): β'(y'x'z') ⇒ ∃yz : β(yxz) and yBy' and zBz'.


In Goldblatt, 1987 isomorphisms are considered the only interesting maps 
across affine models. But in fact, just as with topological bisimulations versus 
homeomorphisms (Theorem 5.6), affine bisimulations are interesting coarser 
ways of comparing spatial situations. In the true modal spirit they only consider 
behavior of points inside local line environments. Fig. 5.21 shows a case of
non-isomorphic yet bisimilar triangles with atomic properties indicated. This affine
bisimulation can be regarded as a sort of ‘modal contraction’ to the smallest 
model with the same structure. 


Figure 5.21. Affine bisimilar models.


By contrast, the models in Fig. 5.22 are not bisimilar: affine bisimulations
preserve truth of modal formulas in an obvious way, but q ∧ <B>(r, r) holds
at the q point of the left model and nowhere on the right. This is also ‘modal
logic of space’, but clearly in a new vein!


Figure 5.22. Non affine bisimilar reduction.

Incidentally, there is a smaller affine bisimulation contraction for the left- 
hand triangle in Fig. 5.22. But the resulting model is not ‘planar’: it cannot be 
represented in two-dimensional Euclidean space. Now consider a new valuation 
shown in Fig. 5.23. In this case there does not exist a bisimilar contraction: 
every point of the triangle is distinguishable by a formula which is not true on 
any other point, see Fig. 5.24. 


Figure 5.23. An irreducible affine model.


Point | Formula
1     | φ₁ = p ∧ <B>(q, r)
2     | φ₂ = p ∧ ¬φ₁
3     | φ₃ = q ∧ <B>(φ₁, φ₂)
4     | φ₄ = r
5     | φ₅ = q ∧ <B>(φ₂, φ₄)
6     | φ₆ = q ∧ ¬φ₃ ∧ ¬φ₅


Figure 5.24. Formulas true at points of the model in Fig. 5.23.


4.1.2 Modal logics of betweenness. The preceding language has a
minimal logic as usual, which does not yet have much geometric content. Its
key axioms are two distribution laws:


<B>(p ∨ q, r) ↔ <B>(p, r) ∨ <B>(q, r)
<B>(p, q ∨ r) ↔ <B>(p, q) ∨ <B>(p, r).


This minimal logic has all the usual modal properties, including decidability.
Further axioms would express basic universal frame conditions such as
betweenness being symmetric at end-points and all points lying ‘in between
themselves’:


<B>(p, q) → <B>(q, p)
p → <B>(p, p).
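The distribution laws and the two frame axioms above can be checked by brute force on a small betweenness frame. The sketch below assumes a hypothetical frame of four points on a line, with non-strict betweenness, and identifies formulas with their truth sets; the operator `B` implements the semantic clause for the betweenness modality.

```python
from itertools import chain, combinations, product

POINTS = range(4)
# Assumed frame: (y, x, z) is in BETA when x lies between y and z on the line.
BETA = {(y, x, z) for y, x, z in product(POINTS, repeat=3)
        if min(y, z) <= x <= max(y, z)}

def subsets(points):
    pts = list(points)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

def B(P, Q):
    """Truth set of the betweenness modality applied to truth sets P and Q."""
    return frozenset(x for (y, x, z) in BETA if y in P and z in Q)

ALL = subsets(POINTS)
distribution_left = all(B(P | Q, R) == B(P, R) | B(Q, R)
                        for P, Q, R in product(ALL, repeat=3))
distribution_right = all(B(P, Q | R) == B(P, Q) | B(P, R)
                         for P, Q, R in product(ALL, repeat=3))
symmetry = all(B(P, Q) <= B(Q, P) for P, Q in product(ALL, repeat=2))
self_between = all(P <= B(P, P) for P in ALL)
```

The distribution laws hold on every frame, since the modality is existential in both arguments; the last two checks succeed here because this particular betweenness relation is symmetric in its end-points and every point lies between itself and itself.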


These are simple modal frame correspondences. A more interesting example
was already mentioned in Sec. 1.5, involving an existential affine axiom.
Consider associativity of the betweenness modality:


<B>(p, <B>(q, r)) → <B>(<B>(p, q), r).


FACT 5.86 Modal Associativity corresponds to Pasch’s Axiom. 


Proof We spell out the simple correspondence argument to show how easy
matches can be between modal axioms and geometric laws. Consider Pasch’s
Axiom (Sec. 1.5). Suppose that


∀t x y z u (β(xtu) ∧ β(yuz) → ∃v : β(xvy) ∧ β(vtz))


holds in a frame. Assume that a point t satisfies <B>(p, <B>(q, r)). Then
there exist points x, u with β(xtu) such that x ⊨ p and u ⊨ <B>(q, r).
Therefore, there also exist points y, z with β(yuz) such that y ⊨ q and z ⊨ r.
Now by Pasch’s Axiom, there must be a point v with β(xvy) and β(vtz). Thus,
v ⊨ <B>(p, q) and hence t ⊨ <B>(<B>(p, q), r).

Conversely, assume that β(xtu) and β(yuz). Define a valuation ν on the space
by setting ν(p) = {x}, ν(q) = {y}, and ν(r) = {z}. Thus, u ⊨ <B>(q, r)
and


t ⊨ <B>(p, <B>(q, r)).


Then by the validity of Modal Associativity,


t ⊨ <B>(<B>(p, q), r).


Therefore, there must be points v, w with β(vtw) such that v ⊨ <B>(p, q)
and w ⊨ r. By the definition of ν, the latter implies w = z, and the former
that β(xvy). So indeed v is the required point. QED
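The correspondence can also be observed experimentally on finite frames. In the sketch below, both frames are assumptions chosen for illustration: a four-point line with non-strict betweenness, and a minimal five-point frame containing only the two triples from the antecedent of Pasch’s Axiom. The code checks the first-order condition and the validity of Modal Associativity separately.

```python
from itertools import chain, combinations, product

def subsets(points):
    pts = list(points)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(pts, r) for r in range(len(pts) + 1))]

def satisfies_pasch(points, beta):
    """beta(x,t,u) and beta(y,u,z) imply some v with beta(x,v,y) and beta(v,t,z)."""
    return all(any((x, v, y) in beta and (v, t, z) in beta for v in points)
               for (x, t, u) in beta for (y, u2, z) in beta if u2 == u)

def B(beta, P, Q):
    """Truth set of the betweenness modality; triples are (end, middle, end)."""
    return frozenset(m for (a, m, b) in beta if a in P and b in Q)

def validates_associativity(points, beta):
    """Checks the associativity implication for every valuation of p, q, r."""
    all_sets = subsets(points)
    return all(B(beta, P, B(beta, Q, R)) <= B(beta, B(beta, P, Q), R)
               for P, Q, R in product(all_sets, repeat=3))

# Assumed frame 1: four points on a line, non-strict betweenness.
line_points = range(4)
line_beta = {(y, x, z) for y, x, z in product(line_points, repeat=3)
             if min(y, z) <= x <= max(y, z)}

# Assumed frame 2: only the two triples of Pasch's antecedent, no witness v.
bare_points = tuple("xtuyz")
bare_beta = {("x", "t", "u"), ("y", "u", "z")}
```

On the bare frame the valuation from the proof (p true only at x, q only at y, r only at z) refutes associativity at t, exactly because no Pasch witness v exists.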


All these correspondences may even be computed automatically as they have
‘Sahlqvist form’ (cf. Blackburn et al., 2001 for more general theory). Thus,
modal axioms correspond to significant geometrical axioms, reflecting insights
from the usual arithmetizations of geometry. Deeper examples may be found
in Stebletsova, 2000 and Stebletsova and Venema, 2001, who analyze modal
structures in projective geometry, including Pappus’ Theorem.

Complete affine modal logics of special models may also be axiomatized,
though only a few examples have been dealt with so far. At least for the real line
ℝ, the task is easy as one can take advantage of the binary ordering ≤, defining


M, x ⊨ <B>(φ, ψ) iff ∃y, z : M, y ⊨ φ and M, z ⊨ ψ and y ≤ x ≤ z.


Using this, we can define temporal operators Future and Past (both including
the present). Conversely, these two unary operators define <B> on ℝ:


<B>(φ, ψ) ↔ (Pφ ∧ Fψ).


Thus, a complete and decidable axiomatization for our <B>-language can be
found using the well-known tense logic of future and past on ℝ (Segerberg,
1970). We refer to Ch. 7 for further details.


4.1.3 Logics of convexity. An interesting and rich special case of
affine structure is the convex closure of a set, consisting of all points lying on
a segment whose end-points are in the set. Convexity is important in many
fields from computational geometry (Preparata and Shamos, 1985) to cognitive
science (Gärdenfors, 2000). We can capture convexity modally by frames of
points with the betweenness relation:


(5.2) M, x ⊨ Cφ iff ∃y, z : M, y ⊨ φ and M, z ⊨ φ and x lies in between y and z.


This is a one-step convexity operator whose countable iteration yields the
standard convex closure, cf. Fig. 5.6. The corresponding binary modality Cφ is
defined as follows:


∃yz : β(yxz) and φ(y) and φ(z).


Basic axioms are different here from the preceding subsection. In particular,
distributivity fails. The one-step convex closure of a set of two distinct points
is the whole interval in between, while the union of their separate one-step
closures is just these points themselves. Thus, only monotonicity remains as a
valid reasoning principle. Another principle which may fail is the idempotence
of the convexity modality:


CCφ ↔ Cφ.


Iterating C can lead to new sets, witness Fig. 5.6. Even so, the non-idempotence
is of interest, as it helps distinguish dimensions. For instance, CCφ ↔ Cφ
holds in ℝ, but not in ℝ².

One may now think that the stages C^{n+1}φ ↔ C^nφ determine the dimension
of the spaces ℝⁿ for all n. But here is a surprise.


THEOREM 5.87 (AIELLO, 2002A) CCCφ ↔ CCφ holds in ℝ³.


Nevertheless, convexity does provide dimension principles after all. Here is
an old result from Helly, 1923:


THEOREM 5.88 If K₁, K₂, …, K_m are convex sets in the n-dimensional
Euclidean space ℝⁿ with m > n + 1, and if for every choice of n + 1 sets K_i
there exists a point that belongs to all the chosen sets, then there exists a point
that belongs to all the sets K₁, K₂, …, K_m.


Our modal language formalizes this theorem as follows:


⋀_{f:{1,…,n+1}→{1,…,m}} E(⋀_{i=1}^{n+1} C^n φ_{f(i)}) → E(⋀_{i=1}^{m} C^n φ_i),


where E is the existential modality defined in terms of betweenness:


Eφ iff <B>(φ, ⊤).


Interestingly, Helly’s results have been revived recently in Leitgeb, 2005 in
a new reconstruction of Carnap’s geometrical constructions in Carnap, 1998.
Thus again, modal languages capture significant geometrical facts.
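The failure of idempotence for the one-step convexity operator in the plane, noted earlier in this subsection, can be made concrete with three points. The sketch below is an illustration under assumptions: the triangle, the helper names, and the use of exact rational arithmetic are choices made for this example. It exhibits a point that is not in Cφ but is between two points of Cφ, hence in CCφ.

```python
from fractions import Fraction as F

def between(y, x, z):
    """True iff x lies on the closed segment from y to z (exact arithmetic)."""
    if y == z:
        return x == y
    cross = (z[0] - y[0]) * (x[1] - y[1]) - (z[1] - y[1]) * (x[0] - y[0])
    dot = (x[0] - y[0]) * (z[0] - y[0]) + (x[1] - y[1]) * (z[1] - y[1])
    length_sq = (z[0] - y[0]) ** 2 + (z[1] - y[1]) ** 2
    return cross == 0 and 0 <= dot <= length_sq

def one_step(x, S):
    """Membership of x in the one-step convex closure C(S) of a finite set S."""
    return any(between(y, x, z) for y in S for z in S)

# Assumed example: phi holds exactly at the three vertices of a triangle.
v0, v1, v2 = (F(0), F(0)), (F(3), F(0)), (F(0), F(3))
triangle = [v0, v1, v2]
centroid = (F(1), F(1))
midpoint = (F(3, 2), F(3, 2))          # midpoint of the edge from v1 to v2

centroid_in_C = one_step(centroid, triangle)
midpoint_in_C = one_step(midpoint, triangle)
# v0 and the midpoint both lie in C(phi); the centroid lies between them.
centroid_in_CC = (one_step(v0, triangle) and midpoint_in_C
                  and between(v0, centroid, midpoint))
```

One application of C yields only the edges of the triangle; the interior point appears at the second application, which is why CCφ ↔ Cφ cannot hold in two dimensions.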


4.1.4 First-order affine geometry. As usual, the above modal language 
is a fragment of a first-order language under the standard translation. The 
relevant first-order language is not quite that of Tarski’s elementary geometry, 
however, as we also get unary predicate letters denoting regions. That is, in 
terms of validity, we are rather in monadic second-order logic. As in our 
discussion of topology, the affine first-order or monadic second-order language 
of regions is a natural limit toward which affine modal languages can strive 
via various logical extensions. From a geometrical viewpoint, one might also 
hope that ‘layering’ the usual language in this modal way will bring to light 
interesting new geometrical facts. For much more information on current first- 
order theories of geometry we refer to Ch. 2, Ch. 7. 

Another major feature of standard geometry is the equal status of points 
and lines. This would suggest a reorganization of the modal logic to a two- 
sorted one (cf. Marx and Venema, 1997) stating properties of both points and 
segments viewed as independent semantic objects. One can think of this as 
a way of lowering the second-order complexity as the relevant subsets have 
now become first-order objects in their own right (cf. van Benthem, 1999). 
There are also other analogies with existing modal systems, such as arrow logic
(van Benthem, 1996; Venema, 1996), where ‘arrows’—standing for directed 
transitions between points (or vectors)—become semantic objects in their own 
right, in addition to the points themselves. Arrow logic will be explained in 
some more detail in our account of mathematical morphology and linear algebra. 
The two-sorted move seems very geometrical in spirit, and it would also reflect 
duality principles of the sort that led from affine to projective geometry. 


4.2 Metric geometry in modal logic 


The next level in geometry beyond affine point-line patterns is metric struc- 
ture. We show how modal languages can describe a notion of comparative 
distance. 


4.2.1 Structures for relative nearness. Relative nearness was intro- 
duced in van Benthem, 1983b as a natural primitive for ‘orientation’ which has 
spatial discourse in natural language: 
$N(x, y, z)$ iff $y$ is closer to $x$ than $z$ is, i.e., $d(x, y) < d(x, z)$
where d(x, y) is the distance function (see Fig. 5.25). 


Figure 5.25. From point x, point y is closer than point z. 


The function d can be a spatial metric, cognitive visual closeness, or even a 
utility function. Randell et al., 2001 develop the first-order logic of comparative 
nearness for the purpose of robot navigation, extending the calculus of regions 
RCC. In such a setting, relative nearness is a powerful primitive, as it defines 
equidistance: 


$\mathrm{Eqd}(x, y, z) := \neg N(x, y, z)$ and $\neg N(x, z, y)$.


Affine betweenness is also definable in terms of $N$, at least in $\mathbb{R}^n$: cf. Sec.
4.2.2. Finally, even the identity of points $x = y$ is expressible:


$x = y$ iff $\neg N(x, x, y)$.
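
The three definitions above can be tried out directly. The sketch below is an illustration of ours, assuming points of the plane with integer coordinates and the Euclidean metric; it compares squared distances, which gives the same ordering as $d$ while avoiding rounding. The function names are hypothetical.

```python
def d2(x, y):
    """Squared Euclidean distance; orders pairs exactly as d does."""
    return sum((a - b) ** 2 for a, b in zip(x, y))

def N(x, y, z):
    """Relative nearness: y is strictly closer to x than z is."""
    return d2(x, y) < d2(x, z)

def eqd(x, y, z):
    """Equidistance: neither y nor z is closer to x."""
    return not N(x, y, z) and not N(x, z, y)

def same_point(x, y):
    """Identity of points, expressed through N alone."""
    return not N(x, x, y)
```

For instance, `eqd((0, 0), (3, 4), (5, 0))` holds because both points lie at distance 5 from the origin, and `same_point(x, y)` fails exactly when $d(x, y) > 0$.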




The further analysis of this structure can proceed as in the affine case. But 
there are also surprising open questions. E.g., the universal first-order theory 
of relative nearness for Euclidean spaces has still not been axiomatized. 


4.2.2 Modal logic of nearness. In line with the main concern of this 
chapter, consider this obvious modal operator accessing ternary nearness N: 





$M, x \models \langle N\rangle(\varphi, \psi)$ iff $\exists y, z$: $M, y \models \varphi$ and $M, z \models \psi$ and $N(x, y, z)$.

The universal dual of $\langle N\rangle$ is also interesting in its spatial behavior:


$M, x \models [N](\varphi, \psi)$ iff $\forall y, z$ ($N(x, y, z)$ and $M, y \models \varphi \to M, z \models \neg\psi$).


Dropping the negation, one gets the following appealing notion:

If any point $y$ around the current point $x$ satisfies $\varphi$, then all points $z$
further out must satisfy $\psi$.


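The basic modal logic of nearness again has distribution laws:


$\langle N\rangle(p \vee q, r) \leftrightarrow \langle N\rangle(p, r) \vee \langle N\rangle(q, r)$
$\langle N\rangle(p, q \vee r) \leftrightarrow \langle N\rangle(p, q) \vee \langle N\rangle(p, r)$.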


On top of this, natural universal frame constraints return as special axioms. 
Here are two examples: 


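(transitivity)
$\langle N\rangle(p, q) \wedge \neg\langle N\rangle(p, p) \wedge \neg\langle N\rangle(q, q) \wedge \langle N\rangle(q, r) \to \langle N\rangle(p, r)$


(connectedness)
$\langle N\rangle(p, q) \wedge \neg\langle N\rangle(p, p) \wedge \neg\langle N\rangle(q, q) \wedge Er \to \langle N\rangle(p, r) \vee \langle N\rangle(r, q)$.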


Modal logics of nearness on special structures may include further constraints 
computable by correspondence techniques. Here is a useful observation cov- 
ering many cases. Our language can define that y holds in just one unique 
point: 

$E!\varphi$ iff $E(\varphi \wedge \neg\langle N\rangle(\varphi, \varphi))$.
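
To see how the nearness modality and the uniqueness operator behave, here is a minimal model-checking sketch over a finite set of points. It is an illustration under our own assumptions: formulas are represented by their extensions (sets of points), the metric is Euclidean on integer coordinates, and $E$ is implemented directly as a global existential rather than through betweenness. All names are hypothetical.

```python
def d2(x, y):
    """Squared Euclidean distance (same ordering as the distance d)."""
    return sum((a - b) ** 2 for a, b in zip(x, y))

def N(x, y, z):
    return d2(x, y) < d2(x, z)

def diamond_N(points, phi, psi):
    """Extension of <N>(phi, psi): some y in phi is closer to x than some z in psi."""
    return {x for x in points if any(N(x, y, z) for y in phi for z in psi)}

def exists(points, phi):
    """E phi holds everywhere if phi holds somewhere, and nowhere otherwise."""
    return set(points) if phi else set()

def unique(points, phi):
    """E! phi, defined as E(phi and not <N>(phi, phi))."""
    return exists(points, phi - diamond_N(points, phi, phi))

points = {(0, 0), (1, 0), (3, 0)}
p, q = {(1, 0)}, {(3, 0)}
```

With this data, `unique(points, p)` is the whole model because $p$ holds at a single point, whereas `unique(points, p | q)` is empty: each of the two points satisfying $p \vee q$ sees itself as closer than the other one.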


Now a straightforward proof, known from extended modal logics with a differ- 
ence modality (de Rijke, 1993), establishes the following: 


PROPOSITION 5.89 Every universal first-order property of N is modally 
definable. 


Sheremet et al., 2006 provide a sophisticated analysis of modal languages 
for comparative nearness, including boundaries between decidable and unde- 
cidable systems. 
4.2.3 First-order theory of nearness. As in the affine case, the backdrop
to our modal analysis is the first-order theory of relative nearness (cf.
Aiello and van Benthem, 2002b).


Fact 5.90 The single primitive of comparative nearness defines the two prim- 
itives of Tarski’s Elementary Geometry in first order logic. 


Proof The following defines betweenness (see Fig. 5.26): 





$B(yxz)$ iff $\neg\exists x'$: $N(y, x', x)$ and $N(z, x', x)$.


Figure 5.26. Defining betweenness via nearness.


This allows us to define parallel segments in the usual way as having no
intersection points on their generated lines:

$xx' \parallel yy'$ iff $\neg\exists c : B(xx'c) \wedge B(yy'c)$ $\wedge$
$\neg\exists c' : B(c'xx') \wedge B(c'yy')$ $\wedge$
$\neg\exists c'' : B(xc''x') \wedge B(yc''y')$.


Then one defines equal segment length by

$\delta(x, y, z, u)$ iff $\exists y'$: $xu \parallel yy'$ and $xy \parallel uy'$ and $\neg N(u, z, y')$ and $\neg N(u, y', z)$
(see Fig. 5.27).


QED 


There are many other systems of first-order geometry with similar richness. 
For instance, see the axiomatization of constructive geometry in von Plato, 
1995. Having established this connection with the first-order realm, we pass 
the torch to the relevant chapters in this book: Ch. 2, Ch. 7. 


5. Modal logic and linear algebra 


Figure 5.27. Equidistance in terms of nearness.


Classical geometry is not the last word in mathematical theories of space.
Connections between linear algebra and spatial representation are well-known
from a major qualitative visual theory, viz. mathematical morphology
(Matheron, 1967; Serra, 1982). We provide a brief treatment along the lines of Aiello
and van Benthem, 2002a and van Benthem, 2000b, to show that modal structures
play a natural role here as well. The flavor of modal logic of linear algebra
is different from that of topology or geometry, but similar themes emerge all
the same. A different connection between mathematical morphology and modal
logic can be found in Bloch, 2000. See Ch. 14 of this book for a more
comprehensive treatment.


5.1 Mathematical morphology 


We first set the scene. In line with our spatial interests, we consider vector 
spaces $\mathbb{R}^n$. ‘Images’ are regions consisting of sets of vectors. Mathematical
morphology provides four basic ways of combining or simplifying images, viz. 
dilation, erosion, opening and closing. These are illustrated in Fig. 5.28. 

Intuitively, dilation adds regions together while, e.g., erosion is a way of 
removing ‘measuring idiosyncrasies’ from a region A by using region B as a 
kind of boundary smoothener. (If B is a circle, one can think of it as rolling 
tightly along the inside of A’s boundary, leaving only a smoother interior version 
of A.) More formally, dilation or Minkowski addition $\oplus$ is a sum operation on
sets of vectors:


$A \oplus B = \{a + b : a \in A \text{ and } b \in B\}$   (dilation)

This is naturally accompanied by

$A \ominus B = \{a : a + b \in A \text{ for all } b \in B\}$   (erosion)


Openings and closings are combinations of dilations and erosions:


the structural opening of A by B:   $(A \ominus B) \oplus B$
the structural closing of A by B:   $(A \oplus B) \ominus B$
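
The four operations are easy to experiment with on finite images. The sketch below is ours and makes some simplifying assumptions: regions are finite sets of integer vectors (tuples) rather than subsets of $\mathbb{R}^n$, and the structuring element $B$ is nonempty; the function names are illustrative.

```python
def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def dilate(A, B):
    """Minkowski addition: all sums a + b."""
    return {add(a, b) for a in A for b in B}

def erode(A, B):
    """All a such that a + b lies in A for every b in B (B nonempty)."""
    if not B:
        raise ValueError("erosion by the empty set is the whole space")
    b0 = next(iter(B))
    # Any such a satisfies a + b0 in A, so it suffices to test these candidates.
    candidates = {sub(a, b0) for a in A}
    return {a for a in candidates if all(add(a, b) in A for b in B)}

def opening(A, B):
    return dilate(erode(A, B), B)

def closing(A, B):
    return erode(dilate(A, B), B)

# Hypothetical one-row image with an isolated pixel at (5, 0).
A = {(0, 0), (1, 0), (2, 0), (5, 0)}
B = {(0, 0), (1, 0)}
```

Here `opening(A, B)` drops the isolated pixel, since no translate of $B$ fits inside $A$ around it, while `closing(A, B)` returns $A$ unchanged.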


Mathematical morphology also employs the usual boolean operations on
regions: intersection, union, and complement. In addition to topology and
geometry, this is yet another mathematical theory of the spaces $\mathbb{R}^n$, this time focusing
on their vector structure. Evidently, the above operations are only a small
sub-calculus, chosen for its computational utility and expressive perspicuity.


Figure 5.28. (a) Regions A and B of the vector space $\mathbb{R}^2$; (b) dilating A by B; (c) eroding A
by B; (d) closing A by B; (e) opening A by B.


5.2 First step: links with linear logic 


Mathematical morphology is not normally thought of in terms of logic, but 
on reflection, it does have a logical flavor. The Minkowski operations behave 
like notions of propositional logic in some procedural mode. Dilation is like a 
logical conjunction $\oplus$, and erosion like an implication $\to$, as seems clear from
their definitions (‘combining an A and a B’, and ‘if you give me a B, I will
give you an A’). The two are related by the following residuation law:


$A \oplus B \subseteq C$ iff $A \subseteq B \to C$


(here $B \to C$ denotes the erosion $C \ominus B$), which is also typical for conjunction
and implication. Nevertheless, there are
also some differences. For instance, $A \oplus A$ is not in general equal to $A$.
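
The residuation law can be confirmed by brute force on a small universe. The following sketch is an illustration of ours: it reads $B \to C$ as the set of all $a$ with $a + b \in C$ for every $b \in B$, takes regions to be finite sets of one-dimensional integer vectors, and restricts $B$ to nonempty sets; the names are hypothetical.

```python
from itertools import combinations

def dilate(A, B):
    return {tuple(x + y for x, y in zip(a, b)) for a in A for b in B}

def arrow(B, C):
    """B -> C: all a with a + b in C for every b in B (B nonempty)."""
    b0 = next(iter(B))
    candidates = {tuple(x - y for x, y in zip(c, b0)) for c in C}
    return {a for a in candidates
            if all(tuple(x + y for x, y in zip(a, b)) in C for b in B)}

def subsets(points):
    return [set(c) for k in range(len(points) + 1)
            for c in combinations(points, k)]

universe = [(0,), (1,), (2,)]
regions = subsets(universe)

# A (+) B included in C  iff  A included in B -> C, for all small A, B, C.
residuation_holds = all(
    (dilate(A, B) <= C) == (A <= arrow(B, C))
    for A in regions for B in regions if B for C in regions
)

# Unlike classical conjunction, dilation is not idempotent.
A = {(0,), (1,)}
not_idempotent = dilate(A, A) != A
```

The check ranges over all 8 x 7 x 8 triples of regions drawn from three points, which is of course no proof, only a sanity test of the law.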

A first logical calculus for these operations (not yet ‘modal’, but see Sec. 
5.3) is known as multiplicative linear logic in computer science and as the 
Lambek calculus with permutation in linguistics (Troelstra, 1992; Kurtonina, 
1995). The calculus derives ‘sequents’ of the form $A_1, \dots, A_k \Rightarrow B$ where
each expression $A_i$, $B$ in the current setting stands for a region, and the intended
interpretation, in our case, says that


The sum of the A’s is included in the region denoted by B.


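Here are the derivation rules, starting from basic axioms $A \Rightarrow A$:


product rules:
$$\frac{X \Rightarrow A \qquad Y \Rightarrow B}{X, Y \Rightarrow A \oplus B} \qquad\qquad \frac{X, A, B \Rightarrow C}{X, A \oplus B \Rightarrow C}$$

arrow rules:
$$\frac{A, X \Rightarrow B}{X \Rightarrow A \to B} \qquad\qquad \frac{X \Rightarrow A \qquad B, Y \Rightarrow C}{X, A \to B, Y \Rightarrow C}$$

structural rules:
$$\frac{X \Rightarrow A}{\pi[X] \Rightarrow A}\ \text{(permutation)} \qquad\qquad \frac{X \Rightarrow A \qquad A, Y \Rightarrow B}{X, Y \Rightarrow B}\ \text{(cut)}$$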


Derivable sequents typically include: 


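(‘function application’)   $A, A \to B \Rightarrow B$
(‘function composition’)   $A \to B, B \to C \Rightarrow A \to C$


Other key examples are two ‘currying’ laws, provable using the $\oplus$ rules:


$(A \oplus B) \to C \Rightarrow A \to (B \to C)$
$A \to (B \to C) \Rightarrow (A \oplus B) \to C$.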


The major combinatorial properties of this calculus LL are known, including 
proof-theoretic cut elimination theorems, and decidability of derivability in NP 
time. Moreover, there are several formal semantics underpinning this calculus 
(algebraic, game-theoretic, category-theoretic, and possible worlds-style; see
van Benthem, 1991a). On top of all this, mathematical morphology provides a
new model for linear logic! 


Fact 5.91 (AIELLO AND VAN BENTHEM, 2002B) Every space $\mathbb{R}^n$ with
the Minkowski operations is a model for all LL-provable sequents. 


This soundness theorem shows that every sequent derivable in LL must be 
a valid principle of mathematical morphology. The converse seems an open 
question of independent interest. 

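Further laws of mathematical morphology ‘mix’ pure Minkowski operations
$\oplus$, $\to$ with standard boolean ones. E.g. they include the fact that $(A \cup B) \to C$
is the same as $(A \to C) \cap (B \to C)$. This requires adding boolean operations
to LL:

$$\frac{X, A \Rightarrow B}{X, A \cap C \Rightarrow B} \qquad \frac{X, A \Rightarrow B}{X, C \cap A \Rightarrow B} \qquad \frac{X \Rightarrow A \qquad X \Rightarrow B}{X \Rightarrow A \cap B}$$

$$\frac{X \Rightarrow A}{X \Rightarrow A \cup B} \qquad \frac{X \Rightarrow A}{X \Rightarrow B \cup A} \qquad \frac{X, A \Rightarrow B \qquad X, C \Rightarrow B}{X, A \cup C \Rightarrow B}$$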

5.3 Arrow logic and hybrid modalities 


But what about modal logic, the main theme of this chapter? We will see 
that there is a natural connection after all. 

The basic players in the above algebra of regions in a vector space are surely 
the vectors themselves. For instance, Fig. 5.28.a represents the region A as a set 
of vectors departing from the origin. Vectors come with some natural operations 
such as binary addition or unary inverse—witness the usual definition of a vector 
space. A vector v in our particular spaces may be viewed as an ordered pair of 
points $(o, e)$, with $o$ the origin and $e$ the end point. Pictorially, this is an arrow
from o to e. Now this provides a point of entry into one more area of modal 
logic. 

Arrow logic was developed in the early 1990’s as a modal theory of transitions 
or arrows structured by various relations. In particular, there is a binary modality 
for composition of arrows and a unary one for converse. The motivation comes 
from dynamic logics, treating transitions as objects in their own right, and from 
relational algebra, making pairs of points into separate objects. This allows for 
greater expressive power than the usual algebraic systems, while also lowering 
complexity of the core logics (Blackburn et al., 2001; van Benthem, 1996). For 
instance, the pair-interpretation has arrows as pairs of points $(a_o, a_e)$, and then
defines these semantic relations:


composition   $C(a_o, a_e)(b_o, b_e)(c_o, c_e)$ iff $a_o = c_o$, $a_e = b_o$, and $b_e = c_e$,
inverse       $R(a_o, a_e)(b_o, b_e)$ iff $a_o = b_e$ and $a_e = b_o$,
identity      $I(a_o, a_e)$ iff $a_o = a_e$.


An abstract model is a set of arrows as primitive objects, with three relations 
as above, and a valuation function as usual: 


DEFINITION 5.92 (ARROW MODEL) An arrow model is a tuple $M = (W, C, R, I, \nu)$
such that $C \subseteq W \times W \times W$, $R \subseteq W \times W$, $I \subseteq W$, and $\nu : P \to \mathcal{P}(W)$.


Such models have a wide variety of interpretations, from linguistic syntax to 
category theory (Venema, 1996). And they can be made even more useful by 
having a two-sorted version with both points and arrows as primitive objects. 
Now introduce a standard modal language with proposition letters, the identity
element $0$, monadic operators $\neg$, $-$, and a dyadic operator $\oplus$. The truth
definition then has the following obvious clauses:

$M, x \models p$          iff $x \in \nu(p)$
$M, x \models 0$          iff $Ix$
$M, x \models -\varphi$   iff $\exists y$: $Rxy$ and $M, y \models \varphi$
$M, x \models \neg\varphi$   iff not $M, x \models \varphi$
$M, x \models \varphi \vee \psi$   iff $M, x \models \varphi$ or $M, x \models \psi$
$M, x \models A \oplus B$   iff $\exists y \exists z$: $Cxyz$ and $M, y \models A$ and $M, z \models B$
$M, x \models A \to B$   iff $\forall y z$ ($Cyxz$ and $M, z \models A \to M, y \models B$).






































This language leads to a decidable basic system of arrow logic: 


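(5.3) $(p \vee q) \oplus r \leftrightarrow (p \oplus r) \vee (q \oplus r)$
(5.4) $p \oplus (q \vee r) \leftrightarrow (p \oplus q) \vee (p \oplus r)$
(5.5) $-(p \vee q) \leftrightarrow (-p \vee -q)$

(5.6) $p \wedge (q \oplus r) \to q \oplus (r \wedge (-q \oplus p))$.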


This connects relational algebra, linear logic, and categorial grammar (cf. Kur- 
tonina, 1995). 

Important to us here, however, is the obvious connection with vector spaces. 
Think of the abstract composition Cxyz as vector addition x = y + z, of 
converse $Rxy$ as vector inverse $x = -y$, and of identity elements $Ix$ as the
null-vector x = 0. Most modal topics now make immediate sense in linear 
algebra or mathematical morphology. E.g., arrow models support a natural 
notion of bisimulation, which will now compare vector spaces in coarser ways 
than their usual linear transformations. 

Next, there is valid reasoning. The laws of basic Arrow Logic represent 
obvious vector laws. E.g., to see the validity of (5.6), note that if a vector a is 
the composition of b and c, then c can also be written as the composition $-b \oplus a$.


Figure 5.29. Triangle axiom for arrow composition.
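
The vector reading of arrow models can be explored with a small evaluator. The sketch below is ours and rests on a loud assumption: instead of a real vector space it uses the finite cyclic group $\mathbb{Z}_4$, with $Cxyz$ read as $x = y + z$, $Rxy$ as $x = -y$, and $Ix$ as $x = 0$. Formulas are nested tuples, and validity is tested by running through all valuations; every name here is hypothetical.

```python
from itertools import combinations, product

M = 4              # size of the cyclic group Z_M (a finite stand-in for a vector space)
W = range(M)

def C(x, y, z): return x == (y + z) % M
def R(x, y): return x == (-y) % M
def I(x): return x == 0

def ext(f, val):
    """Set of arrows at which formula f holds under valuation val."""
    op = f[0]
    if op == "atom": return val[f[1]]
    if op == "zero": return {x for x in W if I(x)}
    if op == "not": return set(W) - ext(f[1], val)
    if op == "or": return ext(f[1], val) | ext(f[2], val)
    if op == "and": return ext(f[1], val) & ext(f[2], val)
    if op == "conv":
        s = ext(f[1], val)
        return {x for x in W if any(R(x, y) for y in s)}
    if op == "plus":
        a, b = ext(f[1], val), ext(f[2], val)
        return {x for x in W if any(C(x, y, z) for y in a for z in b)}
    raise ValueError(op)

def implies(f, g): return ("or", ("not", f), g)
def iff(f, g): return ("and", implies(f, g), implies(g, f))

def valid(f, atoms):
    """True if f holds at every arrow under every valuation of the atoms."""
    subsets = [set(c) for k in range(M + 1) for c in combinations(W, k)]
    return all(ext(f, dict(zip(atoms, choice))) == set(W)
               for choice in product(subsets, repeat=len(atoms)))

p, q, r = ("atom", "p"), ("atom", "q"), ("atom", "r")
triangle = implies(("and", p, ("plus", q, r)),
                   ("plus", q, ("and", r, ("plus", ("conv", q), p))))
commutativity = iff(("plus", p, q), ("plus", q, p))
```

On this model the triangle axiom (5.6) and commutativity of composition come out valid, while a non-law such as $p \to p \oplus p$ is refuted by taking $p$ true at the arrow 1 only.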


On top of the base system, special arrow logics have been axiomatized with 
a number of additional frame conditions (Marx, 1995; Mikulas, 1995) and a 
resolution calculus has been introduced for the purpose of knowledge representation
and spatial reasoning (Aiello and Ottens, 2007). In particular, the
vector space interpretation makes composition commutative and associative,
validating two further axioms:


(commutativity)   $p \oplus q \leftrightarrow q \oplus p$


(associativity)   $p \oplus (q \oplus r) \leftrightarrow (p \oplus q) \oplus r$

The key fact about composition is now the vector law

$a = b + c$ iff $c = a - b$


which derives the triangle axiom (5.6) (see Fig. 5.29).

Again the soundness of arrow logic is clear, and we can freely derive old 
and new laws of vector algebra in the above calculus. But our connection 
between arrow logic and mathematical morphology also raises an intriguing 
open problem: What is the complete axiomatization of arrow logic over the 
standard vector spaces $\mathbb{R}^n$?


6. Conclusions 


The work surveyed in this chapter suggests that modal logic is a natural 
medium for analyzing reasoning about spatial patterns. In particular, modal 
research into topological structure is showing signs of becoming a recognized 
topic with a fast-growing agenda, as shown in Sec. 2, 3. And this interface has 
interesting repercussions in both directions. Modal logic acquires new spatially 
inspired models and new questions about them, while topology acquires new 
modally inspired notions, such as bisimulation or complete calculi for language 
fragments. We have also shown briefly (in Sec. 4, 5) how the modal perspective 
can be extended to deal with further geometrical structure: affine, metric, and 
even vector-based, more in the spirit of classical graph-like relational models. 
Again, benefits of taking such a stance can flow in surprising ways. We have 
indicated, e.g., how spatial models can also inject new ideas into areas such 
as dynamic or epistemic logic. In our view, all this goes to show that, just as 
temporal logic has been for so long, spatial logic might become a focus for 
modal logic. 
1. Introduction


This handbook chapter explores some themes which relate general topology
and epistemic logic. The leading ideas are: (1) to review the connection
between the modal logic S4 and topology going back to the work of Alfred Tarski
and J. C. C. McKinsey in the 1940’s; (2) to discuss the epistemic interpretation
of topology; (3) to present the two-sorted semantics of topologic and to
discuss what is known about it, including some of the main completeness and
decidability results; (4) to present a topological semantics for the logic of belief
KD45 based on the derived set operation; and (5) to briefly mention related
work in a number of directions.


Topology and modal logic: a first look. One of the things which strikes one
when studying elementary (set-theoretic) topology is how easy it is. Notions
like open, closed, and dense seem intuitively transparent, and their basic properties
are easy to prove. In contrast, topology uses second-order notions,
as it reasons with both points and sets. This would imply that, like second-order
logic, topology ought to be computationally very difficult.

This intuitive tension between the two paradigms vanishes when one realizes 
that a large part of topology can be seen as a modal logic, i.e., as an epistemic 
logic which combines the notion of knowledge and effort. Recall that modal 
logics tend to be much easier than first order logic, let alone second order. 

Suppose that we have made a measurement, for example, that some velocity
$v$ is $50 \pm .5$. We interpret this as saying that $v$ is in the open set (49.5, 50.5) and
therefore anything which we know about $v$ must hold not only of $v$ itself, but
also of any $v'$ in the same interval. The interval (49.5, 50.5) thus becomes an equivalence
class for an appropriate S5 logic of knowledge. Thus the connection with
modal logic.

However, the notion of effort can also enter, as v might be measured more 
accurately with more effort. Following up on this basic intuition, the first two 
authors of this chapter developed a bimodal logic called topologic for studying 
elementary topology. This logic turned out to have a nice axiomatization and 
to be decidable. The original work was then followed up by further work by
Georgatos and also by Dabrowski in conjunction with the two original writers
(see Georgatos, 1993; Georgatos, 1994a; Georgatos, 1994b; Georgatos, 1997;
also, Heinemann, 1997; Heinemann, 1999b; Heinemann, 2001; Weiss, 1999).
We should mention that the original logic was defined for arbitrary subset spaces.
The subsequent logics consider extensions of this logic where restricted families
of sets were considered, closed under union or intersection. The bulk of our
chapter, Sections 4-8, is about topologic.

Another large part of the chapter comes from Steinsvold’s discovery that 
the notion of belief can also be given a topological meaning via the notion of 
derived set. This is covered in Sec. 10; the main reference is his dissertation 
(Steinsvold, 2006). 

This chapter may be considered a continuation of van Benthem and 
Bezhanishvili’s Ch. 5 of this handbook. Although we have written this chapter 
to be read on its own, readers would certainly benefit from a look at the many 
related discussions about different logics that can be found in Ch. 5. 





2. Perspectives 


This handbook deals with logic and space, more precisely with various logics 
formulated with different goals in mind. But in a certain sense, the goal is 
usually to take a common mathematical model of space and then to fashion 
logical tools to work with it. This chapter works in a different way. The overall 
points are to investigate notions such as knowledge, belief, and observation 
effort, and mathematical structures like topological spaces arise in the course 
of that investigation. So the modus operandi, and the overall goals, of our work 
are different. One should not expect insight into, say, algebraic topology from
our work. What we are after is rather a kind of reconstruction of the ideas 
underlying topology. 

Some perspective on our subject might be gleaned from comparing the ideas 
with those in a well-known source, Steven Vickers’ book Topology via Logic 
(see Vickers, 1989). One goal of that book is to explain the non-Hausdorff 
topologies that arise in computer science. This goal is relevant for our chapter, 
more so perhaps than for other chapters of this handbook. The overall message 
is that (p. 3) “topology is used to explain approximate states of information: the 
points include both approximate points and more refined points, and these relate 
to the topology by the property that if an open set contains an approximate point, 
then it must contain any refinement of it.” The book goes on (pp. 5—11) to discuss 
“finite observations,” and therefore aims at a reconstruction of topology in terms 
of logics of observations. The algebraization of a logic of finite observations is 
called a frame; it is a complete Heyting algebra under two operations $\wedge$ (binary)
and $\bigvee$ (infinitary). (Note that the opens of a topological space are a frame under
the corresponding set-theoretic operations.) This notion of a frame gives us a 
more topological notion, a locale; this is based on the frame morphisms into a 
special two-element frame. 

Now our work differs from Vickers’ in the sense that we add to the notion of 
observation a notion of effort. Our discussion begins in Sec. 4. Our treatment 
of effort is in a sense fairly crude: we take the subsets of a space as the pos- 
sible observations, and then more effort corresponds to a smaller set, a better 
approximation to being closer to the “real” point. We say that this is crude 
because it does not measure the amount of effort in any real sense. However, 
it is a refinement of treatments which do not include any modeling of effort 
whatsoever. We start in the next section by first discussing the classical work 
of Tarski and McKinsey. 


3. The original topological interpretation of modal logic: 
Tarski and McKinsey’s Theorem 


The project of relating topology to modal logic begins with work of Alfred 
Tarski and J.C.C. McKinsey (see McKinsey, 1941; McKinsey, 1944). The basic 
idea is to study the laws of the interior operation on subsets of a topological 
space and its dual, the closure operator. Suppose that $\mathcal{X} = (X, \mathcal{O})$ is a
topological space; this just means that $\mathcal{O}$ is a family of subsets of $X$ containing
the empty set $\emptyset$ and $X$ itself, and $\mathcal{O}$ is closed under arbitrary unions and finite
intersections. The family $\mathcal{O}$ is called a topology, and its elements are the open
sets of the space $\mathcal{X}$. The closed sets are the complements of the opens. For
any subset $A \subseteq X$, we define its interior $A^\circ$ to be the largest open subset of $A$.
Dually, the closure $\overline{A}$ is the smallest closed set including $A$. We say “dually”
here to illustrate one of the properties of these operations: $A^\circ = X \setminus \overline{(X \setminus A)}$.
In words, the interior of $A$ is the complement of the closure of the complement
of $A$. This holds for all subsets of all spaces; it is just this kind of general
property that we aim to study.

In order to study these notions, we introduce a logical language $\mathcal{L}_0$. It is a
modal language. We begin with an arbitrary but fixed set At of atomic
propositions and close under truth functions $\wedge$ and $\neg$ and the appropriate modalities;
for present purposes, we use the modality $I$ for the interior operation. (As we
have seen, $I$ and $C$ are interdefinable duals. So only one needs to be taken as
basic in the syntax, and then the other may be regarded as a defined symbol.)

The language $\mathcal{L}_0$ is interpreted on such a space $\mathcal{X}$ together with an
interpretation map into $X$, $i : \mathrm{At} \to \mathcal{P}(X)$. (We do not insist that each $i(p)$ is an
open set.) So for atomic $p$, $i(p)$ says which points satisfy $p$. We call $(X, \mathcal{O}, i)$ a
topological model. Then $i$ extends to all of $\mathcal{L}_0$ by interpreting negation as
complement relative to $X$, conjunction as intersection, and $C$ and $I$ as the closure
and interior operators respectively. In symbols, we have


$i(\neg\varphi) = X \setminus i(\varphi)$
$i(\varphi \wedge \psi) = i(\varphi) \cap i(\psi)$
$i(I\varphi) = (i(\varphi))^\circ$


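EXAMPLE 6.1 For a very easy example, consider the usual real line $\mathbb{R}$ with
$i(p) = \{1\}$. Then $i(Ip) = \emptyset$, $i(\neg p) = \mathbb{R} \setminus \{1\}$, and $i(I\neg p) = i(\neg p)$. Moreover,
one can check as well that for all $\varphi$, $i(\varphi)$ is always one of the sets $\emptyset$, $\mathbb{R}$, $\{1\}$, or
$\mathbb{R} \setminus \{1\}$.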


As with all attempts to study some phenomenon, the main idea here is that 
the basic properties of the boolean operations on sets (unions, intersection, and 
complement), and also the salient topological operations (interior and closure) 
correspond to sentences in the language, or rather to schemes of sentences. For 
example, consider another general fact: the interior of the intersection of two 
sets is the intersection of their interiors. This corresponds to the fact that for
all sentences $\varphi$ and $\psi$, and for all spaces $\mathcal{X}$ and all interpretations $i$ of whatever
atomic sentences we have, we also have


$i(I(\varphi \wedge \psi) \leftrightarrow (I\varphi \wedge I\psi)) = X$.


For another example, the fact that the interior operation is idempotent
corresponds in the same way to the scheme


$i((II\varphi) \leftrightarrow (I\varphi)) = X$.
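
On a finite space these general facts can be checked exhaustively. The following sketch is an illustration of ours: the three-point space with a chain of open sets is an arbitrary choice, and the function names are hypothetical. Interior and closure are computed straight from their definitions, so the duality between them is tested rather than assumed.

```python
from itertools import combinations

X = frozenset({1, 2, 3})
# A chain topology on X (an illustrative choice): the opens are nested.
opens = [frozenset(), frozenset({1}), frozenset({1, 2}), X]

def subsets(S):
    items = sorted(S)
    return [frozenset(c) for k in range(len(items) + 1)
            for c in combinations(items, k)]

def interior(A):
    """Largest open subset of A: the union of all opens contained in A."""
    result = frozenset()
    for U in opens:
        if U <= A:
            result |= U
    return result

def closure(A):
    """Smallest closed set including A: intersect all closed supersets of A."""
    result = X
    for U in opens:
        closed = X - U
        if A <= closed:
            result &= closed
    return result

all_sets = subsets(X)
meet_law = all(interior(A & B) == interior(A) & interior(B)
               for A in all_sets for B in all_sets)
idempotent = all(interior(interior(A)) == interior(A) for A in all_sets)
duality = all(interior(A) == X - closure(X - A) for A in all_sets)
```

Since the interpretation of a sentence is always some subset of $X$, quantifying over all subsets covers every instance of the two schemes on this particular space.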


One of the natural questions to ask about this language and its semantics is: 
can we characterize in an enlightening way the sentences $\varphi$ with the property
that for all topological models $(X, \mathcal{O}, i)$, $i(\varphi) = X$? We call such sentences
topologically valid. The reason for this is that we more generally write $x \models \varphi$
for $x \in i(\varphi)$. (This notation from model theory will be used throughout our
chapter.) Then the topologically valid sentences are those that are true at all
points in all spaces under all interpretations. Similarly, the topologically
satisfiable sentences are those sentences $\varphi$ for which some point belongs to $i(\varphi)$
in some topological model. And we say that a set $S \subseteq \mathcal{L}_0$ is topologically
satisfiable if there is a topological model with $\bigcap_{\varphi \in S} i(\varphi) \neq \emptyset$.

What Tarski and McKinsey proved is that the topologically valid sentences 
are exactly those provable in the logical system S4. This is a logical system
which had been proposed much earlier than their work. It is the system whose
axioms are the substitution instances of the tautologies of classical propositional
logic, and also the schemes below:


1  $I(\varphi \to \psi) \to (I\varphi \to I\psi)$
2  $I\varphi \to \varphi$
3  $I\varphi \to II\varphi$


(Note that the second of these can be read as saying that the interior of a set
is a subset of it, and the last is one direction of the idempotence of the interior
operation.) The rules of S4 are modus ponens, and also necessitation: from $\varphi$,
derive $I\varphi$. It should be mentioned that the earliest work on S4 did not present
this semantics, so that the Tarski-McKinsey work may be read as giving a nice 
semantics to an already-existing logical system. 


THEOREM 6.2 (TARSKI AND MCKINSEY) The interpretation of Lo in 
topological spaces is sound and complete in the sense that the following are 
equivalent: 


1  $\varphi$ is topologically valid.
2  $\varphi$ is provable in S4.


Moreover, the set of topologically valid sentences is decidable. 


Proof The soundness being routine, here is a sketch of the completeness. Our 
work here follows a recent presentation (Aiello et al., 2003). One considers the 
theories in S4; these are the maximal consistent sets of sentences. We show that 
each theory T is topologically satisfiable. This is equivalent to completeness. 
Actually, we shall show that all theories T are satisfiable in the same canonical 
topological model. 

Let $C(S4)$ be the set of theories in S4. For any $\varphi$, let


$\hat{\varphi} = \{T \in C(S4) : \varphi \in T\}$.

The family of sets $\{\widehat{I\varphi} : \varphi \in \mathcal{L}_0\}$ is the basis of a topology; this is tantamount
to the fact about intersections and interiors that we noted above. In this topology,
the basic open sets are those of the form $\{T : I\varphi \in T\}$, for some sentence $\varphi$.
We use the following canonical interpretation


(6.1)  $i(p) = \hat{p}$

for $p \in \mathrm{At}$. This gives a topological model which we also call $C(S4)$.
The main fact about $C(S4)$ is the following Truth Lemma:

$i(\varphi) = \hat{\varphi}$.

The proof is by induction on $\varphi$. The base case for atomic sentences is by
definition, and the steps for the propositional connectives use basic facts about
theories. The main work is in the induction step for $I\varphi$. Suppose first that
$T \in \widehat{I\varphi}$, so that $I\varphi \in T$. Then this set $\widehat{I\varphi}$ is a basic open set containing $T$. By
Scheme 2 of the logic, $\widehat{I\varphi} \subseteq \hat{\varphi}$. By induction hypothesis, $\hat{\varphi} = i(\varphi)$. In this
way, $T \in i(\varphi)^\circ = i(I\varphi)$.

Going the other way, suppose that $T \in i(I\varphi)$. Then there is some basic open
set around $T$, say $\widehat{I\psi}$, included in $i(\varphi)$. By induction hypothesis, $\widehat{I\psi} \subseteq \hat{\varphi}$. We
claim now that $I\psi \to \varphi$ is provable in S4. (If not, then $I\psi \wedge \neg\varphi$ is consistent.
So there is some theory $U$ containing it, and then $U \in \widehat{I\psi}$ but $U \notin \hat{\varphi}$. This
contradicts $\widehat{I\psi} \subseteq \hat{\varphi}$.)
Using this claim and the necessitation rule, we can prove $I(I\psi \to \varphi)$. Using
Scheme 1, we get $II\psi \to I\varphi$. By Scheme 3, we then get $I\psi \to I\varphi$. Since
$I\psi \in T$, the sentence $I\varphi$
belongs to our original $T$. Hence $T \in \widehat{I\varphi}$, as desired.

The final assertion of our theorem, the decidability, is a standard effective
finite model result. The idea is that given a sentence $\varphi$, one can compute a
number $n = n(\varphi)$ such that if $\varphi$ is satisfiable on any model, then it is satisfied
on a model of size at most $n$. We get such a number by estimating the size of a
certain quotient of $C(S4)$, a quotient which of course depends on $\varphi$. We omit
the details here, but see Sec. 8 for another decidability result. QED


There are also stronger forms of Theorem 6.2. For example, one might 
well wonder whether S4 is complete for the topological operations on specific
natural spaces, such as the reals or the interval [0,1]. McKinsey and Tarski
showed that S4 is complete for every dense-in-itself separable metric space
(see McKinsey, 1944). This implies the completeness of S4 for all of the spaces
mentioned above. In recent years, there has been a series of papers simplifying 
the completeness arguments for these special cases; see, for example, Aiello 
et al., 2003. 


3.1 The preorder semantics of the same system S4 


We have already seen the topological interpretation of the logic S4. In what
follows, we need not only this semantics but the more standard relational, or
Kripke, semantics of this and other logical systems. This section reviews this
topic.

The syntax of modal logic is similar to what we have already seen. It begins
with some set At of atomic sentences (or propositions) and then considers the
closure of this set under the boolean operations of $\wedge$ and $\neg$ (and others, say
by abbreviation), and some modal operators. We shall continue to use $I$ as the
one operator for now. (It is more standard to use symbols like $\Box$ and $K$ for the
operators, with duals $\Diamond$ and $L$. We shall see these in later parts of our chapter.)

The semantics begins with a frame, a set whose elements are called worlds
or points together with a binary relation on it. This relation is sometimes called
accessibility, and symbols like $\to$, $\leq$, or $R$ have been used for it. To interpret
S4, read $I$ not as interior but rather as “all points which the current point relates
to”. Then reading the schemes of S4 this way suggests that the accessibility
relation be a preorder (transitive and reflexive): for example, $I\varphi \to \varphi$ says
that if something is true of all points which the current point relates to, then
it is true at the current point itself. To get a sound interpretation of S4, we
should require that the current point is related to itself. We therefore define a
preorder model to be a triple $\mathcal{X} = (X, \leq, i)$, where $(X, \leq)$ is a preorder and
again $i : \mathrm{At} \to \mathcal{P}(X)$.

The semantics is the same as for topological models, except that now the 
clause for $I$ becomes 


$i(I\phi) = \{x : {\uparrow}x \subseteq i(\phi)\}$ 


Here ${\uparrow}x$ stands for $\{y : x \le y\}$. The semantics of $C$ is given by duality, so that 
$C\phi \leftrightarrow \neg I\neg\phi$ is valid by definition. We again define validity and satisfiability 
in preorder models just as with topological models, mutatis mutandis. 


THEOREM 6.3 The interpretation of $\mathcal{L}_0$ in preorders is sound and complete 
in the sense that the following are equivalent: 


1 $\phi$ is valid in preorder models. 


2 $\phi$ is provable in S4. 


One can prove this result in the same manner as Theorem 6.2. We use the 
set $C(S4)$ of theories in S4 and define a preorder on them by 


(6.2) $T \le U$ iff $\phi \in U$ whenever $I\phi \in T$. 


One also uses the canonical interpretation from (6.1). The rest of the complete- 
ness argument is standard, and any textbook on modal logic would contain it. 
Instead of giving the details, we present an alternative approach based on the 
connection between certain topological spaces and preorders. 
Let $\mathcal{X} = (X, \le)$ be a preorder. Consider the Alexandrov topology on $X$: 
the opens are the sets closed upwards in the order. This gives a topology which 
we call $\mathcal{O}_\le$. This associates topological spaces to preorders. (Actually, the 
reflexivity is not used in verifying that we have a topology, a fact which will 
come into play in Sec. 11.2.) It also associates topological models to preorder 
models, just by copying the interpretation $i$. (Actually, this gives a very special 
kind of space: the opens are closed under arbitrary intersections.) 


PROPOSITION 6.4 For all preorder models $(X, \le, i)$, all $x \in X$, and all 
$\phi \in \mathcal{L}_0$, 
$x \models \phi$ in $(X, \le, i)$ iff $x \models \phi$ in $(X, \mathcal{O}_\le, i)$. 


Proof By induction on $\phi \in \mathcal{L}_0$. The case of the atomic sentences is trivial, as 
are the induction steps for the boolean connectives. 

Assume the proposition for $\phi$. So $i(\phi)$ is the same for both interpretations. 
Consider $I\phi$. First, assume that $x \in i(I\phi)$ in the preorder sense. That is, every 
$y \ge x$ is in $i(\phi)$. Now ${\uparrow}x$ is an open set, and by reflexivity it contains $x$. As 
we have just seen, it is included in $i(\phi)$. So $x \in i(I\phi)$ in the topological sense. 
Conversely, if $x \in i(I\phi)$ topologically, then there is some $y$ such that $x \in {\uparrow}y$ 
and ${\uparrow}y \subseteq i(\phi)$. But then $y \le x$. By transitivity, ${\uparrow}x \subseteq {\uparrow}y$. So ${\uparrow}x \subseteq i(\phi)$. Thus 
$x \in i(I\phi)$ in the preorder sense. QED 
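
The induction step for $I$ can be checked mechanically on a small finite preorder. The following is only a sketch under stated assumptions: the four-point preorder, and all function and variable names, are made up for illustration and do not come from the chapter. It computes the interior operator in both ways and compares the results for every possible extension of $\phi$.

```python
from itertools import combinations

def subsets(points):
    """All subsets of a finite set, as frozensets."""
    pts = sorted(points)
    return [frozenset(c) for r in range(len(pts) + 1) for c in combinations(pts, r)]

def up(x, X, le):
    """The up-set of x: every y with x <= y."""
    return frozenset(y for y in X if (x, y) in le)

def preorder_interior(S, X, le):
    """Preorder clause: x is in i(I phi) iff the up-set of x lies inside S = i(phi)."""
    return frozenset(x for x in X if up(x, X, le) <= S)

def alexandrov_opens(X, le):
    """Opens of the Alexandrov topology: the subsets closed upwards in the order."""
    return [U for U in subsets(X) if all(up(x, X, le) <= U for x in U)]

def topological_interior(S, opens):
    """Topological clause: x is in i(I phi) iff some open contains x and lies inside S."""
    return frozenset(x for U in opens if U <= S for x in U)

# Hypothetical preorder: 0 <= 1 <= 3 and 0 <= 2 <= 3, plus all reflexive pairs.
X = frozenset({0, 1, 2, 3})
LE = {(x, x) for x in X} | {(0, 1), (0, 2), (0, 3), (1, 3), (2, 3)}
OPENS = alexandrov_opens(X, LE)

def semantics_agree():
    """Compare the two clauses for I on every subset S of X."""
    return all(preorder_interior(S, X, LE) == topological_interior(S, OPENS)
               for S in subsets(X))
```

Because the atomic and boolean cases are identical in the two semantics, agreement of the two interior operators on all subsets is exactly what the induction step needs.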


Using this easy result, Theorem 6.2 follows from Theorem 6.3. That is, every 
S4 theory is satisfied on some preorder, hence on some topological space. But 
with a little more work, Theorem 6.3 can be made to follow from Theorem 6.2, 
or rather from the related fact that sentences which are satisfiable in S4 have 
finite topological models. Such finite models are Alexandrov spaces: every point has a 
minimal open set around it. We turn a topological model which is an Alexandrov 
space into a preorder model by setting $x \le y$ iff $y$ belongs to every open set around 
$x$; the Alexandrov property implies the transitivity. Then one notes a result 
just like Proposition 6.4, except now we relate the topological semantics on an 
Alexandrov space to the preorder semantics derived from it. The upshot is that 
a given topologically satisfiable sentence $\phi$ now has a finite preorder model. 
This implies the completeness of S4 in the relational semantics. 


3.2 Adding the difference modality 


One of the purposes of this chapter is to mention other work that builds 
on classical topics. We mention here recent work (Gabelaia, 2001; Kudinov, 
2006) which adds a modal operator to the language we have been discussing. 
Kudinov’s paper is the source for all results in this subsection and contains other 
material as well. Add to $\mathcal{L}_0$ the operator $[\neq]$ to get a larger language $\mathcal{L}_1$. For 
the semantics, we stipulate that 


$x \models [\neq]\phi$ iff for all $y$ different from $x$, we have $y \models \phi$. 





So we can read $[\neq]\phi$ as “$\phi$ holds everywhere except possibly here.” It is also 
useful to adopt an abbreviation $K\phi$ for $\phi \wedge [\neq]\phi$ (Kudinov uses $[\forall]$, but we 
use $K$ later for just this purpose). As usual we include for convenience a dual 
modality $\langle\neq\rangle$ so that $\langle\neq\rangle\phi$ abbreviates $\neg[\neq]\neg\phi$. Then as axioms, one takes S4 
for $I$ (just as we have seen), together with the following schemes: 


$[\neq](\phi \to \psi) \to ([\neq]\phi \to [\neq]\psi)$ 
$\phi \to [\neq]\langle\neq\rangle\phi$ 
$(\phi \wedge [\neq]\phi) \to [\neq][\neq]\phi$ 
$K\phi \to I\phi$ 


For rules we take the necessitation rules for both $I$ and $[\neq]$. This axiom 
system is called S4D. It is easy to see that S4D is sound for all topological 
interpretations. 

Now in this logic we can express some interesting topological properties. The 
examples we have in mind are density-in-itself and $T_1$. (The first condition 
means that there are no isolated points: $\{x\}$ is not open. The second means 
that for every $x \neq y$, there is an open set containing $x$ but not $y$.) These are 
examples of correspondence phenomena, and the formal statements involve 
quantification over interpretations of the atomic sentences in the model, in the 
following way. 


PROPOSITION 6.5 Let $\mathcal{X} = (X, \mathcal{O})$ be a topological space. 
1 The following are equivalent: 


(a) $\mathcal{X}$ is dense in itself. 
(b) For all sentences $\phi$, all interpretations $i$ into $\mathcal{X}$, and all $x \in X$, we 
have $x \models [\neq]\phi \to C\phi$. 


2 The following are equivalent: 


(a) $\mathcal{X}$ is a $T_1$ space. 
(b) For all sentences $\phi$, all interpretations $i$ into $\mathcal{X}$, and all $x \in X$, we 
have $x \models [\neq]\phi \to [\neq]I\phi$. 
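
On finite spaces, both correspondences can be tested by brute force. The sketch below is illustrative only: the three sample topologies and every name in it are assumptions made here. Since any sentence under any interpretation has some subset of $X$ as its extension, and every subset is the extension of an atom under a suitable interpretation, it is enough to let the extension $S$ of $\phi$ range over all subsets.

```python
from itertools import combinations

def subsets(points):
    """All subsets of a finite set, as frozensets."""
    pts = sorted(points)
    return [frozenset(c) for r in range(len(pts) + 1) for c in combinations(pts, r)]

def in_closure(x, S, opens):
    """x |= C phi: every open containing x meets S = i(phi)."""
    return all(U & S for U in opens if x in U)

def in_interior(x, S, opens):
    """x |= I phi: some open containing x is included in S = i(phi)."""
    return any(x in U and U <= S for U in opens)

def dense_in_itself(X, opens):
    """No singleton is open."""
    return all(frozenset({x}) not in opens for x in X)

def is_t1(X, opens):
    """For every x != y, some open contains x but not y."""
    return all(any(x in U and y not in U for U in opens)
               for x in X for y in X if x != y)

def validates_density_scheme(X, opens):
    """[!=]phi -> C phi holds at every point, for every extension S of phi."""
    return all(in_closure(x, S, opens)
               for S in subsets(X) for x in X if (X - {x}) <= S)

def validates_t1_scheme(X, opens):
    """[!=]phi -> [!=]I phi holds at every point, for every extension S of phi."""
    return all(in_interior(y, S, opens)
               for S in subsets(X) for x in X if (X - {x}) <= S
               for y in X if y != x)

# Three hypothetical topologies on a three-point set.
X = frozenset({0, 1, 2})
INDISCRETE = [frozenset(), X]
DISCRETE = subsets(X)
NESTED = [frozenset(), frozenset({0}), X]
SPACES = [INDISCRETE, DISCRETE, NESTED]

def correspondences_hold():
    """Both equivalences of the proposition, checked on the sample spaces."""
    return all(dense_in_itself(X, O) == validates_density_scheme(X, O)
               and is_t1(X, O) == validates_t1_scheme(X, O)
               for O in SPACES)
```

Finite $T_1$ spaces are discrete, so finite examples cannot exercise both properties at once; the check only illustrates the two equivalences separately.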


Correspondence results for the $T_0$ and $T_1$ properties were first obtained 
in Gabelaia, 2001. The main results about the logics in Proposition 6.5 are 
the following completeness theorems. For example, the second statement below 
means that for a sentence $\phi \in \mathcal{L}_1$, $\phi$ is provable in the system axiomatized 




by S4D with the extra scheme $[\neq]\phi \to C\phi$ iff $\phi$ holds in every space $\mathcal{X}$ which 
is dense in itself, at all points, under all valuations. 


THEOREM 6.6 (KUDINOV, 2006) 
1 S4D is complete for topological spaces. 


2 S4D + “$[\neq]\phi \to C\phi$” is complete for spaces which are dense in themselves. 


3 S4D + “$[\neq]\phi \to C\phi$” + “$[\neq]\phi \to [\neq]I\phi$” is complete for spaces which 
are dense in themselves and also $T_1$. 


4. Topologic 


At this point we have seen the topological interpretation of modal logic. This 
topic has been pursued in many directions over the years; see Ch. 5 for a survey. 
It is not exactly the thrust of this chapter, however. Instead, we strike out in 
a different direction by considering a bimodal language interpreted on a larger 
class of models. This language was first considered in Moss and Parikh, 1992. 

A subset frame is a pair $\mathcal{X} = (X, \mathcal{O})$ where $X$ is a set of points and $\mathcal{O}$ is a set 
of subsets of $X$. The elements of $\mathcal{O}$ are called opens. We assume that $X \in \mathcal{O}$, 
though this is really not necessary. $\mathcal{X}$ is an intersection frame if whenever 
$u, v \in \mathcal{O}$ and $u \cap v \neq \emptyset$, then also $u \cap v \in \mathcal{O}$. $\mathcal{X}$ is a lattice frame if it is an 
intersection frame closed under finite unions, and a complete lattice frame if it 
is closed under infinitary intersections and unions. 

Note that we use the term “open” for simplicity even though it is not required 
that O be a topology. It is just that the topological case is our paradigm case, 
and the basis of our intuitions. 

We now set up a formal language $\mathcal{L}$ which is expressive enough for simple 
arguments concerning subset spaces. Later we shall expand this language. The 
formulas of $\mathcal{L}$ are obtained from atomic propositions by closing under $\wedge$, $\neg$, 
$K$ and $\Box$. 

A subset space is a triple $\mathcal{X} = (X, \mathcal{O}, i)$, where $(X, \mathcal{O})$ is a subset frame, 
and $i : At \to \mathcal{P}(X)$. (We do not require that each $i(p)$ be an open.) If $(X, \mathcal{O})$ 
is an intersection frame, then $\mathcal{X}$ is called an intersection space, and similarly 
for lattice spaces, etc. (Often we simply speak of models.) For $p \in X$ and 
$p \in u \in \mathcal{O}$, we define the satisfaction relation $\models_{\mathcal{X}}$ on $(X \times \mathcal{O}) \times \mathcal{L}$ by 
recursion on $\phi$. 











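$p, u \models_{\mathcal{X}} A$ iff $p \in i(A)$ 
$p, u \models_{\mathcal{X}} \phi \wedge \psi$ iff $p, u \models_{\mathcal{X}} \phi$ and $p, u \models_{\mathcal{X}} \psi$ 
$p, u \models_{\mathcal{X}} \neg\phi$ iff $p, u \not\models_{\mathcal{X}} \phi$ 
$p, u \models_{\mathcal{X}} K\phi$ iff $q, u \models_{\mathcal{X}} \phi$ for all $q \in u$ 
$p, u \models_{\mathcal{X}} \Box\phi$ iff $p, v \models_{\mathcal{X}} \phi$ for all $v \in \mathcal{O}$ such that $p \in v \subseteq u$. 

We use $L$ as the dual of $K$ and $\Diamond$ as the dual of $\Box$. Explicitly, we have 


$p, u \models_{\mathcal{X}} L\phi$ iff $q, u \models_{\mathcal{X}} \phi$ for some $q \in u$ 
$p, u \models_{\mathcal{X}} \Diamond\phi$ iff $p, v \models_{\mathcal{X}} \phi$ for some $v \in \mathcal{O}$ such that $p \in v \subseteq u$. 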








We stress that we only use the notation $p, v \models \phi$ when $p$ belongs to $v$. 

As usual, we write $p, u \models \phi$ if $\mathcal{X}$ is clear from context. If $T \subseteq \mathcal{L}$, we write 
$T \models \phi$ if for all models $\mathcal{X}$, all $p \in X$, and all $u \in \mathcal{O}$, if $p, u \models_{\mathcal{X}} \psi$ for each 
$\psi \in T$, then also $p, u \models_{\mathcal{X}} \phi$. Finally, we also write, e.g., $T \models_{\mathrm{int}} \phi$ for the 
natural restriction of this notion to the class of models which are intersection 
spaces. 

With these definitions in place, we return to a discussion of the concepts 
and motivation. We are considering a Kripke structure whose worlds are the 
pairs $(p, u)$ with $p \in u$ and $u \in \mathcal{O}$. Think of $p$ as the “real world” and $u$ as 
a guess as to where that world lies based on some observation. The language 
then uses two accessibility relations corresponding to shrinking an open ($\Box$) 
while maintaining a reference point, or to moving a reference point inside the 
given open ($K$). Another way to think of these is in terms of quantification, so 
we recast the semantics in English a bit: $\Box\phi$ is true at $p, u$ if for all refinements 
$v$ of the observation $u$, $\phi$ is still true at $p, v$. And $K\phi$ is true at $p, u$ if for all 
$q \in u$, if the real world happens to be $q$ rather than $p$, $\phi$ is still true at that world 
$q$ with the same observation $u$. 

We see that the intuition behind this logic with its two modalities is that 
knowledge is affected not only by the situation we are in, but also by the 
amount of effort we have put in. For instance, recall our remarks introducing 
measurement in the Introduction. Suppose a policeman uses radar to determine 
that a car is going 51 mph in a 50 mph speed limit zone. If the accuracy 
of his radar is ±2 mph, then he does not know that the car is speeding. If, 
however, a more accurate radar with an accuracy of ±1 mph shows that the 
car is going 51.5 mph, then he does know that the car is speeding. Originally 
the possible speed of the car lay in the interval (49, 53), which is not entirely 
contained in the interval $(50, \infty)$ which represents speeding. The second interval, 
however, is (50.5, 52.5); this is contained in $(50, \infty)$. We can represent 
the policeman’s later situation as symbolized by $K(\mathit{Speeding})$, and the earlier 
one as $\Diamond K(\mathit{Speeding}) \wedge L\Diamond K\neg(\mathit{Speeding})$. In the earlier case the motorist was 
speeding, and the policeman had the possibility of knowing this, but did not 
actually know it. 
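
The arithmetic of the radar example can be spelled out in a few lines. This is a minimal sketch, not part of the chapter: the function names are made up, observations are modeled as open intervals, and the "true speed" of 51.2 mph used in the usage note is a hypothetical value chosen only because it lies in both intervals.

```python
def possible_speeds(reading, accuracy):
    """Open interval of speeds compatible with a radar reading."""
    return (reading - accuracy, reading + accuracy)

def knows_speeding(interval, limit=50.0):
    """K(Speeding): the whole open interval lies inside (limit, infinity)."""
    low, _high = interval
    return low >= limit

def can_come_to_know_speeding(true_speed, interval, limit=50.0):
    """Diamond K(Speeding): some smaller open interval around the true
    speed, inside the current one, consists only of speeds over the limit."""
    low, high = interval
    return low < true_speed < high and true_speed > limit

earlier = possible_speeds(51.0, 2.0)   # the interval (49, 53)
later = possible_speeds(51.5, 1.0)     # the interval (50.5, 52.5)
```

With these definitions, `knows_speeding(earlier)` is false while `knows_speeding(later)` is true, and `can_come_to_know_speeding(51.2, earlier)` is true: the knowledge was attainable before it was attained.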

















4.1 Preliminary examples 


At this point, we present two simple spaces where we can interpret the lan- 
guage, and some formulas. These should help the reader to become familiar 
with the semantics. 
First, consider the case when $X$ is the set $\mathbb{R}$ of real numbers, and $\mathcal{O}$ is 
the standard topology on $\mathbb{R}$. Suppose that there are two atomic predicates $P$ 
and $I$, and that $i(P) = [0, 2]$, and $i(I) = \{x \in \mathbb{R} : x \text{ is irrational}\}$. Then 
$(1, (0, 3)) \models P$. Also, since $2.5 \in (0, 3)$, $(1, (0, 3)) \models L\neg P$. Moreover, 
$(1, (0, 3)) \models \Diamond KP$, since we can shrink $(0, 3)$ around 1 to $(.5, 1.1)$, say, and 
have the new neighborhood entirely inside the interpretation of $P$. On the other 
hand, $(2, (0, 3)) \not\models \Diamond KP$. The reason is that every open $u$ around 2 contains 
a point larger than 2, and so $(2, u) \models \neg KP$. We also write this last fact as 
$(2, (0, 3)) \models \Box L\neg P$. For a final example, $(0, \mathbb{R}) \models K\Box LI$, since every open 
set containing any real is non-empty and so contains an irrational. 

The next example itself will re-appear (in a slightly elaborated manner) in 
Sec. 6 as Example A. A picture of it may be found in Fig. 6.3 below. 

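Let $X = \{a, p, q, z_1, z_2\}$, and let $\mathcal{O}$ contain $X$ and 


$u_1 = \{a, p, z_1\}$    $u_2 = \{a, q, z_2\}$ 
$v_1 = \{a, z_1\}$    $v_2 = \{a, z_2\}$. 


Let $P$ and $Q$ be atomic sentences; we form a subset space $(X, \mathcal{O}, i)$ via 
$i(P) = \{p\}$    $i(Q) = \{q\}$. 


Note that $a, v_1 \models K\neg P$. The reason for this is that neither $a$ nor $z_1$ satisfies $P$. 
Since $v_1 \subseteq u_1$, we see that $a, u_1 \models \Diamond K\neg P$. So we have 


$a, u_1 \models (LP \wedge \neg LQ) \wedge \Diamond K\neg P$. 


Similarly, we have 


$a, u_2 \models (LQ \wedge \neg LP) \wedge \Diamond K\neg Q$. 

We conclude that 


(6.3) $a, X \models \Diamond((LP \wedge \neg LQ) \wedge \Diamond K\neg P) \wedge \Diamond((LQ \wedge \neg LP) \wedge \Diamond K\neg Q)$. 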
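
Because this space is finite, the claims above can be verified by a direct model checker. The sketch below assumes a made-up encoding of formulas as nested tuples and made-up helper names; the points, opens and interpretation are those of the five-point example, with $i(P) = \{p\}$ and $i(Q) = \{q\}$.

```python
def sat(model, p, u, f):
    """Decide whether p, u |= f in a finite subset space (requires p in u)."""
    _points, opens, val = model
    op = f[0]
    if op == "atom":
        return p in val[f[1]]
    if op == "not":
        return not sat(model, p, u, f[1])
    if op == "and":
        return sat(model, p, u, f[1]) and sat(model, p, u, f[2])
    if op == "K":    # move the point inside the same open
        return all(sat(model, q, u, f[1]) for q in u)
    if op == "box":  # shrink the open around the same point
        return all(sat(model, p, v, f[1]) for v in opens if p in v and v <= u)
    raise ValueError("unknown connective: %r" % (op,))

def atom(name): return ("atom", name)
def neg(f): return ("not", f)
def conj(f, g): return ("and", f, g)
def K(f): return ("K", f)
def box(f): return ("box", f)
def L(f): return neg(K(neg(f)))      # dual of K
def dia(f): return neg(box(neg(f)))  # dual of box

X = frozenset({"a", "p", "q", "z1", "z2"})
U1 = frozenset({"a", "p", "z1"})
U2 = frozenset({"a", "q", "z2"})
V1 = frozenset({"a", "z1"})
V2 = frozenset({"a", "z2"})
MODEL = (X, [X, U1, U2, V1, V2],
         {"P": frozenset({"p"}), "Q": frozenset({"q"})})

P, Q = atom("P"), atom("Q")
LEFT = conj(conj(L(P), neg(L(Q))), dia(K(neg(P))))   # holds at a, u1
RIGHT = conj(conj(L(Q), neg(L(P))), dia(K(neg(Q))))  # holds at a, u2
FORMULA_6_3 = conj(dia(LEFT), dia(RIGHT))
```

Evaluating `sat(MODEL, "a", X, FORMULA_6_3)` confirms (6.3), while `sat(MODEL, "a", X, LEFT)` is false: at the open $X$ both $LP$ and $LQ$ hold, so the shrinking step is essential.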


4.2 Further definitions 


Certain kinds of sentences will have special interest in our study. Given a 
model $\mathcal{X}$ and a sentence $\phi$, $\phi$ is persistent in $\mathcal{X}$ if for all $p, u, v$ so that $p \in v \subseteq u$, 
we have that if $p, u \models_{\mathcal{X}} \phi$ then $p, v \models_{\mathcal{X}} \phi$. $\phi$ is persistent if it is persistent in 
all $\mathcal{X}$. $\phi$ is bi-persistent if for all $\mathcal{X}$ and all $p, u, v$ so that $p \in v \subseteq u$, we have 
$p, u \models_{\mathcal{X}} \phi$ iff $p, v \models_{\mathcal{X}} \phi$. A sentence $\phi$ is reliable in a model $\mathcal{X}$ if $K\phi \to \Box K\phi$ 
is valid in $\mathcal{X}$. $\phi$ is reliable if it is reliable in every $\mathcal{X}$. In other words, once $\phi$ 
is known, we need not worry about its becoming false. A sentence of the form 
$K\Box\phi$ is itself always reliable. Reliable sentences represent reliable knowledge 
and have a rather intuitionistic flavor. However, our logic is classical, since we 
are trying to represent certain knowledge theoretic ideas in a classical setting, 
rather than use an intuitionistic setting where such ideas would be presupposed. 
If the topology is discrete, then the only reliable sentences will be persistent. In 
comparison, with the trivial topology, only tautologies will tend to be reliable 
in $\mathcal{X}$. Thus, for example, assuming that all boolean combinations of $i(A)$ and 
$i(B)$ are non-empty, then the only sentences involving $A$ and $B$ which are 
reliable will be tautologies. Note that when $v$ is a subset of $u$, then every 
reliable sentence known at $p, u$ is also known at $p, v$. This is in accord with the 
intuition that refining from $u$ to $v$ increases knowledge. 

Our language allows us to express certain basic topological notions. If $\mathcal{X}$ is 
indeed a topology, then a set $i(A)$ will be open iff every point in $i(A)$ has an 
open neighborhood contained entirely in $i(A)$ iff at any $p$ in $i(A)$, the sentence 
$\Diamond KA$ holds. Thus $i(A)$ is open iff the sentence $A \to \Diamond KA$ is valid in the 
model. Dually, $i(A)$ is closed iff the sentence $\Box LA \to A$ is valid in the model. 
It is not hard to see that with the obvious definitions, r.e. subsets of the natural 
numbers will satisfy the same knowledge theoretic sentences that opens do in 
a topological setting, and this, we believe, is the source of our intuition that 
there is similarity between open sets and r.e. sets. The set i(A) is dense iff the 
sentence OLA is valid and it is nowhere dense if O L—A is valid. 
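
The characterizations of open and closed sets stated in this paragraph, namely that $i(A)$ is open iff $A \to \Diamond KA$ is valid and closed iff $\Box LA \to A$ is valid, can be checked exhaustively on a small topology. The following is a sketch under stated assumptions: the three-point topology and all names are made up, and the two schemes are evaluated directly from the semantic clauses rather than through a general evaluator.

```python
from itertools import combinations

# A hypothetical topology on {0, 1, 2}: a chain of opens.
X = frozenset({0, 1, 2})
OPENS = [frozenset(), frozenset({0}), frozenset({0, 1}), X]

def subsets(points):
    """All subsets of a finite set, as frozensets."""
    pts = sorted(points)
    return [frozenset(c) for r in range(len(pts) + 1) for c in combinations(pts, r)]

def dia_K(p, u, S):
    """p, u |= Diamond K A: some open v with p in v, v inside u, and v inside S = i(A)."""
    return any(p in v and v <= u and v <= S for v in OPENS)

def box_L(p, u, S):
    """p, u |= Box L A: every open v with p in v and v inside u meets S = i(A)."""
    return all(v & S for v in OPENS if p in v and v <= u)

def open_scheme_valid(S):
    """A -> Diamond K A holds at every pair (p, u) with p in u."""
    return all(dia_K(p, u, S) for u in OPENS for p in u if p in S)

def closed_scheme_valid(S):
    """Box L A -> A holds at every pair (p, u) with p in u."""
    return all(p in S for u in OPENS for p in u if box_L(p, u, S))

def matches_topology():
    """Compare both schemes with openness and closedness for every S."""
    return all((S in OPENS) == open_scheme_valid(S)
               and ((X - S) in OPENS) == closed_scheme_valid(S)
               for S in subsets(X))
```

The check relies on the family of opens really being a topology; for a subset frame that is not closed under intersections the equivalences need not hold.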








4.3 Topologic and the Tarski-McKinsey semantics 


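We now relate the original topological interpretation of the modal logic $\mathcal{L}_0$ 
from Sec. 3 to the bimodal logic $\mathcal{L}$ that we have been discussing. Define a map 


${}^* : \mathcal{L}_0 \to \mathcal{L}$ 
by recursion: 
$A^* = A$ 
$(\phi \wedge \psi)^* = \phi^* \wedge \psi^*$ 
$(\neg\phi)^* = \neg(\phi^*)$ 
$(I\phi)^* = \Diamond K\phi^*$. 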


Recall that we interpret £ on all subset models, hence on all topological models. 
It is on this class that it makes sense to compare the two languages. 


PROPOSITION 6.7 For all $\phi$ in $\mathcal{L}_0$, all topological models $\mathcal{X}$, and all points 
$x \in X$, 


$x \models \phi$ iff $x, X \models \phi^*$. 


The proof uses the fact that $\phi^*$ is always bi-persistent. 
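
As a worked instance of the translation, take the sentence $I(A \wedge \neg IB)$, which is chosen here purely for illustration. Unfolding the recursive clauses one at a time, with each interior operator becoming $\Diamond K$, gives:

```latex
\begin{align*}
\bigl(I(A \wedge \neg IB)\bigr)^* &= \Diamond K\,(A \wedge \neg IB)^* \\
  &= \Diamond K\,\bigl(A^* \wedge (\neg IB)^*\bigr) \\
  &= \Diamond K\,\bigl(A \wedge \neg (IB)^*\bigr) \\
  &= \Diamond K\,(A \wedge \neg \Diamond K B).
\end{align*}
```

The result is built from atoms using only boolean operations and the compound operator $\Diamond K$.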

The point here is that the original language $\mathcal{L}_0$ corresponds to a fragment of 
the bimodal language $\mathcal{L}$. It should not be surprising that the larger language is 
strictly more expressive. 
Substitution instances of classical tautologies 

$K(\phi \to \psi) \to (K\phi \to K\psi)$        $\Box(\phi \to \psi) \to (\Box\phi \to \Box\psi)$ 
$K\phi \to (\phi \wedge KK\phi)$        $\Box\phi \to (\phi \wedge \Box\Box\phi)$ 
$L\phi \to KL\phi$ 

$(A \to \Box A) \wedge (\neg A \to \Box\neg A)$ for atomic $A$ 

$K\Box\phi \to \Box K\phi$ (the Cross Axiom) 

Rules: from $\phi \to \psi$ and $\phi$, infer $\psi$;    from $\phi$, infer $K\phi$;    from $\phi$, infer $\Box\phi$ 


Figure 6.1. The axioms and rules of the logic of subset spaces. 


PROPOSITION 6.8 The sentence $Lp$ is not equivalent on the class of topological 
spaces to $\phi^*$ for any sentence $\phi$ of $\mathcal{L}_0$. That is, there is no $\mathcal{L}_0$-sentence $\phi$ 
such that for all topological spaces $\mathcal{X}$, all $x \in X$, and all interpretations $i$ of 
atomic sentences in $\mathcal{X}$, 


$x \models \phi$ iff $x, X \models Lp$. 


Proof Here is a sketch. Consider two models, both with the same universe 
$\mathbb{R}$ of reals. In $M_1$, $i(p) = \emptyset$, and in $M_2$, $i(p) = \{1\}$. ($M_1$ was presented in 
Example 6.1.) An induction on $\psi \in \mathcal{L}_0$ shows that the interpretation of $\psi$ in the 
two models is the same except possibly for the point 1. In particular, $0 \models_1 \psi$ iff 
$0 \models_2 \psi$. Now suppose that $\phi$ exists as in our proposition. Note that $0, \mathbb{R} \models_1 \neg Lp$ 
and $0, \mathbb{R} \models_2 Lp$. It follows that $0 \models_1 \neg\phi$ and $0 \models_2 \phi$; this is a contradiction. QED 





5. A logical system: the subset space axioms 


One main technical goal of this chapter is to present axiomatizations of the 
validities of several classes of subset space models: all spaces, intersection 
spaces, lattice spaces, and complete lattice spaces. The logic of subset spaces 
is described by axioms and rules of inference in Fig. 6.1. 

These axioms and rules say that $K$ is S5-like, and $\Box$ is S4-like. We have 
an axiom of atomic permanence: $(A \to \Box A) \wedge (\neg A \to \Box\neg A)$. This is only 
sound for atomic $A$. The intuition is that since an atomic sentence $A$ is true at 
a point irrespective of which open we are considering, shrinking the open does 
not alter the truth value of $A$. 

Perhaps the characteristic axiom of the system is $K\Box\phi \to \Box K\phi$, called the 
Cross Axiom. It is the one axiom relating the two modalities. It is often used 
in its dual form $\Diamond L\phi \to L\Diamond\phi$. Let us check its soundness in this form. Fix a 
subset model $\mathcal{X}$, and assume that $p \in u \in \mathcal{O}$ has the property that $p, u \models \Diamond L\phi$. 
We must show that $p, u \models L\Diamond\phi$. Our assumption first gives an open $v \in \mathcal{O}$ 
such that $p \in v \subseteq u$ and $p, v \models L\phi$. From this, there is some $q \in v$ such 
that $q, v \models \phi$. Now $q \in u$ as well, and we use this point to show that indeed 
$p, u \models L\Diamond\phi$. That is, we claim that $q, u \models \Diamond\phi$. The reason again is that 
$q \in v \subseteq u$, and we have seen above that $q, v \models \phi$. 
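
The soundness argument for the dual form uses nothing about $\phi$ beyond the set of pairs $(q, v)$ at which it holds. That makes it easy to test by brute force: on a small frame, let the truth set $E$ of $\phi$ range over all sets of pairs. The sketch below is illustrative only; the frame and all names are made up here. It also searches for a truth set refuting the converse implication $L\Diamond\phi \to \Diamond L\phi$.

```python
from itertools import combinations

# A hypothetical subset frame on three points (not a topology).
OPENS = [frozenset(s) for s in ({0, 1, 2}, {0, 1}, {1, 2}, {1})]
PAIRS = [(p, u) for u in OPENS for p in sorted(u)]

def dia_L(p, u, E):
    """p, u |= Diamond L phi, where E is the set of pairs satisfying phi."""
    return any((q, v) in E for v in OPENS if p in v and v <= u for q in v)

def L_dia(p, u, E):
    """p, u |= L Diamond phi, where E is the set of pairs satisfying phi."""
    return any((q, v) in E for q in u for v in OPENS if q in v and v <= u)

def truth_sets():
    """Every set of pairs, each a candidate truth set for phi."""
    for r in range(len(PAIRS) + 1):
        for chosen in combinations(PAIRS, r):
            yield set(chosen)

def cross_axiom_holds():
    """Diamond L phi -> L Diamond phi at every pair, for every truth set."""
    return all(L_dia(p, u, E) or not dia_L(p, u, E)
               for E in truth_sets() for (p, u) in PAIRS)

def converse_counterexample():
    """Return (E, p, u) with L Diamond phi true but Diamond L phi false, if any."""
    for E in truth_sets():
        for (p, u) in PAIRS:
            if L_dia(p, u, E) and not dia_L(p, u, E):
                return E, p, u
    return None
```

On this frame `cross_axiom_holds()` succeeds, while `converse_counterexample()` finds a refutation of the converse, so the two modalities do not commute in both directions.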

The axioms and rules of inference are sound for subset spaces. The argument 
is routine, and perhaps the only interesting part concerns the Cross Axiom; we 
saw it just above. We defer discussion of completeness to Sec. 7 and decidability 
to Sec. 8. 


5.1 Axioms for spaces with closure properties: the topologic axioms 


We next consider what happens in set spaces with various closure properties. 
We list the axioms of interest in Fig. 6.2. 

A directed space is one where for every $p, u, v$ with $p \in u$ and $p \in v$ there is 
a $w \in \mathcal{O}$ such that $p \in w$ and $w \subseteq u \cap v$. An intersection space is one where we 
can take $w = u \cap v$. 

We check that the Weak-Directedness Axiom (WD) from Fig. 6.2 
is sound for directed spaces. Suppose that $x, u \models \Diamond\Box\phi$. Let $v \subseteq u$ be such 
that $x, v \models \Box\phi$. To see that $x, u \models \Box\Diamond\phi$, let $u' \subseteq u$ be an open containing $x$. Let $w \in \mathcal{O}$ be such that 
$x \in w \subseteq u' \cap v$. Then since $w \subseteq v$ we have $x, w \models \phi$. Hence $x, u' \models \Diamond\phi$. 
Since $u'$ is arbitrary, $x, u$ indeed satisfies $\Box\Diamond\phi$. 

As it happens, (WD) does not lead to a complete axiomatization of the valid 
sentences on intersection spaces. We discuss the incompleteness further in 
Sec. 6. 

We next consider the Union Axioms (Un). To check soundness on spaces 
closed under unions, suppose that $x, u^*$ satisfies the antecedent via $u$, $y$, and $v$ 
such that $x \in u \subseteq u^*$, $y \in u^*$, $y \in v \subseteq u^*$, $x, u \models \phi$, and $y, v \models \psi$. Let $w = u \cup v$. 
Then $w \subseteq u^*$. Clearly $x, w \models \Diamond\phi \wedge L\Diamond\psi$. Since each point of $w$ is either 
in $u$ or $v$, every point in $w$ has a neighborhood in which either $L\phi$ or $L\psi$ is 
satisfied. The reader might like to consider the scheme (Weak Un) and to see 
why it actually is weaker than (Un). 

The system whose axioms are the basic axioms together with the Weak- 
Directedness axiom and the Union Axiom will be called topologic. The idea 
is that topologic should be strong enough to support elementary topological 
reasoning. 














THEOREM 6.9 (GEORGATOS, 1993; GEORGATOS, 1994A) 

The topologic axioms are complete for topological spaces, indeed for complete 
lattice spaces. Moreover, any sentence satisfiable in any topological space is 
satisfied in a finite topological space. 


Axiom      Formal statement and comments 

WD         $\Diamond\Box\phi \to \Box\Diamond\phi$ 
           sound for weakly directed spaces 

Un         $\Diamond\phi \wedge L\Diamond\psi \to \Diamond[\Diamond\phi \wedge L\Diamond\psi \wedge K\Diamond L(\phi \vee \psi)]$ 
           sound for spaces closed under binary unions 

topologic  This is the set space axioms + (WD) + (Un). 
           sound for lattice spaces, complete for 
           topological spaces, even for complete lattice spaces 

Weak Un    $L\Diamond\phi \wedge L\Diamond\psi \to L\Diamond[L\Diamond\phi \wedge L\Diamond\psi \wedge K\Diamond L(\phi \vee \psi)]$ 
           weaker than (Un) 

CI         $\Box\Diamond\phi \to \Diamond\Box\phi$ 
           sound for set spaces closed under all intersections 
           follows from topologic axioms 

$M_n$      $(\Box L\Diamond\phi \wedge \Diamond K\psi_1 \wedge \cdots \wedge \Diamond K\psi_n) \to L(\Diamond\phi \wedge \Diamond K\psi_1 \wedge \cdots \wedge \Diamond K\psi_n)$ 
           (WD) + all $(M_n)$ is complete for directed spaces 


Figure 6.2. Axiom schemes for set spaces with additional closure properties. 


One way to obtain the completeness result goes by considering a canonical 
model, building on what we have seen in Sec. 3 on the more standard modal 
logic of topology. (See Dabrowski et al., 1996.) This proof also shows that the 
topologic axioms give a complete axiomatization of the validities on the smaller 
class of complete lattice spaces. Indeed, the canonical model of topologic turns 
out to be a complete lattice. Further, it turns out that (Un) cannot be replaced 
by (Weak Un) in this axiomatization. 

This work also has some model-theoretic corollaries which might be of 
interest. Here are two of them, again from Dabrowski et al., 1996. In the 
first, we recall that $\mathcal{L}_0$ is the modal logic of the interior operation $I$, and the 
map ${}^* : \mathcal{L}_0 \to \mathcal{L}$ is the embedding of $\mathcal{L}_0$ into the bimodal logic of interest 
here (this map was discussed in Sec. 4.3). We extend the map to sets by 
$S^* = \{\phi^* : \phi \in S\}$. 


THEOREM 6.10 For all $S$ and $\phi$ in $\mathcal{L}_0$, the following are equivalent: 
1 $S \vdash \phi$ in S4. 
2 $S^* \vdash \phi^*$ in topologic. 


In the second result, let $\Pi$ be the smallest set of sentences containing the 
atomic sentences, and closed under boolean operations and the operator $\Diamond K$. 
This is also the image of the *-translation mentioned above. It is easy to check 
that each $\pi \in \Pi$ is bi-persistent on all spaces closed under intersections; this 
can be proven formally using (WD). 


THEOREM 6.11 If $\phi$ is bi-persistent on the class of finite lattice spaces, then 
there is some $\pi \in \Pi$ so that $\vdash \phi \leftrightarrow \pi$ in topologic. 


Georgatos also showed that topologic has the finite model property and is 
therefore decidable. The logic of subset spaces and the logic of intersection 
spaces do not have the finite model property. However, in Sec. 8 we adapt 
filtration on certain non-standard models to show that the logic of subset spaces 
is decidable. 

Continuing, we consider the axioms (CI). These are the converses of the (WD) 
axioms. We claim first that (CI) is sound for complete intersection spaces. To 
see this, let $x$ be a point in some space $\mathcal{X}$, and assume that $x, v \models \Box\Diamond\phi$. 
Let $u_x$ be the intersection of all opens containing $x$. Since $u_x$ has no proper 
open subsets containing $x$, we have $x, u_x \models (\Box\phi \vee \Box\neg\phi)$. Since $u_x \subseteq v$, we also 
have $x, u_x \models \Diamond\phi$. Hence $x, u_x \models \Box\phi$, and therefore 
$x, v \models \Diamond\Box\phi$. On the other hand, (CI) is not sound for intersection 
spaces; this can be shown semantically using Example B of Sec. 6 below. 
Intuitively, closure under finite intersection alone is not enough to guarantee that 
the neighborhoods of a point eventually stabilize on $\phi$ or on $\neg\phi$. The interesting 
point is that this axiom is sound for lattice spaces. So by completeness, each 
instance is provable in topologic. We know of no purely syntactic argument 
for this. 

We end with a mention of the results on intersection spaces and directed 
spaces. As we noted before, the (WD) axioms are sound for intersection spaces, 
even for directed spaces. Since these classes are of topological interest, the 
reader of this handbook article might find them of interest. It turns out that 
(WD) is not complete for either class. As Weiss has shown in an unpublished 
note, the two classes of spaces have the same logic, and a characterization of 
either class by a set of axioms also leads to a characterization of the other. That 
turns out to require infinitely many axiom schemes which cannot be reduced to 
a finite set (see Weiss and Parikh, 2002). This can be achieved using the $(M_n)$ 
schemes; we omit the proof of their soundness for directed spaces and merely 
state the relevant result. 























THEOREM 6.12 (WEISS AND PARIKH, 2002) The set space axioms plus 
the axiom (WD) and the axioms $(M_n)$ are complete for directed spaces. 


The topics described in this section do not exhaust what is known in the field. 
For example, Georgatos, 1997 studies tree-like spaces. (Given two open sets 
in a tree-like topological space, they are either disjoint, or one is a subset of the 
other.) See also Sec. 9 for other results which use languages that are variants 
of the one presented in Sec. 4 and studied in this section. 
Figure 6.3. Example A. 


6. Further examples 


In this section, we present two further examples of set spaces and sentences. 
(The first of these was essentially introduced in Sec 4.1.) These examples are 
somewhat pathological, and we present them to justify some of the claims in 
the last section, and also to motivate some of the work in Sections 7 and 8. 


Example A. Let $X = \{a, p, q, z_1, z_2\}$, and let $\mathcal{O}$ contain $X$ and 


$u_1 = \{a, p, z_1\}$    $u_2 = \{a, q, z_2\}$ 
$v_1 = \{a, z_1\}$    $v_2 = \{a, z_2\}$. 


Let $P$, $Q$, and $Z$ be atomic sentences. (Our work in Sec. 4.1 did not need $Z$.) 
We form a subset space $(X, \mathcal{O}, i)$ via 


$i(P) = \{p\}$    $i(Q) = \{q\}$    $i(Z) = \{z_1, z_2\}$. 


(See Fig. 6.3.) 

Our first observation is that the Weak Directedness axioms are validated in 
this model (despite the fact that the model is not actually closed under 
intersections). This is an instance of a very general fact. In a subset frame with only 
finitely many opens, every open $u$ about a point $p$ can be shrunk to a minimal 
open $v$ about $p$. When $v$ is minimal, $(p, v)$ automatically satisfies all sentences 
of the form $\phi \to \Box\phi$. In this way, all finite spaces satisfy the Weak Directedness 
axioms. 

Next, we see that the Weak Union axioms also hold (and again, the model is 
not actually closed under unions). Since there are a number of cases, we only 
present a few of them. Suppose that $p, u_1 \models \phi$ and $q, u_2 \models \psi$. Then for any 
$x \in X$ we have $x, X \models L\Diamond\chi$, where 


$\chi = L\Diamond\phi \wedge L\Diamond\psi \wedge K\Diamond L(\phi \vee \psi)$. 


The most interesting case is where, e.g., $p, u_1 \models \phi$ and $z_2, v_2 \models \psi$. Here, 
$z_1, v_1$ satisfies $\psi$ as well, since $v_1$ and $v_2$ are isomorphic. Thus for any $y \in u_1$ 
we have $y, u_1 \models \chi$. 

However, the stronger Union Axiom fails; for example 


$z_2, X \models \Diamond K(\neg Q) \wedge L\Diamond(LP \wedge K\neg Q)$ 
but $z_2, X \not\models \Diamond[LP \wedge K\neg Q]$. 





Note that, prefixed by an $L$, this sentence would be satisfied. This was the key 
idea for showing that the Weak Union Axiom is satisfied. 

To summarize: this space satisfies the Weak Union Axiom but not the Union 
Axiom. Hence the latter scheme is properly stronger. There is a second reason 
for introducing this example, having to do with the theories realizable in various 
spaces. 

This example can also be used to show that the subset space logic and Weak 
Directedness Axioms are incomplete for the class of intersection spaces. 

Concerning unions, we have seen in Sec. 4.1 that 


(6.4) $a, X \models \Diamond((LP \wedge \neg LQ) \wedge \Diamond K\neg P) \wedge \Diamond((LQ \wedge \neg LP) \wedge \Diamond K\neg Q)$. 


The witnesses are $u_1$ and $u_2$. Also 

(6.5) $a, X \models K(Z \to \neg(\Diamond(LP \wedge \neg LQ) \wedge \Diamond(LQ \wedge \neg LP)))$. 


Let $\psi$ be the conjunction of the sentences in (6.4) and (6.5). We claim that $\psi$ has 
no models which are closed under unions. For suppose $\mathcal{X}'$ were such a model. 
Let $u'$ and $v'$ be witnesses in $\mathcal{X}'$ to (6.4); let $u'_1$ and $v'_1$ be subsets witnessing 
$a, u' \models \Diamond K\neg P$ and $a, v' \models \Diamond K\neg Q$ respectively. Let $w' = u' \cup v'_1$ (we could 
also use $u'_1 \cup v'$). Let $z' \in v'_1$ be a $Z$-point. Then $z', w' \models LP \wedge \neg LQ$ while 
$z', v' \models LQ \wedge \neg LP$. So 


$z', X' \models Z \wedge \Diamond(LP \wedge \neg LQ) \wedge \Diamond(LQ \wedge \neg LP)$. 


This contradicts (6.5). 

The fact that the model satisfies the Weak Directedness Axioms and Weak 
Union Axioms shows that the basic axioms of topologic together with all of 
these axioms cannot refute $\psi$. Nevertheless $\psi$ cannot hold in any lattice 
space (even in any directed space). So the axioms are incomplete. (Nevertheless, 
we do have completeness for lattice spaces using the stronger Union 
Axioms.) 

Next, we present an example which shows a number of things about the 
theories realized in spaces closed under finite intersections. 


Example B. The space $\mathcal{X}$ has points 


$a_0, a_1, a_2, \ldots, a_n, \ldots$; $b_0, b_1, b_2, \ldots, b_n, \ldots$; and $c$. 

There are several families of opens: 


$u_n = \{c\} \cup \{a_m : m \ge n\} \cup \{b_m : m \ge n\}$ 
$v_n = \{c\} \cup \{a_m : m > n\} \cup \{b_m : m \ge n\}$ 
$w_n = \{a_n, b_n\}$ 
$w'_n = \{b_n\}$. 


(See Fig. 6.4.) We interpret three predicates $A$, $B$, and $C$ in the obvious way. 
Let 


last-and-B $= B \wedge \Box(KB \vee LC)$. 


Figure 6.4. Example B. 


Informally, last-and-B should hold at a pair $(d, s)$ only when $d$ is a $b$-point, 
and only when $d$ is the last element of $s$ (the element with lowest subscript). 
Note that $b_i, v_i \models$ last-and-B, since every open subset of $v_i$ which contains $b_i$ is either 
the singleton $\{b_i\}$ or contains $c$. Further, $b_i, u_i \models \neg$last-and-B, since $w_i \subseteq u_i$, 
and $b_i, w_i \models \neg(KB \vee LC)$. Finally, if $i > j$, then $b_i, v_j \models \neg$last-and-B for 
the same reason. So in this space last-and-B has the meaning that we have 
described. 

Another fact about this example is that if $i > j$ and $i' > j'$, then $th(a_i, u_j) = 
th(a_{i'}, u_{j'})$, and $th(a_i, v_j) = th(a_{i'}, v_{j'})$. Similar statements hold for the 
$b$-points, and also $th(c, u_i) = th(c, u_{i'})$, etc. All of these facts are proved 
by induction. Alternatively, one may use the appropriate version of Fraisse- 
Ehrenfeucht games for this semantics. 


This example also has important consequences for theories in the set space 
logic. Consider the theories $T = th(c, u_i)$ and $U = th(c, v_i)$. These theories 
are not equal, since $L$(last-and-B) is in the latter but not the former. Nevertheless, 
the two theories have the property that for every $\phi \in U$, $\Diamond\phi \in T$. In 
Sec. 7, we will introduce the notation $T \xrightarrow{\Diamond} U$ for this relation. In this example, 
we also see that $U \xrightarrow{\Diamond} T$. Now our intuition about the $\xrightarrow{\Diamond}$ relation is that 
$T \xrightarrow{\Diamond} U$ should mean that in every space containing $(d, s)$ with theory $T$, it is 
possible to shrink $s$ to $s' \subseteq s$ with $th(d, s') = U$. 

A further desirable result would be that if $T \xrightarrow{\Diamond} U \xrightarrow{\Diamond} T$, then $T = U$. 
However, the example under consideration shows that this is not in general 
correct. This accounts for some of the difficulties in the completeness proof for 
the logic of set spaces. 

A related fact is that the set space logic does not have the finite model property. 
To see this, consider $\phi = L$(last-and-B), and note that both $T$ and $U$ contain 


(6.6) $\psi = (\Box\Diamond\phi) \wedge (\Box\Diamond\neg\phi)$. 


We claim that no sentence of the form (6.6) can have a finite model. For 
suppose that $\mathcal{X}$ were a finite space containing $x$ and $u$ such that $x, u \models \psi$. We 
may assume that $u$ is a $\subseteq$-minimal open about $x$ with this property. But the 
minimality implies that $x, u \models \phi \wedge \neg\phi$, and this is absurd. 








7. Completeness of the subset space axioms 


The following is the main result on the subset space logic. 


THEOREM 6.13 The basic axioms are strongly complete for subset space 
models. That is, if $T \models \phi$, then $T \vdash \phi$. 


This section is an extended sketch of the proof, presenting many of the details 
and ideas but certainly leaving out a good deal of the work. The definitions at 
the beginning will be used in later sections, as will Propositions 6.14 and 6.15. 

As in Sec. 3, we use theories; these are the maximal consistent subsets of the
language $\mathcal{L}$. Let $th$ be the set of theories in $\mathcal{L}$ using the subset space axioms
from Sec. 5. We continue to use letters like $T$, $U$, $V$, etc., to denote theories.
In order to prove that we have given a complete proof system, we need only
show that for every theory $T$, there is a subset space model $\mathcal{X} = (X, \mathcal{O}, i)$, a
point $x \in X$, and a subset $u \in \mathcal{O}$ such that $x, u \models_{\mathcal{X}} T$.


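To get started, we define the relations $\xrightarrow{L}$ and $\xrightarrow{\Diamond}$ on theories by:


$U \xrightarrow{L} V$ iff whenever $\varphi \in V$, $L\varphi \in U$
$U \xrightarrow{\Diamond} V$ iff whenever $\varphi \in V$, $\Diamond\varphi \in U$.


Of course, the maximal consistency of theories gives other characterizations.
For example, $U \xrightarrow{L} V$ iff whenever $K\varphi \in U$, $\varphi \in V$.

Further, define $U \xrightarrow{L\Diamond} V$ if for all $\varphi \in V$, $L\Diamond\varphi \in U$. And define other
composite relations, such as $U \xrightarrow{\Diamond L} V$, similarly.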














PROPOSITION 6.14 Concerning the relations $\xrightarrow{L}$ and $\xrightarrow{\Diamond}$:

(1) $\xrightarrow{L}$ is an equivalence relation.

(2) $\xrightarrow{\Diamond}$ is reflexive and transitive.

(3) If $L\varphi \in T$, then there is some $U$ so that $\varphi \in U$ and $T \xrightarrow{L} U$.

(4) If $\Diamond\varphi \in T$, then there is some $U$ so that $\varphi \in U$ and $T \xrightarrow{\Diamond} U$.

Proof These are all standard consequences of the S4-ness of $\Box$ and the S5-ness
of $K$. QED


These facts will be used in the sequel without mention. In addition, we have 
the following consequence of the Cross Axiom. 


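PROPOSITION 6.15 Let $U$ and $V$ be theories, and suppose that there is a
theory $W$ such that $U \xrightarrow{\Diamond} W \xrightarrow{L} V$. Then there is a theory $T$ so that
$U \xrightarrow{L} T \xrightarrow{\Diamond} V$:


$$\begin{array}{ccc} U & \xrightarrow{\;L\;} & T \\ {\scriptstyle\Diamond}\downarrow & & \downarrow{\scriptstyle\Diamond} \\ W & \xrightarrow{\;L\;} & V \end{array}$$


Proof Let $S = \{\Diamond\varphi : \varphi \in V\} \cup \{\psi : K\psi \in U\}$. We claim that $S$ is
consistent; suppose towards a contradiction that it is not. Then there is a finite
subset of $S$ which is inconsistent. Now the two sets whose union is $S$ are
closed under conjunction, and moreover, $K$ commutes with conjunction. So
there are individual $\varphi$ and $\psi$ such that $\varphi \in V$, $K\psi \in U$ and $\vdash \Diamond\varphi \to \neg\psi$.
Therefore $\vdash L\Diamond\varphi \to L\neg\psi$; hence this sentence belongs to $U$. And also,
$\varphi \in V$, so $L\varphi \in W$ and $\Diamond L\varphi \in U$. So $L\Diamond\varphi \in U$ by the Cross Axiom. It
follows that $L\neg\psi \in U$. But since $K\psi \in U$, this gives the contradiction that
$U$ is inconsistent. So $S$ is consistent. Let $T \supseteq S$ be maximal consistent. By
construction, $U \xrightarrow{L} T \xrightarrow{\Diamond} V$. QED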


Proposition 6.15 is the embodiment of the Cross Axiom in the realm of 
theories. The next result is a generalization which will be used in the proof of 
completeness. 


PROPOSITION 6.16 Let $\mathcal{L} = (L, \leq, \top)$ be a finite, bounded, linear order
with top element $\top$, and let $T$ be an order preserving map from $\mathcal{L}$ to $(th, \xrightarrow{\Diamond})$.
Suppose that $T_\top \xrightarrow{L} U^*$. Then there is an order preserving $U : L \to th$ such
that $U_\top = U^*$, and for all $m \in L$, $T_m \xrightarrow{L} U_m$.


Proof $U$ is defined going down $\mathcal{L}$, using Proposition 6.15. QED


The first idea in proving Theorem 6.13 is to consider the canonical model 
and to show that every theory T is the theory of some pair p, u from that model. 
As we have seen, the collection $th$ of theories has relations $\xrightarrow{L}$ and $\xrightarrow{\Diamond}$ with
the cross property of Proposition 6.15. We shall call such a structure a cross
axiom frame; we can soundly interpret $\mathcal{L}$ on it using the two relations. (We
shall pursue this in Sec. 8.) The canonical cross axiom model shows that the 
subset space logic is complete for interpretations of £ in cross axiom models. 
However, the canonical cross axiom model is not a subset space, and there does 
not seem to be any straightforward way to turn it into one. 

Nevertheless, we would like completeness relative to subset spaces. Again, 
the natural idea is to try to “spatialize” the space of canonical theories (perhaps 
by taking as opens the $L$-equivalence classes). However, this construction cannot
work, for the following reason. Call a model $\mathcal{X}$ exact if for every $T$ there are
unique $p$ and $u$ so that $th_{\mathcal{X}}(p, u) = T$, and moreover $u \supseteq v$ iff
$th_{\mathcal{X}}(p, u) \xrightarrow{\Diamond} th_{\mathcal{X}}(p, v)$.
But in Example B we saw theories $T \xrightarrow{\Diamond} U \xrightarrow{\Diamond} T$ which are distinct. This
implies that there are no exact subset space models.

For this reason, we do not approach completeness via the canonical model. 
The strategy is to build a space $X$ of “abstract” points. We shall also have opens
given in an abstract way, via a poset $P$ and an antitone (i.e., order-reversing)
map $i : P \to \mathcal{P}^*(X)$. The points are abstract since they are not theories.
But with each $x$ and each $p$ so that $x \in i(p)$ we shall have a “target” theory
$t(x, p)$. The goal of the construction is to arrange that in the overall model,
$th(x, i(p)) = t(x, p)$.

We are not going to present all of the details here, only the basic ideas. We 
hope that the interested reader can fill in the rest, based on the following more 
concrete plan. 

One builds 


(1) A set $X$ containing a designated element $x_0$.

(2) A poset with least element $(P, \leq, \bot)$.

(3) A function $i : P \to \mathcal{P}^*(X)$ such that $p \leq q$ iff $i(p) \supseteq i(q)$, and $i(\bot) = X$.
(That is, a homomorphism from $(P, \leq, \bot)$ to $(\mathcal{P}^*(X), \supseteq, X)$.)

(4) A partial function $t : X \times P \to th$ with the property that $t(x, p)$ is defined
iff $x \in i(p)$. Furthermore, we require the following properties for all
$p \in P$, $x \in i(p)$, and $\varphi$:

(a.1) If $y \in i(p)$, then $t(x, p) \xrightarrow{L} t(y, p)$.

(a.2) If $L\varphi \in t(x, p)$, then for some $y \in i(p)$, $\varphi \in t(y, p)$.

(b.1) If $q \geq p$, then $t(x, p) \xrightarrow{\Diamond} t(x, q)$.

(b.2) If $\Diamond\varphi \in t(x, p)$, then for some $q \geq p$, $\varphi \in t(x, q)$.

(c) $t(x_0, \bot) = T$, where $T$ is the theory from above which we aim to
model.
Suppose we have $X$, $P$, $i$, and $t$ with these properties. Then we consider the
subset space


$\mathcal{X} = (X, \{i(p) : p \in P\}, i)$,

where, for atomic $A$, $i(A) = \{x : A \in t(x, \bot)\}$.

LEMMA 6.17 (THE TRUTH LEMMA) Assume conditions (1)-(4) for $X$, $P$,
$i$, and $t$. Then for all $x \in X$ and all $p \in P$ such that $x \in i(p)$,

$th_{\mathcal{X}}(x, i(p)) = t(x, p)$.

Proof We show by induction on $\varphi$ that $\varphi$ belongs to the set on the left iff it belongs
to the set on the right. We only give the inductive step for sentences $L\varphi$.
Suppose that $x, i(p) \models L\varphi$. Then there is some $y \in i(p)$ such that $y, i(p) \models \varphi$.
By induction hypothesis, $\varphi \in t(y, p)$. By property (4a.1), $t(x, p) \xrightarrow{L} t(y, p)$.
Therefore $L\varphi \in t(x, p)$. On the other hand, if $L\varphi \in t(x, p)$, then by property
(4a.2) there is some $y \in i(p)$ such that $\varphi \in t(y, p)$. By induction hypothesis,
$y, i(p) \models \varphi$. Therefore $x, i(p) \models L\varphi$. This concludes the induction step for $L$.

The induction step for $\Diamond$ is similar and uses (3), (4b.1), and (4b.2). QED


One builds $X$, $P$, $i$, and $t$ by recursion, in a step-by-step process that goes
on for countably many steps. This is not the place to enter into these details.
They are fairly straightforward and can be found in Dabrowski et al., 1996.

By the Truth Lemma and property (4c) above, $th_{\mathcal{X}}(x_0, i(\bot)) = t(x_0, \bot) = T$. So the
theory that we started with has a model. In the usual way, this proves the
Completeness Theorem.


8. Decidability of the subset space logic 


Despite the failure of the finite model property, we prove that the logic of 
subset spaces is decidable. The proof here is due to Krommes, 2003, simplifying 
the argument from Dabrowski et al., 1996. It goes by showing that a satisfiable
sentence $\varphi$ has a finite cross axiom model. This is a kind of pseudo-model for
this subject, to be defined shortly. The idea as always is that we move to a 
bigger class of models than the one we are primarily interested in, and this class 
will turn out to be better behaved. 


DEFINITION 6.18 A cross axiom frame is a tuple $(J, \xrightarrow{L}, \xrightarrow{\Diamond})$ such that $J$
is a set, $\xrightarrow{L}$ is an equivalence relation on $J$, $\xrightarrow{\Diamond}$ is a preorder on $J$, and
the following property holds: If $i \xrightarrow{\Diamond} j \xrightarrow{L} k$, then there is some $l$ such that
$i \xrightarrow{L} l \xrightarrow{\Diamond} k$. A cross axiom model is a cross axiom frame together with an
interpretation $i$ of the atomic symbols of $\mathcal{L}$.
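
The conditions of Definition 6.18 can be checked mechanically on a finite structure. The following is a minimal sketch, not part of the chapter's development: the function names (`subset_space_frame`, `is_cross_axiom_frame`, and so on) and the two small examples are hypothetical illustrations. It builds the frame of pairs $(x, u)$ from a finite family of sets and contrasts it with a hand-made structure in which only the cross property fails.

```python
def subset_space_frame(opens):
    """Frame of pairs (x, u) with x in u, built from a finite family of sets.

    (x, u) -L-> (y, u) for every y in u; (x, u) -dia-> (x, v) for opens v <= u.
    """
    points = [(x, u) for u in opens for x in u]
    rel_l = {(a, b) for a in points for b in points if a[1] == b[1]}
    rel_d = {(a, b) for a in points for b in points
             if a[0] == b[0] and b[1] <= a[1]}
    return points, rel_l, rel_d


def is_preorder(points, rel):
    reflexive = all((a, a) in rel for a in points)
    transitive = all((a, d) in rel
                     for (a, b) in rel for (c, d) in rel if b == c)
    return reflexive and transitive


def is_equivalence(points, rel):
    symmetric = all((b, a) in rel for (a, b) in rel)
    return symmetric and is_preorder(points, rel)


def has_cross_property(points, rel_l, rel_d):
    """If i -dia-> j -L-> k, then some l satisfies i -L-> l -dia-> k."""
    for (i, j) in rel_d:
        for (j2, k) in rel_l:
            if j2 == j and not any((i, l) in rel_l and (l, k) in rel_d
                                   for l in points):
                return False
    return True


def is_cross_axiom_frame(points, rel_l, rel_d):
    return (is_equivalence(points, rel_l)
            and is_preorder(points, rel_d)
            and has_cross_property(points, rel_l, rel_d))


# Frame arising from the subset frame with opens {0, 1} and {0}.
big, small = frozenset({0, 1}), frozenset({0})
good = subset_space_frame([big, small])

# A hand-made structure violating only the cross property:
# a -dia-> b -L-> c, but the only L-neighbour of a is a itself, and not a -dia-> c.
nodes = ["a", "b", "c"]
identity = {(n, n) for n in nodes}
bad_l = identity | {("b", "c"), ("c", "b")}
bad_d = identity | {("a", "b")}

print(is_cross_axiom_frame(*good))                # True
print(is_cross_axiom_frame(nodes, bad_l, bad_d))  # False
```

The first example reflects the remark that subset spaces are themselves cross axiom models; the second shows that the cross property is a genuine extra requirement beyond the equivalence and preorder conditions.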


Note that when we interpret the language on a cross axiom model, we have
a node on the left side of the turnstile. That is, we write, e.g., $j \models \varphi$ since there
are no sets involved.

The subset space logic is complete for interpretations in cross axiom models
since the latter include subset spaces.

Our main example of a cross axiom model which is not a subset space is the 
canonical model of the subset space logic: 

$C(ca) = (th, \xrightarrow{L}, \xrightarrow{\Diamond})$.

(The “ca” stands for “cross axiom”.) Proposition 6.15 says that $C(ca)$ actually
is a cross axiom model. The standard truth lemma for this structure shows that
for all $T \in th$, $th(T) = T$; that is, the set of sentences satisfied by the point $T$
in $C(ca)$ is $T$ itself. We shall use a version of filtration to prove a finite model
property.

Let $\varphi$ be any sentence which is consistent in the logic of set spaces, and
consider the following sets:


$\Gamma^{\neg}$ = all subformulas of $\varphi$ and their negations.

$\Gamma'$ = all conjunctions and disjunctions of (finite) subsets of $\Gamma^{\neg}$.

$\Gamma''$ = all conjunctions and disjunctions of (finite) subsets of $\Gamma'$.

$\Gamma^{KL} = \{L\psi : \psi \in \Gamma''\} \cup \{K\psi : \psi \in \Gamma''\}$.

$\Gamma = \Gamma'' \cup \Gamma^{KL}$.

Note that $\Gamma$ is finite. Note also that modulo propositional logic, $\Gamma''$ is closed
under the boolean connectives; in effect we are taking disjunctive normal forms.
Finally, $\Gamma$ depends on the original $\varphi$.


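Write $U \equiv V$ if $U \cap \Gamma = V \cap \Gamma$, and let $[U]$ be the equivalence class of $U$
under this relation. (Note that $\equiv$ also depends on $\varphi$, but we save a little notation
by suppressing this.) We define relations $\xrightarrow{L}$ and $\xrightarrow{\Diamond}$ on $[C(ca)]$ as follows:


$[S] \xrightarrow{L} [T]$ iff there exist $S' \in [S]$ and $T' \in [T]$ such that $S' \xrightarrow{L} T'$.

This is the definition used in minimal filtrations. In contrast, we want $\xrightarrow{\Diamond}$ to
be transitive, and so we define


$[S] \xrightarrow{\Diamond} [T]$ iff there exist $n \geq 0$ and theories $S', S_1, S_1', \ldots, S_n$ such that

$S \equiv S' \xrightarrow{\Diamond} S_1 \equiv S_1' \xrightarrow{\Diamond} \cdots \xrightarrow{\Diamond} S_n \equiv T$

(for $n = 0$ this reads $S \equiv T$).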





We call the tuple
$[C(ca)] = ([C(ca)], \xrightarrow{L}, \xrightarrow{\Diamond})$


the quotient of $C(ca)$ by $\equiv$. We shall show that this quotient is a cross axiom
frame; the main points are the transitivity of $\xrightarrow{L}$ and the cross axiom property.

We turn the structure into a model in the evident way, via the interpretation
$i(A) = \{[T] : A \in T \cap \Gamma\}$.

Standard reasoning shows that $[C(ca)]$ is a filtration of $C(ca)$. It follows that
since the sentence $\varphi$ with which we began is consistent, it has a finite cross
axiom model. Thus the decidability reduces to the verification of the properties
of $[C(ca)]$ that we mentioned in the last paragraph.

We need several results, starting with an important lemma. For each theory
$T$, let


$\gamma_T = \bigwedge (T \cap \Gamma)$.


That is, we take the conjunction of the set $T \cap \Gamma$. This is an analog of atoms,
as we find them in the completeness proof of PDL and other places (see Kozen
and Parikh, 1981).


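LEMMA 6.19 Let $T$ and $U$ be theories. If $L\gamma_T \in U$, then whenever $V \equiv U$,
we also have $L\gamma_T \in V$.


Proof Fix $U$ containing $L\gamma_T$, and let $V \equiv U$. We split $\gamma_T$ into two conjuncts

$\gamma_T'' = \bigwedge (T \cap \Gamma'')$, $\quad \gamma_T^{KL} = \bigwedge (T \cap \Gamma^{KL})$.


Again, we have $\gamma_T = \gamma_T'' \wedge \gamma_T^{KL}$. Our goal is to show that $L\gamma_T \in V$.

Let $\psi \in T \cap \Gamma^{KL}$. (Notice at this point that $\psi$ belongs to $V$: since $\psi$ is a
conjunct of $\gamma_T$, we have $L\psi \in U$, and $L\psi$ is equivalent to $\psi$ for sentences of this
form, so $\psi \in U \cap \Gamma = V \cap \Gamma$.) This $\psi$ is either
$L\chi$ or $K\chi$ for some $\chi \in \Gamma''$. The S5 laws of $K$ in subset spaces tell us that
$K\chi \leftrightarrow KK\chi$ and $L\chi \leftrightarrow KL\chi$. So $\psi \leftrightarrow K\psi$. We therefore see that $K\psi \in V$.
This for all $\psi \in T \cap \Gamma^{KL}$ shows that $K\gamma_T^{KL} \in V$.

Note second that since $U$ contains $L\gamma_T$, it also contains $L\gamma_T''$. But this last
sentence belongs to $\Gamma^{KL} \subseteq \Gamma$. Because $V \equiv U$, we see that $L\gamma_T'' \in V$.

We conclude that $V$ contains $K\gamma_T^{KL}$ and $L\gamma_T''$. Thus it contains $L(\gamma_T^{KL} \wedge \gamma_T'')$.
But this is equivalent to $L\gamma_T$. So $L\gamma_T \in V$. QED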


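LEMMA 6.20 Suppose that $[S] \xrightarrow{L} [T]$. Then for all $S' \in [S]$ there is some
$T' \in [T]$ such that $S' \xrightarrow{L} T'$.


Proof Fix $S' \in [S]$. Let $S'' \in [S]$ and $T'' \in [T]$ be such that $S'' \xrightarrow{L} T''$.
Then $L\gamma_{T''} \in S''$. By Lemma 6.19, $L\gamma_{T''} \in S'$. And thus there is some $T'$
such that $S' \xrightarrow{L} T'$ and $\gamma_{T''} \in T'$. Since $\gamma_{T''} = \gamma_T$, we see that $T' \equiv T$. QED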


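Lemma 6.20 easily implies the transitivity of $\xrightarrow{L}$ in $[C(ca)]$. We are thus left
with the verification of the cross axiom property. Suppose that $[S] \xrightarrow{\Diamond} [T] \xrightarrow{L} [U]$.
We need to find $W$ so that $[S] \xrightarrow{L} [W] \xrightarrow{\Diamond} [U]$, and for this we argue by induction
on $n$ in the definition of the relation $[S] \xrightarrow{\Diamond} [T]$. When $n = 0$, we have
$S \equiv T$. In this trivial case we have $[S] = [T]$ and may take $[W] = [U]$, using the
reflexivity of $\xrightarrow{\Diamond}$. Assume
our result for $n$, and suppose that $[S] \xrightarrow{\Diamond} [T]$ via a chain of length $n + 1$, say


$S \equiv S' \xrightarrow{\Diamond} S_1 \equiv S_1' \xrightarrow{\Diamond} \cdots \xrightarrow{\Diamond} S_{n+1} \equiv T$.


Then $[S_1] \xrightarrow{\Diamond} [T]$ via a chain of length $n$, and so by induction hypothesis we
have some $W$ such that $[S_1] \xrightarrow{L} [W] \xrightarrow{\Diamond} [U]$. By Lemma 6.20 there is some
$W' \equiv W$ such that $S_1 \xrightarrow{L} W'$. So as theories, $S' \xrightarrow{\Diamond} S_1 \xrightarrow{L} W'$. Since $C(ca)$
is a cross axiom frame, there is some $X$ such that $S' \xrightarrow{L} X \xrightarrow{\Diamond} W'$. We have
$[S] \xrightarrow{L} [X]$ in the quotient. We also have $[X] \xrightarrow{\Diamond} [W] \xrightarrow{\Diamond} [U]$. By transitivity
we also have $[X] \xrightarrow{\Diamond} [U]$. This completes the proof of decidability: from a
consistent sentence, we have effectively produced a finite model.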


9. Heinemann’s extensions to topologic 


This section discusses two of the many extensions of topologic due to Bernhard
Heinemann. One of his papers (Heinemann, 1998) studies spaces which
satisfy chain conditions of various sorts. We begin with the relevant definitions;
let $\mathcal{X} = (X, \mathcal{O})$ be a subset frame.


1 $\mathcal{X}$ satisfies the weak bounded chain condition (wbcc) if for each point
$x \in X$, every descending sequence of opens around $x$ is finite.


2 $\mathcal{X}$ satisfies the finite chain condition (fcc) if every descending sequence
of opens is finite.


3 $\mathcal{X}$ satisfies the bounded chain condition (bcc) if for some $n$, every
descending sequence of opens is of length at most $n$.


In order to axiomatize the logics of these classes of spaces, it is necessary to
alter the basic semantics of topologic. Up until now we have


$p, u \models \Box\varphi$ iff $p, v \models \varphi$ for all $v \in \mathcal{O}$ such that $v \subseteq u$.


In the study of chain conditions, we alter this to


$p, u \models \Box\varphi$ iff $p, v \models \varphi$ for all $v \in \mathcal{O}$ such that $v \subset u$.


(So $\Box$ now quantifies over proper subsets.) Turning to the logics themselves,
first note that the topologic axiom $\Box\varphi \to \varphi$ is no longer valid. So this axiom
is dropped, and the rest of the system is retained.
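
To see concretely why $\Box\varphi \to \varphi$ fails once $\Box$ ranges over proper subsets only, here is a minimal sketch on a two-open subset frame. The helper `box_holds` and the sample valuation `holds` are hypothetical names introduced for this illustration; the sketch also assumes, as usual for pairs $(p, v)$, that only opens $v$ containing the point $p$ are considered.

```python
def box_holds(point, u, opens, holds, strict=False):
    """Evaluate 'box phi' at the pair (point, u).

    holds(point, v) tells whether phi is true at (point, v).
    strict=False: quantify over opens v with point in v and v a subset of u.
    strict=True:  quantify only over proper subsets v of u.
    """
    for v in opens:
        if point not in v:
            continue
        inside = (v < u) if strict else (v <= u)
        if inside and not holds(point, v):
            return False
    return True


big, small = frozenset({0, 1}), frozenset({0})
opens = [big, small]


def holds(point, v):
    # Sample valuation: phi is true exactly at pairs whose open is the small set.
    return v == small


# Original semantics: box phi fails at (0, big) because phi fails at (0, big).
print(box_holds(0, big, opens, holds))               # False
# Proper-subset semantics: box phi holds at (0, big) although phi does not.
print(box_holds(0, big, opens, holds, strict=True))  # True
print(holds(0, big))                                 # False
```

Under the proper-subset reading the pair $(0, \{0, 1\})$ satisfies $\Box\varphi$ but not $\varphi$, which is exactly the failure of the dropped axiom.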

To axiomatize the validities in the wbcc spaces, one then adds the scheme


(6.7) $\Box(\Box\varphi \to \varphi) \to \Box\varphi$.

This scheme is familiar from provability logic (also called Löb logic). The
resulting logic lacks the finite model property, and indeed the decidability remains
open (see Heinemann, 1998).

Turning to bcc spaces, we would add (6.7) (actually a weaker scheme suffices)
and also


$K\Box(K\Box\varphi \to \varphi) \to K\Box\varphi$.


It turns out that the resulting system is sound for fcc spaces and complete even
for the smaller class of bcc spaces, and that it is decidable.

Around the same time as we wrote this chapter, Heinemann discovered
a modality which is basic for our enterprise. This is the overlap operator $O$
(see Heinemann, 2006). We add it to the basic language of topologic, and
then the semantics is extended to


$p, u \models O\varphi$ iff $p, v \models \varphi$ for all $v \in \mathcal{O}$ such that $p \in v$.


Note that the difference between this and the semantics of $\Box\varphi$ is that we do not
require that $v \subseteq u$. As a result, we see that the axiom $O\varphi \to \Box\varphi$ is valid. Also,
the S5 axioms are valid for $O$:


$O(\varphi \to \psi) \to (O\varphi \to O\psi)$
$O\varphi \to \varphi$
$O\varphi \to OO\varphi$
$P\varphi \to OP\varphi$


In the last of these, the operator $P$ is the dual of $O$, so $P\varphi$ is defined by $\neg O\neg\varphi$.
Turning from axioms to rules of inference, we see easily that necessitation for
$O$ preserves validity. So we get a logical system called ET by adding the axioms
and rule to the system of topologic. Heinemann proves completeness and
decidability of this system (see Heinemann, 2006). The details are elaborations
of the arguments which we have already seen for the basic system of topologic,
the subset space axioms.

Recall that the class of directed spaces is not finitely axiomatizable (see Weiss
and Parikh, 2002). This is remedied in the larger language: consider


$P\Box\varphi \to \Diamond\varphi$.





Further, the weak connectedness of the space may be captured. This condition 
says that given any two overlapping sets, one is included in the other. The 
relevant axiom is 


(o> OY) v O — 04). 


But this is not so topologically natural. It would be more in the spirit of things 
to capture the condition that two overlapping sets have a common superset. 
The natural move here is to add nominals $I$, $J$, etc. whose values are taken to
be opens of the underlying space. Then one defines the semantics as in hybrid
logic, using interpretations on the new nominals and the obvious satisfaction
relation. The sentence


$I \wedge PJ \to P(\Diamond I \wedge \Diamond J)$


then captures the condition we mentioned. One can also add nominals
for points, again following the lead of hybrid logic. There is an outline of
the completeness and decidability of the resulting logical system in
Heinemann, 2006.

We close with a mention of an avenue for further work. Most of the results 
in this area concern completeness and decidability of various systems, and also 
results on expressive power. Results on complexity are scarce. One exception 
is (Heinemann, 1999a), where an NP-completeness result is proved for a sat- 
isfiability problem related to topological nexttime logic. But again, there are 
many open complexity problems. 


10. Common knowledge in topological settings 


The interaction of the topological semantics of modal logic with common
knowledge is the subject of a recent paper (van Benthem and Sarenac,
2004). As with the papers discussed in Sec. 9, we expect this paper to inspire others.
Our purpose here is to give a high-level overview of it that connects it with the
theme of our chapter.

The most standard setting of modal logic is that given by the semantics on
Kripke models. We shall go into more detail on this in Sec. 11 just below. In
the standard setting, one adds transitive closure operators $\Box^*$ to modal logic.
For the semantics, one takes a model $M$ which lives on a relation $R$, considers
the reflexive-transitive closure $R^*$ of $R$, and finally defines


$x \models \Box^*\varphi$ iff $y \models \varphi$ for all $y \in M$ such that $x R^* y$.


So $\Box^*\varphi$ is semantically equivalent to the infinite conjunction $\varphi \wedge \Box\varphi \wedge \Box^2\varphi \wedge \cdots$.
We sometimes read $\Box^*\varphi$ as fully-aware knowledge: $\varphi$ is true, the agent knows
this, s/he knows that s/he knows this, etc. But note that this reading hides a great
deal. Indeed, it would be better to note that there is an intuitively important
idea of fully-aware knowledge and then to take $\Box^*$ as a proposal for modeling
it. A second proposal is to take the set of states satisfying $\Box^*\varphi$ to be the largest
fixed point of the following operator on the power set of the set of states in our
model:


$X \mapsto \{w \in W : w \models \varphi, \text{ and all } R\text{-successors of } w \text{ belong to } X\}$.


Yet another proposal might be to have some very explicit evidence, and of
course this is not capturable in the standard semantics in the first place. (In the
same way, we should always remember that the standard semantics of $\Box$ is a
proposal for the modeling of intuitive knowledge in the first place.)
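
On a finite Kripke model the first two proposals can be computed directly and compared. The sketch below is only an illustration under stated assumptions: the function names and the four-world model are hypothetical, and `phi` is given extensionally as the set of worlds where the sentence holds. One function follows the reflexive-transitive closure, the other iterates the operator above downward from the full set of worlds until it stabilizes at its largest fixed point.

```python
def box_star_by_closure(worlds, rel, phi):
    """Worlds w such that phi holds at every world R*-reachable from w."""
    result = set()
    for w in worlds:
        reached, stack = {w}, [w]          # R* is reflexive, so w reaches itself
        while stack:
            v = stack.pop()
            for (a, b) in rel:
                if a == v and b not in reached:
                    reached.add(b)
                    stack.append(b)
        if reached <= phi:
            result.add(w)
    return result


def box_star_by_fixed_point(worlds, rel, phi):
    """Largest fixed point of X |-> {w in phi : every R-successor of w is in X}."""
    current = set(worlds)
    while True:
        nxt = {w for w in worlds
               if w in phi and all(b in current for (a, b) in rel if a == w)}
        if nxt == current:
            return current
        current = nxt


worlds = {0, 1, 2, 3}
rel = {(0, 1), (1, 2), (2, 2), (3, 3)}
phi = {0, 1, 3}                            # phi fails only at world 2

print(box_star_by_closure(worlds, rel, phi))      # {3}
print(box_star_by_fixed_point(worlds, rel, phi))  # {3}
```

In this relational setting the two computations return the same set, which matches the remark later in the section that the relational semantics always identifies these formalizations.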
To capture this new operator $\Box^*$ in a sound, complete, and decidable logic,
one adds necessitation for the new operator, an axiom called Mix:


$\Box^*\varphi \to (\varphi \wedge \Box\Box^*\varphi)$


and also an induction rule: from $\psi \to \varphi \wedge \Box\psi$, infer $\psi \to \Box^*\varphi$.

So far in our chapter we have only dealt with epistemic logic with one 
agent, and this is how the subject of epistemic logic germinated. But the whole 
subject blossoms in the multi-agent setting. For simplicity, we shall deal with 
two agents, call them A and B. The intuitive idea of common knowledge is the 
two-agent version of the fully-aware knowledge that we saw above. So we add
an operator $\Box^*$ (sometimes it is decorated with the names of the agents A and
B) to the basic modal language. Then the question arises as to the semantics.
One way to begin is to start with a model $M$ living on two different relations,
say $R_A$ and $R_B$. The standard proposal for the semantics of $\Box^*\varphi$ is


$x \models \Box^*\varphi$ iff $y \models \varphi$ for all $y \in M$ such that $x\,(R_A \cup R_B)^*\,y$.


So we take the reflexive-transitive closure of the union of the two relations. In
plainer terms, this proposal amounts to saying that $\varphi$ is common knowledge to
A and B if


(0) $\varphi$ is true;
(1) A and B both know (0);
(2) A and B both know (1);


and so on. To summarize, this standard proposal is to identify common
knowledge of $\varphi$ with a certain infinite conjunction of iterated knowledge
statements of $\varphi$.

We have taken great care in this discussion to keep separate the intuitive 
concepts and the standard formulation of them. These are usually conflated, and 
the main point of van Benthem and Sarenac’s paper is that a two-agent version of 
the topological semantics of epistemic logic allows one to distinguish between 
different formalizations that are always identified in the relational semantics. 
The need to do this goes back to an influential paper on common knowledge 
(Barwise, 1988). In that paper, Barwise wishes to question the basic modeling 
of common knowledge as an infinite iteration. More precisely, he wishes to 
distinguish between the infinite iteration and the related fixed point; he also is 
concerned with the notion of “agents having a shared situation”. 

Recall the topological semantics of epistemic logic, and especially the translation
of the interior operator $I$ from Sec. 4.3. So the analog of $\Box^*\varphi$ is now





PN KOPN KOKOA: 


And the natural operator on the power set of the state set is

$X \mapsto \{w \in X^\circ : w \models \varphi\}$.


(Note that we are using the interior operator here.) For a sentence in the logic 
at hand, the two definitions agree. And one of the points of the paper of van 
Benthem and Sarenac is to show a separation of the two formalizations on one 
particular model for the two-agent version of the logic. This model is 


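$\mathbb{Q} \times \mathbb{Q} = (\mathbb{Q} \times \mathbb{Q}, \mathcal{O}_1, \mathcal{O}_2)$.


Thus the states are pairs of rational numbers. A set $X$ is open in $\mathcal{O}_1$ if for all
$(r, s) \in X$ there is some rational $\varepsilon > 0$ such that


$\{r\} \times ((s - \varepsilon, s + \varepsilon) \cap \mathbb{Q}) \subseteq X$.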


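And $\mathcal{O}_2$ is defined similarly. The analog of the infinite iteration now reflects
two agents:

$\varphi \wedge \bigwedge_{i} \Box_i\varphi \wedge \bigwedge_{i,j} \Box_i\Box_j\varphi \wedge \cdots$


(so that $i$ and $j$ range over $\{1, 2\}$). The fixed point now is for the operator which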
takes a set X to the intersection of the interiors of X in the two topologies. 
Again, the paper proves that the two notions differ. See also van Benthem 
et al., 2007 and Sections 3.2 and 3.4 of Ch. 5 for this and related material, such 
as the completeness of the two-agent version of S4 on the rational square Q x Q. 



















11. The topology of belief 


We have already seen in Sec. 3 that S4, a logic of knowledge, has a topological 
semantics. For the purposes of this chapter, the important conclusion is that 
the interior operator acts like knowledge. In more detail, the properties of this 
operator as rendered in modal terms correspond to the axioms of a logic S4 
that may be considered a logic of knowledge. The purpose of this section of our 
chapter is to show that there is a topological operator which acts like belief in 
this same sense. Our work in this section is based on the recent dissertation by 
one of the authors (Steinsvold, 2006). 

Most commonly, the difference between knowledge and belief is taken to
be that belief should not imply truth; this corresponds to the assertion that
analogues of statements like $I\varphi \to \varphi$ should not be valid. To emphasize that
we are dealing with belief, we change the modality to $B$ in this section. So
again, we do not want to work with a logic that includes the T-scheme $B\varphi \to \varphi$.

The most standard logic of belief is the logic KD45. KD45 is axiomatized
using the schemes listed in Fig. 6.5, together with the rules of Modus Ponens
and Necessitation which we have seen in Sec. 3. The D scheme corresponds to
the assertion that beliefs are consistent: an agent who believes $\varphi$ should not also
believe $\neg\varphi$. The 4 and 5 schemes correspond to assertions of introspection: if
an agent believes $\varphi$, then they believe the assertion that they believe $\varphi$. This
is the content of 4. As for 5, it says that if an agent does not believe $\varphi$, then
again they believe the assertion of that disbelief. Despite obvious problems,
KD45 is a standard logic of belief in the sense that the provable sentences may
be taken as a first approximation to the properties of belief.

Here is the plan of this section of our chapter. We first recall and compare the
relational and topological semantics for KD45. We then discuss the derived set operation
from topology (defined in Sec. 11.2 below) and provide an interpretation of
KD45 in it. Finally, a topological completeness proof is presented for KD45
with respect to a class of spaces that we call DSO spaces.

We should note in passing that other authors have explored the properties of 
this derived set operation using modal logic; see, e.g., Esakia, 1981; Shehtman, 
1990; Bezhanishvili et al., 2005. However, none of these works offered epis- 
temic connections. This is one of our goals. Actually, we are more interested 
in doxastic connections, that is connections to the notion of belief. 

Before plunging into the details, we should mention why we chose this topic 
for a chapter on Topology and Epistemic Logic. We have already seen various 
logics in the paper, including a standard logic of knowledge (S4) and a standard
logic with common knowledge (the logic of $\Box^*$). One certainly can view what
we are doing here as being work in the same general direction: find a logic 
corresponding to a semantics that is already of interest. But this section can 
also be read in the other direction: given a logic that we believe to be sensible 
or at least worthy of study, construct a semantics for it. The work of this section 
suggests a topological development along these lines. In a nutshell, the proposal 
is to read belief as the dual of the derived set operator on a topological space 
satisfying some conditions. For a complete discussion of the philosophical 
aspects of this proposal, see Steinsvold, 2006. 





11.1 Relational semantics of KD45


We have already discussed the notion of a frame in Sec. 3.1. We quickly
moved from frames in general to preorders there, but here we need the more
general notion. Again, a frame, $F = (W, R)$, is a set $W$ with a relation $R$ on
$W$. We’ll refer to the members of $W$ as points or worlds, interchangeably. If
$x$ bears the relation $R$ to $y$, we’ll write either $xRy$ or $(x, y) \in R$. A model is
a frame together with an interpretation $i$ of atomic sentences in it. We say that
the model is based on the frame.

The semantics on models is given in the usual way, with the clause for $B$
that


$w \models B\varphi$ iff $(\forall z)(wRz \text{ implies } z \models \varphi)$.
Ax   formal statement                                          relational correspondent

K    $B(\varphi \to \psi) \to (B\varphi \to B\psi)$            (none)
D    $B\varphi \to \neg B\neg\varphi$                          every point has an $R$-successor
4    $B\varphi \to BB\varphi$                                  $R$ is transitive
5    $\neg B\varphi \to B\neg B\varphi$                        $R$ is Euclidean


Figure 6.5. Axiom schemes of KD45 with their relational correspondents.


So $B$ is defined using a universal quantifier, like $I$ was in our earlier work.

We read $w \models \varphi$ as saying that $\varphi$ is true at $w$ or that $w$ satisfies $\varphi$. A sentence
$\varphi$ is valid in a model iff $\varphi$ is true at every point in the model. A sentence $\varphi$ is
valid in a frame iff $\varphi$ is valid in every model based on the frame.

The correspondences of Fig. 6.5 are well known. The precise nature of this
correspondence is that a frame $F$ satisfies each instance of D (say) iff $F$ meets
the condition that every point in it has a successor. A model which meets
the condition corresponding to D (that every point have a successor) is called
serial. The Euclidean condition corresponding to the 5 axiom is


$(\forall x)(\forall y)(\forall z)((xRy \wedge xRz) \to yRz)$.


KD45 is complete with respect to models which have the properties listed
above. That is, if $T \cup \{\varphi\}$ is a set of sentences, then $T \vdash \varphi$ in KD45 iff for
all serial, transitive, and Euclidean models $M$ and all points $x \in M$, if $x \models T$,
then $x \models \varphi$.
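
The three frame conditions are easy to test on a finite frame. The following minimal sketch uses hypothetical helper names and a hypothetical three-world frame in which world 0 sees a two-element cluster; it checks seriality, transitivity, and the Euclidean condition, and shows that the T-scheme $B\varphi \to \varphi$ can fail at a world in such a frame.

```python
def is_serial(worlds, rel):
    """Every world has at least one R-successor."""
    return all(any((w, v) in rel for v in worlds) for w in worlds)


def is_transitive(rel):
    return all((a, d) in rel for (a, b) in rel for (c, d) in rel if b == c)


def is_euclidean(rel):
    """x R y and x R z imply y R z."""
    return all((b, d) in rel for (a, b) in rel for (c, d) in rel if a == c)


def believes(world, rel, phi):
    """w |= B phi: phi (a set of worlds) holds at every R-successor of w."""
    return all(v in phi for (u, v) in rel if u == world)


# World 0 sees the cluster {1, 2}; worlds 1 and 2 see each other and themselves.
worlds = {0, 1, 2}
rel = {(0, 1), (0, 2), (1, 1), (1, 2), (2, 1), (2, 2)}
phi = {1, 2}

print(is_serial(worlds, rel), is_transitive(rel), is_euclidean(rel))
print(believes(0, rel, phi), 0 in phi)
```

Here world 0 believes `phi` although `phi` is false at 0, since the relation is not reflexive at 0; this is the sense in which belief, unlike knowledge, need not imply truth.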

This completeness result parallels facts we saw in Sec. 3 for S4: one considers
the set of theories in KD45 and obtains the rest of the “canonical model”
structure using (6.2) and (6.1). KD45 also is decidable via the finite model
property.


11.2 The derivative operation on topological spaces 


We now propose a different semantics for modal logic,
one based on an operation from topology called the derived set operation. We
collect in this section some topological preliminaries about this operation. Let
$\mathcal{X} = (X, \mathcal{O})$ be a topological space. For a set $A \subseteq X$, we define the derived
set of $A$, $d(A)$, by


$w \in d(A)$ iff $(\forall U \in \mathcal{O})(\text{if } w \in U \text{ then } (\exists x \in U \setminus \{w\})(x \in A))$.


In words, $w$ belongs to the derived set of $A$ iff every open set $U$ around $w$
contains some point of $A$ different from $w$. It also might be useful to write out
the complement:


$w \notin d(A)$ iff $(\exists U \in \mathcal{O})(w \in U \text{ but } (\forall x \in U \setminus \{w\})(x \notin A))$.

This derived set $d(A)$ has many other names in the literature on point-set topology:
derivative, Cantor-Bendixson derivative, set of limit points, set of accumulation
points, and set of cluster points (of $A$). It is usually written as $A'$.
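
For a finite space the definition of $d(A)$ can be evaluated directly. The sketch below is an illustration only: `derived_set` is a hypothetical helper, and the two sample spaces (the two-point Sierpinski space and a discrete space) are chosen here as examples rather than taken from the chapter.

```python
def derived_set(points, opens, a):
    """d(A): the points w such that every open U containing w meets A minus {w}."""
    return {w for w in points
            if all((u & a) - {w} for u in opens if w in u)}


# Sierpinski space on {0, 1}: the opens are the empty set, {1}, and {0, 1}.
points = {0, 1}
sierpinski = [set(), {1}, {0, 1}]

print(derived_set(points, sierpinski, {1}))   # {0}
print(derived_set(points, sierpinski, {0}))   # set()

# Discrete space: every subset is open, so no point is a limit point of any set.
discrete = [set(), {0}, {1}, {0, 1}]
print(derived_set(points, discrete, {0, 1}))  # set()
```

In the Sierpinski space the only open set around 0 is the whole space, which contains the point 1, so 0 is a limit point of $\{1\}$; the point 1 has the open neighborhood $\{1\}$, which contains no other point at all.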


EXAMPLE 6.21 Let $\mathbb{R}$ be the reals with the usual topology, and let


$A = \{\tfrac{1}{n} : n \in \mathbb{N},\ n \geq 1\}$.


Then $d(A) = \{0\}$. That is, every open set containing 0 contains some number
$1/n$. But for all $r \neq 0$, there is an open neighborhood of $r$ which contains no
number in $A$ other than possibly $r$ itself. Next, consider

1 1 
Bo= {ete imma h 
n n+il+m 


This is like $A$, but with a copy of $A$ itself next to each of its points. The
copies are disjoint. Then $d(B) = A \cup \{0\}$. And so $d(d(B)) = \{0\}$. Further,
$d(d(d(B))) = \emptyset$. More generally, the derived set of any singleton is empty
in any space in which singletons are closed (the $T_1$ spaces discussed below).


EXAMPLE 6.22 Here is another example of how $d$ works, one which will be
elaborated in Sec. 11.4 below. Let $W$ be an infinite set, let $D$ be an infinite
subset of $W$, and let $\mathcal{O}$ be the family of subsets $U \subseteq W$ such that $D \setminus U$ is
finite, together with the empty set. It is easy to see that $\mathcal{O}$ is closed under finite
intersections, since


$D \setminus (U \cap V) = (D \setminus U) \cup (D \setminus V)$,


and arbitrary unions. (In fact, every superset of a nonempty open set is itself
open.) In this way, $\mathcal{W} = (W, \mathcal{O})$ is a topological space.
We compute $d(A)$ for all $A \subseteq W$:


$d(A) = \emptyset$ if $D \cap A$ is finite,
$d(A) = W$ if $D \cap A$ is infinite.


In the first case, $D \setminus (D \setminus A) = D \cap A$ is finite. For $w \in W$, let $U_w =
\{w\} \cup (D \setminus A)$. Then $U_w$ is an open set containing $w$, but for all $x \in U_w \setminus \{w\}$,
$x \notin A$. So $w \notin d(A)$. Thus $d(A) = \emptyset$. We turn to the second case. Let
$w \in W$ and let $U$ be an open set containing $w$. We claim that $U \cap A \cap D$ is
infinite. For suppose not. Then $D \cap A$ is infinite but $U \cap (A \cap D)$ is finite. So
$(D \cap A) \setminus U$ is infinite. So its superset $D \setminus U$ is also infinite. But by definition of
the topology, $D \setminus U$ is finite. This contradiction shows that indeed $U \cap D \cap A$ is
infinite. In particular, $U$ contains an element of $A$ different from the original $w$.
EXAMPLE 6.23 Here is a final example, one suggestive of the connection
between the relational semantics and the derived set operation. Let $M =
(W, R)$ be a set with a transitive relation on it. (We do not require that $R$ be
reflexive.) Recall the Alexandrov topology from Sec. 3.1: the opens are the
$R$-closed sets. We again compute $d$ on sets, and this time


$d(A) = \{x \in W : (\exists y \in A)(y \neq x \text{ and } xRy)\}$.


Here is the reasoning: suppose first that $x \in d(A)$. Consider the open set
$U_x = \{x\} \cup \{y : xRy\}$. $A$ must contain some point of $U_x$ other than $x$. In
the other direction, assume that $y \neq x$, $xRy$, and $y \in A$. Then every open set
around $x$ contains $y$.

A suggestive observation: the standard Kripke semantics of modal logic is
almost related to the derived set by


$i(\Diamond\varphi) = d(i(\varphi))$.


Indeed, if we modified the definition of $x \in d(A)$ to not require witnesses
different from $x$ in the open neighborhoods, we would have the equation above.
Or, if $R$ were irreflexive, then the equation would hold. See Esakia, 2001 for
more on this.
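
The formula for $d(A)$ in the Alexandrov topology can be compared with the topological definition on a small example. This is a minimal sketch with hypothetical names (`alexandrov_opens`, `derived_set_by_formula`) and a hypothetical three-point transitive relation that has one reflexive point; it enumerates the $R$-closed sets and checks that both computations agree on every subset.

```python
from itertools import combinations


def subsets(points):
    items = sorted(points)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def alexandrov_opens(points, rel):
    """The R-closed sets: x in U and x R y together imply y in U."""
    return [u for u in subsets(points)
            if all(y in u for (x, y) in rel if x in u)]


def derived_set(points, opens, a):
    """Topological definition: every open around w meets A minus {w}."""
    return {w for w in points
            if all((u & a) - {w} for u in opens if w in u)}


def derived_set_by_formula(points, rel, a):
    """The relational description: some y in A with y != x and x R y."""
    return {x for x in points
            if any(y != x and (x, y) in rel for y in a)}


# A transitive relation on {0, 1, 2}; the point 2 is reflexive, 0 and 1 are not.
points = {0, 1, 2}
rel = {(0, 1), (1, 2), (0, 2), (2, 2)}
opens = alexandrov_opens(points, rel)

agree = all(derived_set(points, opens, a) == derived_set_by_formula(points, rel, a)
            for a in subsets(points))
print(agree)                                            # True
print(derived_set(points, opens, frozenset({2})))       # {0, 1}
```

Note that 2 is not in $d(\{2\})$ even though $2R2$ holds, because the witness must differ from the point itself; this is the gap between $d$ and the usual Kripke clause for $\Diamond$ mentioned above.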


Proposition 6.24 below contains standard results in point-set topology about
the operation $d$. In the first part, we recall that a topological space satisfies
the $T_1$ separation property iff for all distinct points $x$ and $y$, there is an open
set containing $x$ but not $y$. Another formulation: all singletons are closed sets.
This property can also be neatly expressed using $d$: $d(\{x\}) = \emptyset$ for all $x$.
(Recall also that $x \notin d(\{x\})$.) In the second part, we mention the $T_d$ separation
property. This holds if every singleton is the intersection of an open and a
closed set. Equivalently, every set of the form $d(A)$ is closed. This alternate
formulation is more useful in this chapter.


PROPOSITION 6.24 Concerning the derivative operation $d$ on a topological
space $X$:

1. If every set of the form $d(\{x\})$ is open, then $X$ is a $T_1$ space.

2. If $X$ is a $T_1$ space, then $X$ is a $T_d$ space.

3. For all topological spaces, $d(d(A)) \subseteq A \cup d(A)$.

Proof For the first assertion, assume that each $d(\{x\})$ is open. Suppose
towards a contradiction that $y \in d(\{x\})$. As we know, $x \notin d(\{x\})$. But since
$d(\{x\})$ is open, contains $y$, and is disjoint from $\{x\}$, we see that $y \notin d(\{x\})$
after all. This is a contradiction.

For the second assertion, consider a derived set $d(A)$. Let $x \notin d(A)$. Then
there is an open set $U$ containing $x$ such that $(U \setminus \{x\}) \cap A = \emptyset$. We claim that
$U \cap d(A) = \emptyset$. Since we already know $x \notin d(A)$, we only need to consider
some $y \neq x$ in $U$. By the $T_1$ property, let $V$ be an open set with $y \in V$ but
$x \notin V$. Then $(U \cap V) \cap A = \emptyset$. Since $U \cap V$ is open and contains $y$, we see that
$y \notin d(A)$. But $y$ is arbitrary, and this establishes the claim that $U \cap d(A) = \emptyset$.
That is, $U$ is an open set, $x \in U$, and $U \subseteq X \setminus d(A)$. Since $x$ is arbitrary,
$X \setminus d(A)$ is open.

For the last part, suppose that $x \in d(d(A))$ but $x \notin A$. Let $U$ be an open
set containing $x$; we show that $U \cap (A \setminus \{x\}) \neq \emptyset$. Let $y \in d(A) \cap U$. Since
$y \in d(A) \cap U$, let $z \in U \cap (A \setminus \{y\})$. Then $z \neq x$, since $x \notin A$. QED


The assertions in Proposition 6.24 suggest a definition: we say that a space
$X$ is a DSO space if every derived set $d(A)$ is open and the space is dense in
itself. (This last condition means that there are no open singletons. We add this
because the conjunction of these two properties will be important in our later
work: the DSO spaces turn out to play the role of serial, transitive, Euclidean
relations in the Kripke semantics of KD45.)


EXAMPLE 6.25 Every space built as in Example 6.22 is a DSO space. To 
see this, recall that we have computed the derivative operation on all subsets 
of the space. The upshot is that the only derived sets are the empty set and the 
whole space. So every derived set is open. Further, no singletons are open. So 
we have a DSO space. 


EXAMPLE 6.26 On the other hand, consider a space $(W, \mathcal{O})$ built from a transitive
relation $(W, R)$ as in Example 6.23. An open singleton corresponds to a
point $x$ with no $R$-successors different from $x$ itself. We do not usually have
a DSO space: usually there are derived sets which are not open. (Incidentally,
there are no finite DSO spaces: such a space would be $T_1$, and hence
all singletons would be open.) In fact, there does not seem to be an elegant
frame-theoretic correspondent to the DSO condition.


11.3 Derived set semantics of KD45


Recall that we are aiming towards a completeness theorem for the doxastic
logic KD45 with respect to a semantics concerned with the derived set operation.
We have seen the axioms of KD45 in the opening part of Sec. 11. We now
give the semantics. Let $\mathfrak{X} = (X, \mathcal{O}, i)$ be a topological model, that is, a topological
space with an interpretation of some fixed background set of atomic sentences.

We interpret the basic modal language in $\mathfrak{X}$ using the classical interpretation
of the connectives, and most critically,

$$w \models B\varphi \quad\text{iff}\quad (\exists U \in \mathcal{O})\bigl(w \in U \land (\forall x \in U \setminus \{w\})(x \models \varphi)\bigr).$$

Ax   formal statement                                   topological correspondent
K    $B(\varphi \to \psi) \to (B\varphi \to B\psi)$     (none)
D    $B\varphi \to \neg B\neg\varphi$                   dense in itself
4    $B\varphi \to BB\varphi$                           $T_d$ separation property
5    $\neg B\varphi \to B\neg B\varphi$                 all derived sets are open

Figure 6.6. Axiom schemes of KD45 again, this time with their topological correspondents.


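In words, $B\varphi$ is true at $w$ when there is an open set $U$ containing $w$ such that
every point in $U$ except possibly for $w$ satisfies $\varphi$. So we have

$$w \models \neg B\neg\varphi \quad\text{iff}\quad (\forall U \in \mathcal{O})\bigl(\text{if } w \in U \text{ then } (\exists x \in U \setminus \{w\})(x \models \varphi)\bigr).$$

And therefore

$$i(\neg B\neg\varphi) = d(i(\varphi)).$$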
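The duality between $B$ and $d$ can be made concrete on a finite space. The following is a minimal sketch under assumptions of our own: the three-point chain topology and the names `belief` and `derived` are hypothetical, chosen only to illustrate the two clauses above. It computes $i(B\varphi)$ from $i(\varphi)$ and confirms that the complement of $i(B\neg\varphi)$ is the derived set of $i(\varphi)$ for every possible extension of $\varphi$.

```python
from itertools import combinations

def powerset(points):
    """All subsets of `points`, as frozensets."""
    pts = list(points)
    return [frozenset(c) for r in range(len(pts) + 1) for c in combinations(pts, r)]

def belief(X, opens, S):
    """i(B phi) for S = i(phi): w has an open neighbourhood U with U minus {w} inside S."""
    return frozenset(w for w in X if any(w in U and (U - {w}) <= S for U in opens))

def derived(X, opens, S):
    """d(S): every open neighbourhood of w meets S minus {w}."""
    return frozenset(w for w in X if all((U - {w}) & S for U in opens if w in U))

# Hypothetical example space: a chain topology on three points.
X = frozenset({0, 1, 2})
opens = [frozenset(), frozenset({0}), frozenset({0, 1}), X]

# i(not B not phi) = d(i(phi)) for every choice of S = i(phi).
duality_holds = all(X - belief(X, opens, X - S) == derived(X, opens, S)
                    for S in powerset(X))
print(duality_holds)
print(sorted(belief(X, opens, frozenset({1}))))
```

In this space the point 0 is an open singleton, so $B\varphi$ holds at 0 vacuously for every $\varphi$; this is the same phenomenon that makes the D axiom fail on spaces that are not dense in themselves.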


At this point we have a hint as to why the semantics of $B$ was taken with the
existential condition rather than the universal one. In Example 6.23 we saw
that the derivative operation corresponds to the existential modality $\Diamond$. And
since the KD45 axioms for $B$ are universal, we take the semantics using the
dual of $d$ rather than $d$ itself.

We have the standard semantic definitions: A sentence $\varphi$ is valid in a topological
model $\mathfrak{X}$ iff $\varphi$ is true at every $x \in X$. A sentence $\varphi$ is valid in a topological
space $(X, \mathcal{O})$ iff $\varphi$ is valid in every topological model based on $(X, \mathcal{O})$.

We first study some correspondence phenomena and then use these to motivate
a completeness theorem. The correspondence for D may be found in
Shehtman, 1990, and the one for 4 in Esakia, 2001.


THEOREM 6.27 The correspondences of Fig. 6.6 hold. That is, each scheme
is valid on a topological space $\mathfrak{X} = (X, \mathcal{O})$ iff $\mathfrak{X}$ has the specified topological
property.


Proof We fix a space $\mathfrak{X} = (X, \mathcal{O})$ in this proof. The K axioms are easily seen
to be valid on $\mathfrak{X}$; this amounts to the closure of topologies under intersection.

We turn to the correspondence for the D axioms. If $\{w\}$ is open, consider
the model obtained by $i(p) = \{w\}$. In it, $w \models Bp \land B\neg p$, counter to D. In the
other direction, fix an interpretation $i$ and assume that $w \models B\varphi \land B\neg\varphi$. Then
there are opens $U$ and $V$ containing $w$ such that at all points of $U$ besides $w$,
$\varphi$ holds, and at all points of $V$ besides $w$, $\varphi$ fails. Then the open set $U \cap V$
must be $\{w\}$.

Next, suppose that the space is $T_d$, so every derived set $d(A)$ is closed. Fix $i$
and $\varphi$, and suppose that $w \models B\varphi$. Let $A = i(\neg\varphi)$, so that $w \in X \setminus d(A)$. Let $U$
be an open set such that $w \in U \subseteq X \setminus d(A) = i(B\varphi)$. $U$ shows that $w \in i(BB\varphi)$. Going
the other way, suppose that every 4 axiom holds under every interpretation.
Let $A \subseteq X$. We check that $X \setminus d(A)$ is open. Let $i(p) = X \setminus A$, so that
$i(Bp) = X \setminus d(A)$. Let $x \in X \setminus d(A)$. So

$$x \in i(Bp) \subseteq i(BBp) = X \setminus d(X \setminus i(Bp)).$$

Thus there is an open set $U$ containing $x$ with the property that all points in $U$
are in $i(Bp)$, except possibly $x$. But $i(Bp) = X \setminus d(A)$. And as we know, $x$
itself belongs to $X \setminus d(A)$. So $U$ shows that $X \setminus d(A)$ is indeed open.

For the correspondence result for the 5 axioms, suppose that $d(A)$ is always
open in $\mathfrak{X}$. Fix $i$ and $\varphi$, and suppose that $w \models \neg B\varphi$. Let $A = i(\neg\varphi)$, so
that $i(B\varphi) = X \setminus d(A)$ and $w \in d(A)$. Then $d(A)$ is open, contains $w$,
and every point of it is outside of $i(B\varphi)$. So $w \in i(B\neg B\varphi)$. Going the
other way, suppose that every 5 axiom holds under every interpretation. Let
$A \subseteq X$. We check that $d(A)$ is open. As in the last part, let $i(p) = X \setminus A$.
Let $x \in d(A) = i(\neg Bp)$. Then $x \in i(B\neg Bp)$. So there is an open set $U$
containing $x$ such that $U \setminus \{x\} \subseteq i(\neg Bp) = d(A)$. Hence $U \subseteq d(A)$. QED
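The correspondences can be tested exhaustively on a small carrier. The sketch below is an illustration under our own assumptions, not part of the proof: it enumerates every topology on a three-point set, and for each one compares validity of the D, 4 and 5 schemes (instantiated at an atomic $p$, over all interpretations of $p$) with the matching topological property. The proof above shows that atomic instances suffice for the converse directions, which is what makes this finite check meaningful. All function names are hypothetical.

```python
from itertools import combinations

def powerset(items):
    """All subsets of `items`, as frozensets."""
    items = list(items)
    return [frozenset(c) for r in range(len(items) + 1) for c in combinations(items, r)]

X = frozenset({0, 1, 2})
SUBSETS = powerset(X)

def is_topology(fam):
    """Finite case: contains the empty set and X, closed under binary union and intersection."""
    return (frozenset() in fam and X in fam
            and all(U | V in fam and U & V in fam for U in fam for V in fam))

def B(opens, S):
    """i(B p) when i(p) = S."""
    return frozenset(w for w in X if any(w in U and (U - {w}) <= S for U in opens))

def d(opens, A):
    """Derived set, computed as the dual of B."""
    return X - B(opens, X - A)

def valid_D(opens):   # Bp -> not B not p
    return all(not (B(opens, S) & B(opens, X - S)) for S in SUBSETS)

def valid_4(opens):   # Bp -> BBp
    return all(B(opens, S) <= B(opens, B(opens, S)) for S in SUBSETS)

def valid_5(opens):   # not Bp -> B not Bp
    return all((X - B(opens, S)) <= B(opens, X - B(opens, S)) for S in SUBSETS)

def dense_in_itself(opens):
    return all(frozenset({x}) not in opens for x in X)

def derived_sets_closed(opens):
    return all(X - d(opens, A) in opens for A in SUBSETS)

def derived_sets_open(opens):
    return all(d(opens, A) in opens for A in SUBSETS)

# Every family of subsets of X that happens to be a topology.
TOPOLOGIES = [fam for fam in powerset(SUBSETS) if is_topology(fam)]

correspondences_hold = all(
    valid_D(T) == dense_in_itself(T)
    and valid_4(T) == derived_sets_closed(T)
    and valid_5(T) == derived_sets_open(T)
    for T in TOPOLOGIES)
no_finite_dso = not any(dense_in_itself(T) and derived_sets_open(T) for T in TOPOLOGIES)
print(len(TOPOLOGIES), correspondences_hold, no_finite_dso)
```

The last flag reflects the remark in Example 6.26 that no finite space is DSO, at least for this three-point carrier.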


There are a number of other general facts concerning this semantics which
might be of interest. For one, no topological space validates $B\varphi \to \varphi$. (This
is the scheme T which we frowned on before.) But since our main goal is a
completeness theorem, we shall not digress.

Recall the notion of a DSO space from Sec. 11.2, and also Proposition 6.24.
We see from the correspondence results that every DSO space satisfies all of the
KD45 axioms under every interpretation. In fact, it is not hard to check that
the logic KD45 is sound for DSO spaces in the following sense. Let $\Gamma \cup \{\varphi\}$
be any set of modal sentences (using $B$ as a modality). We say that $\Gamma \vdash \varphi$
is provable in KD45 if there is a finite subset $\{\psi_1, \ldots, \psi_n\} \subseteq \Gamma$ such that
$(\psi_1 \land \cdots \land \psi_n) \to \varphi$ is provable in KD45. Soundness then means:
if $\Gamma \vdash \varphi$, then $\Gamma \models \varphi$.


The derived set logic as a fragment of a richer topologic-like system. 

Much of our chapter has been concerned with the two-sorted ontology of points
and sets, and with the resulting bimodal logical systems used in its study. Our
work in this section appears to be on a different track. However, it is possible
to unify the two. Consider the language $\mathcal{L}$ for points and sets from Sec. 4; this
is the language with modalities $\Box$ and $K$. Instead of $K$, take as primitive an
“everywhere but here” operator $[\neq]$ as in Sec. 3.2, but this time with semantics

$$p, u \models [\neq]\varphi \quad\text{iff}\quad \text{for all } q \neq p \text{ in } u, \text{ we have } q, u \models \varphi.$$

Then the $K$ of topologic is recovered as an abbreviation. Call the resulting language
$\mathcal{L}_2$. It is easy to check that the correspondence results of Proposition 6.5
still hold. We can translate the language of this section into $\mathcal{L}_2$ in the manner
of Sec. 4.3 via

$$(B\varphi)^* = \Diamond[\neq]\varphi^*.$$

The logic S4D must be modified a bit: the axioms $K\varphi \to \Box\varphi$ are no longer
sound. This reflects the fact that the sentences in $\mathcal{L}_2$ are not always persistent.

Here is a sample of the use of this logic. We saw in Proposition 6.24 the
general fact about derived sets $d(d(A)) \subseteq A \cup d(A)$. Here we show that this
can be proved in the logic of set spaces together with the “weak 4” scheme
$(\varphi \land [\neq]\varphi) \to [\neq][\neq]\varphi$. That is, we can prove

$$(\neg p \land \Box\langle\neq\rangle\Box\langle\neq\rangle p) \to \Box\langle\neq\rangle p$$

for atomic $p$. The first conjunct on the left and atomic permanence gives $\Box\neg p$.
The second conjunct on the left easily gives $\Box\langle\neq\rangle\langle\neq\rangle p$. Using weak 4 and
modal reasoning, we have $\Box\langle\neq\rangle p$.
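The argument just given can be laid out step by step. The following is one possible reading, offered as a sketch rather than as the authors' derivation; it assumes that $\Box$ is a normal modality satisfying $\Box\chi \to \chi$, that $\langle\neq\rangle$ is monotone, and that atomic permanence is available in the form $\neg p \to \Box\neg p$.

```latex
\begin{align*}
&1.\ \neg p \to \Box\neg p
  && \text{atomic permanence}\\
&2.\ \Box\langle\neq\rangle\Box\langle\neq\rangle p \to \Box\langle\neq\rangle\langle\neq\rangle p
  && \text{from } \Box\chi \to \chi \text{ with } \chi := \langle\neq\rangle p,\\
& && \text{by monotonicity of } \langle\neq\rangle \text{ and } \Box\\
&3.\ \langle\neq\rangle\langle\neq\rangle p \to (p \lor \langle\neq\rangle p)
  && \text{contrapositive of weak 4 with } \varphi := \neg p\\
&4.\ \Box\langle\neq\rangle\langle\neq\rangle p \to \Box(p \lor \langle\neq\rangle p)
  && \text{from 3 by necessitation and K for } \Box\\
&5.\ (\Box\neg p \land \Box(p \lor \langle\neq\rangle p)) \to \Box\langle\neq\rangle p
  && \text{normal modal reasoning for } \Box\\
&6.\ (\neg p \land \Box\langle\neq\rangle\Box\langle\neq\rangle p) \to \Box\langle\neq\rangle p
  && \text{from 1, 2, 4, 5}
\end{align*}
```

Read through the translation of this section, $\Box\langle\neq\rangle p$ plays the role of $d(A)$, so line 6 is the modal counterpart of $d(d(A)) \subseteq A \cup d(A)$.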

But we must mention that £2 has not been studied previously. So all of the 
natural questions about the logic are open. 





























11.4 Completeness of KD45 for DSO spaces


The final result of our chapter is the following completeness theorem.


THEOREM 6.28 The logical system KD45 is sound and strongly complete
for DSO topological models. That is, if $\Gamma \models \varphi$, then $\Gamma \vdash \varphi$.


Proof As the reader expects, the method of proof is to show that a set $S$ of
modal sentences which is consistent in KD45 has a DSO topological model.
So fix such a set $S$. By the relational completeness noted in Sec. 11.1, we have
a model

$$M = (W, \to, i),$$

which is serial, transitive, and Euclidean; and $w^* \in W$ so that $w^* \models S$.

Let $N$ be the set of natural numbers. We build a topological space $\mathfrak{M}$ whose
set of points is $N \times W$. Here we meld the constructions in Examples 6.22
and 6.23. For the topology, we consider the following family $\mathcal{O}$ of subsets of
$N \times W$:

$$U \in \mathcal{O} \quad\text{iff}\quad \text{for all } (n, x) \in U, \text{ if } x \to y, \text{ then for all but finitely many } m,\ (m, y) \in U.$$





It is easy to see that $\mathcal{O}$ is closed under arbitrary unions. So to check that $\mathcal{O}$ really
is a topology we only need to consider binary intersections. This amounts to the
fact that the intersection of two cofinite sets is again cofinite. As an example,
take any $(n, x) \in N \times W$ and consider

$$U_{(n,x)} = \{(n, x)\} \cup \{(m, y) : m \in N \text{ and } x \to y\} \tag{6.8}$$

This is open by the transitivity of $\to$ in $M$.

We check that this space $\mathfrak{M} = (N \times W, \mathcal{O})$ is a DSO space. No singletons
are open, using the fact that the original $M$ is serial. To check that derived sets
are open, and also for future use, we again compute derived sets:

$$d(A) = \{(n, x) : n \in N \text{ and there is some } y \text{ such that } x \to y \text{ and infinitely many } m \text{ such that } (m, y) \in A\}$$

(We leave the verification of this to the reader: see Examples 6.22 and 6.23
for parallel work.) We check that such a set $d(A)$ is open. Suppose that
$(n, x) \in d(A)$. Fix some $y$ such that $x \to y$ and for which there are infinitely
many $m$ such that $(m, y) \in A$. Let $U_{(n,x)}$ be as in (6.8). We claim that
$U_{(n,x)} \subseteq d(A)$; let $x \to z$, and consider $(p, z)$. Let $V$ be any open set containing
$(p, z)$. The key here is that since $x \to y$ and $x \to z$, we have $z \to y$ by the
Euclidean property. Almost all $j$ have the property that $(j, y) \in V$, and infinitely
many of them have the property that $(j, y) \in A$. So there is one (indeed infinitely
many) $j$ so that $(j, y) \in V \cap A$. Since $V$ is arbitrary, $(p, z) \in d(A)$, as desired.

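So far we have converted our relational model $M$ into a DSO space $\mathfrak{M}$. We
consider it as a model via $i^{\mathfrak{M}}(p) = N \times i^{M}(p)$.


LEMMA 6.29 (TRUTH LEMMA) For all $(n, x) \in N \times W$, $x \models \varphi$ in $M$ iff
$(n, x) \models \varphi$ in $\mathfrak{M}$.


Proof By induction on $\varphi$. Only the inductive step for $B$ needs an argument.
Suppose $x \models B\varphi$ in $M$, so that for all $y$ such that $x \to y$, $y \models \varphi$. By
induction hypothesis, we see that for all $m$, $(m, y) \models \varphi$ in $\mathfrak{M}$. So each point
of $U_{(n,x)}$ from (6.8) above satisfies $\varphi$ in $\mathfrak{M}$, except possibly for $(n, x)$. Thus
our topological semantics of $B\varphi$ tells us that $(n, x) \models B\varphi$ in $\mathfrak{M}$. Conversely,
assume that $(n, x) \models B\varphi$ in $\mathfrak{M}$. Let $U$ be an open set containing $(n, x)$ with the
property that all points in $U$ satisfy $\varphi$ in $\mathfrak{M}$, save possibly for $(n, x)$ itself. Let
$x \to y$ in $M$. Since all but finitely many $m$ have $(m, y) \in U$, there is some $m \neq n$
such that $(m, y) \in U$, and so $(m, y) \models \varphi$ in $\mathfrak{M}$. By induction hypothesis,
$y \models \varphi$ in $M$. And since $y$ is arbitrary, we see that in $M$, $x \models B\varphi$. QED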








This Truth Lemma completes the proof of Theorem 6.28. We started with a
set $S$ and a world $w^* \in W$ such that $w^* \models S$. Then $\mathfrak{M}$ is a DSO space, and in
it $(0, w^*) \models S$. QED


Incidentally, the same proof shows the following results: K4 is complete
for spaces in which every derived set is closed (due to Esakia, 2001), and
KD4 is complete for spaces which are dense in themselves and in which every
derived set is closed. For this last result, see Shehtman, 1990 and also Sec. 3
of Ch. 5 of this Handbook. The point is that the construction in the proof of
Lemma 6.29 only used the Euclidean property in verifying that the space $\mathfrak{M}$
has the property that all derived sets are open. We can drop this point, and
instead verify directly that $\mathfrak{M}$ is $T_1$. The rest of the arguments are the same.
Perhaps the progenitor of all of these results is Esakia’s completeness theorem
for (finite) topological spaces using weak K4, the modal logic K supplemented
with the scheme $(\varphi \land B\varphi) \to BB\varphi$ (Esakia, 2001). See also Theorem 1.62
of Ch. 5.


12. Other work connected to this chapter 


Chapters 5 and 10 contain much material that is relevant to the concerns of 
this chapter. 

We mention a few other papers which are also relevant, but which we did
not discuss in the chapter. Davoren, 1999 and Davoren and Gore, 2002 study a
propositional bimodal logic consisting of two S4 modalities $\Box$ and $[a]$, together
with the interaction axiom scheme $\langle a\rangle\Box\varphi \to \Box\langle a\rangle\varphi$. In the intended semantics,
the plain $\Box$ is given the McKinsey-Tarski interpretation of interior, while the
labeled $[a]$ is given the standard Kripke semantics using a preorder $R_a$. The
interaction axiom has the flavor of the Cross-Axiom, and here it expresses
the property that the $R_a$ relation is lower semi-continuous with respect to the
topology.

Pacuit and Parikh, 2005 study a non-spatial application of topologic, thereby
showing that the area of application may indeed be wide. They consider a
set of agents connected in a communication graph, such that agent $i$ may
receive information from agent $j$ only if there is an edge from $i$ to $j$. The logic
which arises uses a language very similar to topologic, and it is shown that for
each graph, the logic is decidable, and completely characterizes the graph. An
application to the Valerie Plame affair, a notorious political affair from the early
years of this century, is also described.
1. Introduction and historical overview


In ancient Babylon and Egypt geometry was just a set of empirical ob- 
servations and practical skills and methods for measuring land and designing 
irrigation systems, although it already had a degree of sophistication (e.g., 
Pythagoras’ theorem was already known and the triangle with sides 3-4-5 was 
used in practice for producing right angles). In ancient Greek times it evolved 
into a liberal art. 

We dare claim that Geometry only became a science with Euclid’s epic work
“Elements”, written over 2300 years ago, which was not only the first truly
scientific treatment of geometry, but also the first systematic application of the
axiomatic method in mathematics. Geometry remained a central subject in
mathematics throughout the centuries, but when in the first half of the 17th
century Descartes introduced coordinate systems, and with them the analytic
method in geometry, it gradually began to lose its prime position in mathematics
and became part of algebra and calculus, and later of topology. The modern
view, going back to Klein’s famous Erlangen program, defined geometry
as a study not of figures, but of transformations, and classified the different
geometric structures and their theories in terms of the groups of transformations
which preserve them. This view placed it firmly on algebraic foundations, to the
extent that some mathematicians consider it an “applied group theory”.

On the other hand, the discovery of non-Euclidean geometries by Bolyai, 
Lobachevsky and Gauss in the early 19th century (see e.g. Coxeter, 1969, 
Eves, 1972, Meserve, 1983), which showed inter alia the independence of 
Euclid’s “Fifth Postulate” from the other axioms of Euclidean geometry, was 
an impressive demonstration of the strength and importance of the formal logical 
approach in mathematics, which also reinforced the importance of geometry 
to the foundations of mathematics. Euclid’s Fifth Postulate claims that, given 
a line and a point not incident with it in a plane, there exists a unique line 
in that plane passing through the given point and parallel to the given line. 
Depending on the acceptance or otherwise of that postulate, several natural 
lines of development of geometry evolve: 


■ affine geometry, first studied by Euler, which adopts the Fifth Postulate.
Thus incidence, parallelism, collinearity, and betweenness, as well as 
transformations that preserve these relations, play a central role in affine 
geometry. Such affine transformations can be taken, in the spirit of 
the Erlangen program, as defining the very notion of “affine”. Affine 
geometry does not deal with angles, distances, or any other related metric 
concepts (not even with orthogonality), as these are not invariant under 
affine transformations. Thus the models of affine geometry, viz. affine 
spaces, are more general than Euclidean spaces, but have poorer structure. 


■ hyperbolic geometry, introduced by Lobachevsky and Bolyai, and elliptic
geometry, which adopt the negation of the Fifth Postulate. In hyperbolic 
geometries, given a line and a point not on that line, there exist infinitely 
many lines through that point which lie parallel with the original given 
line (see e.g. Coxeter, 1969, Szczerba and Tarski, 1965, Szczerba and 
Tarski, 1979 for more details); in elliptic geometry (see Coxeter, 1969, 
Behnke et al., 1974) there are no parallel lines at all. These will not 
be discussed in this chapter, but see Hilbert, 1950 for comparison and 
relationships. 
■ absolute geometry, introduced by J. Bolyai, based only on the first four
postulates of Euclid, but independent of the Fifth Postulate. In some
extensions of absolute geometry the notion of parallelism can be completely
rejected, as in projective geometry, where every two lines in a projec- 
tive plane intersect. In others, alternatives of the Fifth Postulate can be 
adopted, as in the elliptic and hyperbolic geometries. Thus, absolute ge- 
ometry is a full-fledged system of geometry, involving distances, angles, 
etc., but based on a weaker axiomatic basis than Euclidean geometry. 
It is not a subsystem of affine geometry, though the intersection of the 
two is still rich enough to develop a meaningful and interesting theory 
of ordered affine structures (see Coppel, 1998, Coxeter, 1969, Ch. 15, 
Lenz, 1992, Szczerba, 1972). 


A fundamental affine relation, i.e., one invariant under affine transformations, 
is the relation of betweenness on triples of points, which extends basic affine 
structures by introducing ordering between points on a line, but not distances. 
Much of the theory of betweenness is independent of Euclid’s Fifth Postulate, 
and thus lies in the intersection of absolute and affine geometry (see Coppel, 
1998, Coxeter, 1969). 

Perhaps the simplest important non-affine relation is that of orthogonality. 
Further adding distances and angles, along with the axioms of the field of reals, 
extends affine geometry to the classical, Euclidean geometry of the real plane 
and space. 

At the beginning of the 20th century, Hilbert, the most influential proponent 
of the axiomatic method in mathematics, illustrated the power of that method 
by re-casting Euclid’s work into a precise and rigorous modern treatment which 
eventually put geometry on sound axiomatic foundations (see Hilbert, 1950). It 
was preceded by axiomatic investigations of the foundations of geometry at the 
end of the 19th century by Peano, as well as Pieri, 1908, Veblen, 1904, Veblen, 
1914 and Pasch, 1882, who analyzed various axiomatic systems and the mutual 
relationships between the primitive notions of Euclidean geometry. However, 
the axiomatic method in geometry only reached its logical maturity with the 
seminal work of Tarski and his students and followers Szczerba, Szmielew, 
Schwabhäuser, Scott, Monk, Givant and others (see Schwabhäuser et al., 1983
for comprehensive details) in the 1920s to 1970s. Tarski developed systematically
the logical foundations of elementary geometry, which is “that part of Euclidean 
geometry that can be formulated and established without the help of any set- 
theoretical devices” (see Tarski, 1959). Essentially, that means the first-order 
theory of Euclidean geometry, developed over a suitably expressive first-order 
language (see below). In particular: 


■ Tarski, 1951, Tarski, 1967 demonstrated how the elementary geometry
of the real plane can be formally interpreted into the elementary (i.e.
first-order) theory of real-closed fields. Furthermore, Tarski showed the
completeness and decidability of the theory of real-closed fields by means
of quantifier elimination, and consequently obtained a decision procedure
for the elementary Euclidean geometry. He then extended these results
to Euclidean planes over arbitrary real-closed fields.


■ Tarski, 1959 showed that the whole of elementary geometry can be developed
axiomatically using just two geometric relations, viz. betweenness
and equidistance (used as the only primitives also by Veblen, 1904). He
thus obtained an explicit axiomatization of the first-order theory of the
Euclidean geometry in terms of these primitives and showed that it is
complete and decidable, though not finitely axiomatizable.


■ In a similar fashion, Szmielew, 1959 studied the first-order theory of
the metric hyperbolic geometry, obtained by negating Euclid’s axiom in
Tarski’s first-order axiomatization of the Euclidean geometry.


■ Szczerba and Tarski, 1965, Szczerba and Tarski, 1979 studied and characterized
the first-order theories of the fragments of the Euclidean, hyperbolic
and absolute geometries based on betweenness alone, for which
they established explicit axiomatizations.


■ Beth and Tarski, 1956, Tarski, 1956 studied the problem of which geometric
relations are sufficient to be adopted as primitive notions in terms
of which the whole of Euclidean geometry can be developed.


■ Szmielew, 1983 developed the theory of point-based collinearity structures
and showed how to build up the Euclidean geometry from that
theory, while Schwabhäuser and Szczerba, 1975 studied line-based structures
for elementary geometry.


While the post-Tarski period in the logical foundations of geometry is less 
active and spectacular, still there are several research lines which deserve dis- 
cussion. Besides the notable works of Tarski’s students mentioned above, they 
include: 


■ the study of classical and constructive axiomatizations of fragments of
projective, affine, absolute, Euclidean, elliptic, hyperbolic, etc. geometries,
with emphasis on simplicity and minimality, in Pambuccian,
1989, von Plato, 1995, Lombard and Vesley, 1998, Pambuccian, 2001a,
Pambuccian, 2001b, Pambuccian, 2006, etc. For a general discussion of
the axiomatics of affine and projective geometry, see Bennett, 1995.


■ the investigation of primitive relations sufficient for the elementary affine,
projective, absolute, etc. geometries; the expressiveness of such relations;
and axiomatizations in terms of such relations, in Scott, 1956, Pambuccian,
1995, Pambuccian, 2003, Pambuccian, 2004, etc.


■ the development of practical methods and algorithms for theorem proving
in algebra and geometry: quantifier elimination based methods, such as
Seidenberg’s implementation of Tarski’s method, Seidenberg, 1954, the
method of cylindrical algebraic decompositions (see e.g. Caviness and
Johnson, 1998, Buchberger et al., 1988), the more recent and efficient
methods of Heintz et al., 1990, Renegar, 1992, Basu et al., 1996, Basu, 1999, the
Gröbner basis method (Buchberger, 1985), the characteristic set method
(Chou and Gao, 1990) and others. For more details and references see
Sec. 9.2.


Most of the studies and results mentioned above apply to geometric structures 
of which the logical languages are rich enough to express properties of ordering 
and metric. However, there are various weaker, yet natural and important, geo- 
metric structures such as parallelism, orthogonality, incidence and collinearity 
structures, which involve points and lines in a real or abstract geometric space. 
The elementary theories of these latter structures are considerably less studied, 
mainly from the perspective of discrete and combinatorial geometry. We will 
discuss these structures and their theories in some detail here, as they play an 
important role in various models of qualitative spatial reasoning. 

While affine and absolute spaces are too general to allow the development of 
a full-fledged elementary geometry in them, they are still amenable to algebraic 
treatment by means of coordinatization, which enables the study of affine and 
projective spaces by studying algebraic structures called ternary rings (see e.g. 
Blumenthal, 1961, Heyting, 1963, Szmielew, 1983, Hughes and Piper, 1973, 
Mihalek, 1972). Since the coordinatization is a first-order interpretation, it is 
instrumental for the algebraic investigation and characterization of the logical 
theories of affine spaces, and can be used to establish various logical properties, 
such as independence results, representation theorems, (lack of) finite model 
property, decidability and complexity results of these theories. 

In this chapter we survey and discuss from a logical perspective structures 
and theories of parallelism, orthogonality, incidence and order, gradually build- 
ing the full elementary geometry of Euclidean spaces, in Tarski’s sense. Besides 
traditional geometric properties and constructions, we discuss various logical 
issues such as: definability of relations and properties, expressiveness of con- 
cepts, axiomatic theories and their models, representation results and complete- 
ness, finite model property, decidability, categoricity and other model-theoretic 
properties. 

The chapter consists of two parts. In the first part we discuss classical, first- 
order theories of geometric structures, starting with very weak structures of 
parallelism (Sec. 3), orthogonality (Sec. 4), incidence (Sec. 5) and collinearity,
for which we show how to develop some geometric concepts, such as independence,
basis, planarity and dimension. In Sec. 6 and Sec. 7 we outline
coordinatization of projective and affine planes as a general method of inter- 
preting them into algebraic structures called planar ternary rings, and discuss 
the relationship between geometric and algebraic properties, and generally be- 
tween the logical theories of planes and the associated coordinate rings. We 
also discuss collineations and general affine transformations, and the associated 
(invariant) affine concepts and properties. In Sec. 8 we then add betweenness 
and order in affine planes, discuss definability in these planes, and the relation- 
ship of these planes with ordered coordinate rings, as well as the results from 
Szczerba and Tarski, 1965, Szczerba and Tarski, 1979 on axiomatic theories of 
betweenness. Eventually we consider some rich languages, i.e. languages con- 
taining primitive notions in terms of which the whole of elementary geometry 
can be developed, and present Tarski’s axiomatization of the Euclidean geom- 
etry in terms of betweenness and equidistance. The first part of the chapter, 
dealing with elementary theories of geometry, ends with a brief discussion in 
Sec. 9.2 of the development of decision methods for elementary geometry since 
Tarski’s seminal decidability results, and automated reasoning for elementary 
geometry. 

The second part of the chapter is devoted to modal logics arising from clas- 
sical, mainly two-dimensional geometrical structures. After a short general 
discussion of spatial modal logics in Sec. 10, we consider modal logics of sev- 
eral sorts: point-based (Sec. 11), line-based (Sec. 12), and point-line based 
logics (Sec. 14) with incidence relations between the sorts, defining affine or 
projective incidence structures. In Sec. 13 we show how two-sorted relational 
structures based on points and lines can be replaced by one-sorted relational 
structures containing the same geometrical information, and how modal logic 
can be developed on such structures. In Sec. 14 we discuss point-line spa- 
tial logics and show how modal languages can be interpreted on two-sorted 
relational structures. 


2. Preliminaries
2.1 Some terminology and notation 


The following notions will be introduced more than once in this chapter. 
Here we only fix the notation and terminology used further (unless otherwise 
specified) for the convenience of the reader. 

We deal with two basic geometric objects: points and lines. 


Points. Points are usually considered primitive concepts, but as we shall see, 
they can also be defined in terms of co-punctual lines. Specific points will be 
denoted as A, B, C etc. and typical point variables will be X, Y, Z, etc. Basic
relations on points are collinearity, denoted as Col(X Y Z), meaning that the
points X, Y and Z lie on a common line (sometimes generalized to n points);
betweenness, denoted as B(X Y Z), meaning that Y lies on the line segment
joining X and Z (with possibly Y coinciding with X or Z); and equidistance,
denoted $XY \equiv ZU$ and meaning that the line segment formed by X and Y
has the same length as the line segment formed by Z and U. Using any of
these, one can define the triangle relation, which holds when three points are 
non-collinear. 


Lines. Lines can be introduced as primitive concepts, or defined in terms 
of pairs of different points, or as equivalence classes of points in collinearity 
structures. Specific lines will be denoted as a, b, c, etc. Typical line variables 
will be x, y, z etc. Basic relations on lines are: 

Incidence, denoted as x Inc y, meaning that the lines x and y share a common
point, and may even coincide. Incidence may be generalized to co-punctuality
(concurrence), denoted $\mathrm{Cop}(x_1 \ldots x_n)$, meaning that the lines $x_1, \ldots, x_n$ have
exactly one point in common.

Intersection of two lines, denoted x Int y and meaning that x and y are 
incident but different. 

Co-planarity of lines, denoted Pl(xy) for two lines and generalized to
$\mathrm{Pl}(x_1 \ldots x_n)$ for n lines, meaning that the lines lie in the same plane.

Strict (irreflexive) parallelism, denoted x || y, meaning that the lines x and 
y are parallel and different, and weak (reflexive) parallelism, denoted x ıı y, 
meaning that x and y are parallel or coincide. 

Orthogonality, denoted x ⊥ y, meaning that the lines x and y are orthogonal,
but not necessarily intersecting, and perpendicularity, denoted x ae y, meaning 
that the lines x and y are orthogonal and coplanar (and hence intersecting). 

Skewness, denoted x > y, meaning that x and y are not co-planar. 

Lines can also be defined as sets of points collinear with a pair of points: 
given two distinct points P and Q, the line determined by P and Q, denoted 
l(P, Q), is defined as the set of all points X such that Col(PQX) holds.

Given a point X and a line y, the claim that X is incident with y will be
denoted as X I y or simply as X ∈ y, while, assuming Euclid's parallel postulate,
the unique line parallel with y and containing X will be denoted p(X, y). The
unique line incident with two distinct points X and Y will be denoted as l(X, Y)
or simply as XY, while the line segment between X and Y will be denoted
|XY| and the length of that segment as ||XY||. Given intersecting lines x and
y, their point of intersection will be denoted P(x, y).

For every integer n > 1, Diff_n(X_1 ... X_n) will be the formula stating that
X_1, ..., X_n are distinct, i.e.

\[ \mathrm{Diff}_n(X_1 \dots X_n) := \bigwedge_{\substack{i \neq j \\ 1 \le i, j \le n}} X_i \neq X_j, \]

and likewise for Diff_n(x_1 ... x_n).


2.2 Algebraic background 


The terminology on algebraic structures varies considerably in the literature, 
so we fix ours here. The reader is referred to any standard text in abstract 
algebra, or to Szmielew, 1983 for more details. 

Consider the structure (G; 0, +) where G is a non-empty set, + is a binary 
operation on G and 0 is some distinguished element in G. Then (G; 0, +) is 
called an (additive) loop (with zero 0) if 


1. a + 0 = a = 0 + a for all a ∈ G;
2. a + b = c uniquely determines any of a, b, c from the other two.


Likewise we refer to (G; 1, ·) as a multiplicative loop with unit 1.

If the operation + is associative as well, then (G; 0, +) is called a group. Note 
that the second property above guarantees the existence of additive inverses in 
any group. A group will be called abelian when the operation defined in it is 
commutative. 
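
For a finite carrier set, the loop and group conditions above can be checked mechanically. The following is a minimal sketch, not part of the original text: the function names `is_loop` and `is_group` and the 5-element table `LOOP5` are illustrative assumptions. For a finite G, the second loop axiom amounts to saying that every row and every column of the operation table is a permutation of G.

```python
from itertools import product

def is_loop(elements, op, zero):
    """Check the two loop axioms for a finite structure (G; 0, +)."""
    elements = list(elements)
    # Axiom 1: zero is a two-sided identity.
    if any(op(a, zero) != a or op(zero, a) != a for a in elements):
        return False
    # Axiom 2: in a + b = c, any two of a, b, c determine the third.
    # For finite G: every row and column of the table is a permutation of G.
    target = sorted(elements)
    for a in elements:
        if sorted(op(a, b) for b in elements) != target:
            return False
        if sorted(op(b, a) for b in elements) != target:
            return False
    return True

def is_group(elements, op, zero):
    """A group is a loop whose operation is associative."""
    elements = list(elements)
    return is_loop(elements, op, zero) and all(
        op(op(a, b), c) == op(a, op(b, c))
        for a, b, c in product(elements, repeat=3)
    )

def add_mod5(a, b):
    return (a + b) % 5

def sub_mod5(a, b):
    # Subtraction has no two-sided identity, so it is not a loop operation.
    return (a - b) % 5

# Hypothetical 5-element operation table: a loop that is not a group.
LOOP5 = [
    [0, 1, 2, 3, 4],
    [1, 0, 3, 4, 2],
    [2, 4, 0, 1, 3],
    [3, 2, 4, 0, 1],
    [4, 3, 1, 2, 0],
]

def loop5(a, b):
    return LOOP5[a][b]
```

Here `is_group(range(5), add_mod5, 0)` holds, while `loop5` satisfies both loop axioms but fails associativity, since (1 + 1) + 2 = 2 whereas 1 + (1 + 2) = 4 in that table.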

A structure (F; 0, +, ·) is called a ring if (F; 0, +) is an abelian group and
the multiplication · is both associative and distributive over +.

A structure (G; 0, 1, ·) (where 0 and 1 are distinct distinguished elements
from G) is called a multiplicative loop with zero if


1. (G \ {0}; 1, ·) is a multiplicative loop with unit 1;
2. a · 0 = 0 = 0 · a for all a ∈ G.


Here 0 is the zero of (G; 0, 1, ·) and 1 is its unit. Again (G; 0, 1, ·) will be
called a multiplicative group with zero when · is associative.

(F; 0, 1, +, ·) is called a double loop when


1. (F; 0, +) is an additive loop;
2. (F; 0, 1, ·) is a multiplicative loop with zero.


A double loop (F; 0, 1, +, ·) is called a left division ring (respectively, right
division ring) when (F; 0, +) is an abelian group and · is left-distributive
(respectively, right-distributive) over +. A division ring is a double loop that is
both a left and right division ring. A division ring (F; 0, 1, +, ·) with associative
multiplication · is called a skew field, and if · is also commutative then
(F; 0, 1, +, ·) becomes a field. Note that there is some variation in the literature
regarding these terms; for example, division rings are called quasi-fields in
Szmielew, 1983. By a classical result of Wedderburn, every finite skew field is a
field.

A structure (G; 0, +, <) will be called an ordered loop if (G; 0, +) is a loop
and < is a linear ordering on the set G such that


1. a < b ⇒ c + a < c + b (left additive monotony)


2. a < b ⇒ a + c < b + c (right additive monotony)


for all a, b, c ∈ G. If (G; 0, +) is a group then (G; 0, +, <) will be called an
ordered group, etc.

A structure (F; 0, 1, +, ·, <) will be called an ordered double loop if
(F; 0, 1, +, ·) is a double loop and < is a linear ordering on the set F such that
both left and right additive monotony hold, and


1. a < b ⇒ c · a < c · b (left multiplicative monotony)


2. a < b ⇒ a · c < b · c (right multiplicative monotony)


for all a, b, c ∈ F with c > 0. Likewise if (F; 0, 1, +, ·) is instead, say, a left
division ring, then (F; 0, 1, +, ·, <) will be called an ordered left division ring,
etc. It is easy to see that if (F; 0, 1, +, ·, <) is an ordered double loop then 0 < 1,
and hence F has infinite cardinality, since x < x + 1 for every x ∈ F.

An ordered structure (F; 0, 1, +, ·, <) is Euclidean if for every a ∈ F with
a > 0 there exists b ∈ F such that a = b²; it is real closed if it is Euclidean and
every polynomial of odd degree over F has a zero in F.


2.3 Logical background 


In the treatment of some logical issues, we assume that the reader has back- 
ground on the basic model theory of first-order logic, suitable references on 
which include Doets, 1996, Enderton, 1972 and the very comprehensive and 
more advanced Hodges, 1993. Here we only mention a few more specific 
concepts and results used in the chapter. 


Theories. A (first-order) theory is any set of first-order sentences. A theory
T is complete if every two models of the theory are elementarily equivalent, i.e.,
satisfy the same first-order sentences. A typical example of a complete theory
is the set Th(𝔄) of all first-order sentences satisfied in a given structure 𝔄. A
theory T is ω-categorical (or, countably categorical) if all countable models of
T are isomorphic; T is decidable if there is an algorithm which can determine
if a given sentence is a logical consequence of T. By the Łoś-Vaught Test (see
e.g. Doets, 1996) every ω-categorical theory without finite models is complete,
and it is decidable if it is moreover recursively axiomatizable.


Padoa's method. Let ℒ be a first-order language over some signature S,
let s be a symbol not in S and T a theory over the signature S ∪ {s}. If 𝔄 and
𝔅 are models of T with 𝔄|S = 𝔅|S but s^𝔄 ≠ s^𝔅, then s cannot be defined
by T in ℒ. More generally, if 𝔄 is a model of the theory T and if there exists
an automorphism of 𝔄|S that fails to preserve the symbol s, then s cannot be
defined by T in ℒ. For example, consider the structure (Z; +). The constant
0 is explicitly definable using the formula φ_0(x) := ∀y(x + y = y). From
this we can then explicitly define subtraction using the formula φ_−(x, y, z) :=
∃u∃v(φ_0(u) ∧ y + v = u ∧ x + v = z). To show that multiplication · is
not definable in (Z; +), simply note that the automorphism h of (Z; +) given
by h(x) = −x does not preserve multiplication, since in general −(x · y) ≠
(−x) · (−y).
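
The Padoa argument for (Z; +) can be spot-checked numerically. The sketch below is only an illustration on a finite sample of integers (the range `sample` is an arbitrary choice, and a finite check is of course not a proof): it confirms that h(x) = −x commutes with addition on the sample but not with multiplication.

```python
def h(x):
    """The candidate automorphism of (Z; +): negation."""
    return -x

sample = range(-5, 6)

# h preserves addition: h(x + y) = h(x) + h(y) for all sampled pairs.
preserves_add = all(h(x + y) == h(x) + h(y) for x in sample for y in sample)

# h does not preserve multiplication: e.g. h(1 * 1) = -1 but h(1) * h(1) = 1.
preserves_mul = all(h(x * y) == h(x) * h(y) for x in sample for y in sample)

# h is injective and maps the (symmetric) sample onto itself.
is_bijection_on_sample = sorted(h(x) for x in sample) == sorted(sample)
```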


Interpretations. Let S be any signature with S_A, S_B ⊆ S, and consider the
structures 𝔄 = (A; S_A) and 𝔅 = (B; S_B). An n-dimensional interpretation
of 𝔅 in 𝔄 consists of the following (the vectors x̄ will refer to n-tuples of
variables):
1. A formula φ(x_1, ..., x_n) over the signature S_A which defines some
relation D_B ⊆ A^n representing the domain B interpreted in 𝔄;
2. A bijective ("decoding") function f : D_B → B;
3. (i) for every constant symbol c ∈ S_B, a formula φ_c(x̄) over S_A that
defines some element c_B ∈ D_B such that f(c_B) = c^𝔅;
(ii) for every m-ary relation symbol r ∈ S_B, a formula φ_r(x̄_1, ..., x̄_m)
over S_A that defines some relation r_B ⊆ D_B^m such that f[r_B] = r^𝔅
(likewise for the equality symbol);
(iii) for every m-ary function symbol g ∈ S_B, a formula
φ_g(x̄_1, ..., x̄_m, x̄_{m+1}) over S_A that defines some function g_B :
D_B^m → D_B such that f(g_B(ā_1, ..., ā_m)) = g^𝔅(f(ā_1), ..., f(ā_m)).
The above definition can be extended to non-injective interpretations and
interpretations between many-sorted structures, see e.g. Hodges, 1993. A
classical example is the 2-dimensional interpretation of the rationals 𝔔 = (Q; +, ·)
in the integers ℨ = (Z; +, ·), as ordinary fractions. Interpretations will be used
in Sec. 7, where affine planes will be interpreted in algebraic structures called
ternary rings.
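
The interpretation of the rationals in the integers can be made concrete as follows. This is a hedged sketch rather than the chapter's own construction: to keep the decoding function bijective, it assumes the domain consists of reduced fractions with positive denominator, and the names `in_domain`, `decode`, `phi_add` and `phi_mul` are hypothetical. The domain conditions are written as Python predicates; both are expressible in (Z; +, ·), since coprimality of a and b is equivalent to the solvability of ua + vb = 1, and positivity can be expressed via sums of four squares.

```python
from fractions import Fraction
from math import gcd

def in_domain(a, b):
    """Domain formula: (a, b) is a reduced fraction with positive denominator."""
    return b > 0 and gcd(a, b) == 1

def decode(pair):
    """Decoding function f from the domain to the rationals."""
    a, b = pair
    return Fraction(a, b)

def phi_add(p, q, r):
    """Graph of addition on pairs, using only integer + and *."""
    (a, b), (c, d), (e, f) = p, q, r
    return (a * d + c * b) * f == e * (b * d)

def phi_mul(p, q, r):
    """Graph of multiplication on pairs, using only integer + and *."""
    (a, b), (c, d), (e, f) = p, q, r
    return (a * c) * f == e * (b * d)

# A finite fragment of the domain, used only to spot-check the definitions.
domain = [(a, b) for a in range(-4, 5) for b in range(1, 5) if in_domain(a, b)]

decode_is_injective = len({decode(p) for p in domain}) == len(domain)
add_is_correct = all(
    phi_add(p, q, r) == (decode(p) + decode(q) == decode(r))
    for p in domain for q in domain for r in domain
)
mul_is_correct = all(
    phi_mul(p, q, r) == (decode(p) * decode(q) == decode(r))
    for p in domain for q in domain for r in domain
)
```

Dropping the reducedness condition would give the more familiar non-injective variant, in which equality of rationals is interpreted by the relation a · d = c · b.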




















3. Structures and theories of parallelism 


We begin our study with very weak and simple structures which consist of
a set of lines subject only to the relation of parallelism (besides equality). We
will provide a definitive axiomatic description of such structures which can
be extracted from the real Euclidean space of any dimension. In particular,
it will turn out that the relation of line parallelism is too weak to distinguish
dimensions greater than n = 1.

By a line parallelism frame, or simply a parallelism frame, we mean any 
structure of the form (Li, ||), where || is a binary relation called parallelism over 
a non-empty set Li of which the elements are called lines. When the relation || 
holds for two lines x and y, we will use phrases such as x is parallel to y, etc. 

A pre-model of parallelism is a parallelism frame (Li, ||) satisfying the fol- 
lowing conditions: 


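\[ \mathrm{Sym}_{\parallel}:\quad \forall x \forall y\, (x \parallel y \to y \parallel x) \qquad \text{(symmetry)} \]

\[ \mathrm{PTran}_{\parallel}:\quad \forall x \forall y \forall z\, (x \parallel y \wedge y \parallel z \to x = z \vee x \parallel z) \qquad \text{(pseudo-transitivity)} \]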


A pre-model of parallelism in which the parallelism relation is reflexive
(respectively, irreflexive) will be called a model of weak parallelism (respectively,
a model of strict parallelism). Thus, models of weak and strict parallelism must
satisfy respectively the axioms:

\[ \mathrm{Ref}_{\parallel}:\ \forall x\, (x \parallel x), \qquad \text{and} \qquad \mathrm{Irr}_{\parallel}:\ \forall x\, \neg (x \parallel x). \]
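
On a finite parallelism frame, symmetry, pseudo-transitivity, reflexivity and irreflexivity can be verified by brute force. The sketch below is illustrative only: the helper names and the encoding of lines as hypothetical (slope, intercept) pairs are assumptions, not notation from the text.

```python
def is_symmetric(lines, par):
    """Symmetry: x || y implies y || x."""
    return all(par(y, x) for x in lines for y in lines if par(x, y))

def is_pseudo_transitive(lines, par):
    """Pseudo-transitivity: x || y and y || z imply x = z or x || z."""
    return all(
        x == z or par(x, z)
        for x in lines for y in lines for z in lines
        if par(x, y) and par(y, z)
    )

def classify(lines, par):
    """Classify a finite frame (lines, par) according to the definitions above."""
    if not (is_symmetric(lines, par) and is_pseudo_transitive(lines, par)):
        return "not a pre-model"
    if all(par(x, x) for x in lines):
        return "weak"
    if not any(par(x, x) for x in lines):
        return "strict"
    return "pre-model only"

# Lines encoded as (slope, intercept) pairs; this encoding is an assumption.
lines = [(0, 0), (0, 1), (1, 0), (1, 2), (2, 5)]

def strict_par(x, y):
    return x[0] == y[0] and x[1] != y[1]

def weak_par(x, y):
    return x[0] == y[0]

def crossing(x, y):
    # "Different slopes" is symmetric but not pseudo-transitive.
    return x[0] != y[0]
```

With these definitions, `classify(lines, strict_par)` gives `"strict"`, `classify(lines, weak_par)` gives `"weak"`, and `classify(lines, crossing)` gives `"not a pre-model"`.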


Hereafter, unless otherwise specified, by parallelism we will mean strict
parallelism, and weak parallelism will be denoted by the symbol ıı. Clearly,
these are definable in terms of one another:

\[ x \mathrel{ıı} y \;\Leftrightarrow\; x \parallel y \vee x = y, \qquad x \parallel y \;\Leftrightarrow\; x \mathrel{ıı} y \wedge x \neq y. \]


Thus models of weak parallelism are simply equivalence relations, while
models of strict parallelism are isomorphic to disjoint unions of relational
structures of the form (W, ≠), where ≠ is the difference relation over some
non-empty set W. Given a model of strict parallelism (Li, ||) and any line x ∈ Li,
the set of lines {x} ∪ {y ∈ Li : y || x} will be called the parallel class
(containing x). The property that a model of strict parallelism contains infinitely
many parallel classes can be modelled using the scheme Par_|| consisting of the
axioms

\[ \exists x_1 \dots \exists x_k \Big( \mathrm{Diff}_k(x_1 \dots x_k) \wedge \bigwedge_{i \neq j} \neg (x_i \parallel x_j) \Big) \]

for every natural k > 1.
In models of strict parallelism, it is possible for a line to be parallel with
no other line. A model of parallelism will be called k-serial if it satisfies the
property

\[ \forall x\, \exists y_1 \dots \exists y_k \Big( \mathrm{Diff}_k(y_1 \dots y_k) \wedge \bigwedge_{i=1}^{k} y_i \parallel x \Big). \]

A model that is k-serial for every natural k > 1 will be called infinitely serial,
and the scheme specifying that a model is infinitely serial, consisting of all the
above axioms for k > 1, will be denoted as Ser_||.

A model of strict parallelism is real if it consists of (not necessarily all) lines 
in the real plane, with the usual relation of strict parallelism. 

Given a line u, let ū denote the parallel class of u. Now, with every such
parallel class ū we associate a real number m_ū meant to represent the slope of
the lines in ū in some arbitrarily fixed orthogonal coordinate system in the real
plane, so that the mapping m is injective. Then each line v in the class
ū can be mapped to a unique real number b_v (distinct lines in the same class
receiving distinct numbers), and the line v is identified with
the line in the real plane having equation y = m_ū x + b_v. Thus, we have the
following elementary characterization of line parallelism in R^n.


PROPOSITION 7.1 Every model of strict parallelism of cardinality not greater 
than the continuum is isomorphic to a real model. 
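
For a finite model the slope-and-intercept construction behind this proposition can be carried out explicitly. The following sketch assumes the abstract model is given as a list of parallel classes, each a list of line labels; the names `real_representation` and `strictly_parallel` and the choice of integer-valued slopes and intercepts are illustrative assumptions.

```python
def real_representation(parallel_classes):
    """Map each abstract line to (slope, intercept) of a real line y = m*x + b.

    Distinct classes get distinct slopes; distinct lines in one class
    get distinct intercepts.
    """
    rep = {}
    for slope, cls in enumerate(parallel_classes):
        for intercept, line in enumerate(cls):
            rep[line] = (float(slope), float(intercept))
    return rep

def strictly_parallel(l1, l2):
    """Strict parallelism of real, non-vertical lines given as (slope, intercept)."""
    return l1[0] == l2[0] and l1[1] != l2[1]

# A small abstract model of strict parallelism with three parallel classes.
classes = [["a", "b", "c"], ["d"], ["e", "f"]]
class_of = {line: i for i, cls in enumerate(classes) for line in cls}
rep = real_representation(classes)

# The representation is injective and preserves strict parallelism both ways.
is_injective = len(set(rep.values())) == len(rep)
preserved = all(
    strictly_parallel(rep[u], rep[v]) == (u != v and class_of[u] == class_of[v])
    for u in rep for v in rep
)
```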


By taking the mappings m and b to be surjective, after setting aside a parallel 
class for the vertical lines, we obtain: 


PROPOSITION 7.2 Every model of strict parallelism in which there are continuum
many parallel classes, each of them with the cardinality of the continuum,
is isomorphic to the model of strict parallelism consisting of all lines in R².


COROLLARY 7.3 For every natural n > 2, the model of strict parallelism
consisting of all lines in R^n is isomorphic to the model of strict parallelism
consisting of all lines in R².


PROPOSITION 7.4 The theory of the class of models of strict parallelism
which satisfy the schemes Par_|| and Ser_|| is ω-categorical, and hence complete
and decidable.


Indeed, let 𝔄 and 𝔅 be two countable models of strict parallelism satisfying
the schemes Par_|| and Ser_||. Then 𝔄 contains countably many parallel classes,
each of them of countable cardinality, and likewise for 𝔅. Let γ be any bijection
between the parallel classes of 𝔄 and the parallel classes of 𝔅, and for every
parallel class x in 𝔄, let γ_x be a bijection between the lines in x and the lines
in γ(x). Then the line u in 𝔄 lying in the parallel class x is mapped to the line
γ_x(u) in 𝔅, and this establishes an isomorphism between 𝔄 and 𝔅.

The completeness and decidability now follow by the Łoś-Vaught Test.

Since the theory of strict parallelism is reducible to the first-order theory of
equality, from Stockmeyer, 1977 it follows that the theory of strict parallelism
is PSPACE-complete.


4. Structures and theories of orthogonality
4.1 Orthogonality frames and dimension 


By a line orthogonality frame, or simply orthogonality frame, we will mean
any structure of the form (Li, ⊥), where Li is a set, the elements of which
will be called lines, and ⊥ is a binary relation on Li, called the orthogonality
relation. Lines x and y satisfying x ⊥ y will be called orthogonal. If x and y
are both orthogonal as well as incident, then they will be called perpendicular,
denoted «ly. For n > 1, dimension can be defined in an orthogonality frame 


using the conjunction of the sentences dim^n and Dim^n, given as

\[ \mathrm{dim}^n:\quad \exists x_1 \dots \exists x_n \Big( \bigwedge_{i \neq j} x_i \perp x_j \Big), \]

\[ \mathrm{Dim}^n:\quad \neg \exists x_1 \dots \exists x_n \exists x_{n+1} \Big( \bigwedge_{i \neq j} x_i \perp x_j \Big). \]

Clearly Dim^n ≡ ¬dim^{n+1}. An orthogonality frame satisfies the property
of n-dimensionality (for n > 1) if both sentences dim^n and Dim^n hold in
that frame. A frame that satisfies 2-dimensionality will also be called planar.

When dealing with orthogonality frames, the expression x_1 ıı x_2 will be an
abbreviation for the formula

\[ \forall y\, (y \perp x_1 \leftrightarrow y \perp x_2). \tag{7.1} \]


In the context of orthogonality frames, by parallelism we will mean weak
parallelism, and will say that lines x_1 and x_2 are parallel when x_1 ıı x_2 in the sense
of (7.1). Clearly, the binary relation defined by ıı is an equivalence relation.

From the class of all orthogonality frames, we single out those which satisfy 
the additional axiom 


\[ \mathrm{Pen}_{\perp}:\quad \forall x_1 \forall x_2\, (x_1 \neq x_2 \to \neg (x_1 \mathrel{ıı} x_2)). \]


Such orthogonality frames will be called pencils, by analogy with a pencil
being a collection of co-punctual lines. However, our pencils of orthogonality
are not pencils in the strict sense. The axiom Pen_⊥ simply states that there may
be no parallel lines, and this mimics pencil structure. But the axioms do not
exclude models where the lines are not all co-punctual. In fact, incidence is not
even definable from orthogonality, so that it is futile to try and axiomatize
co-punctuality of lines in orthogonality frames. For example, let Li(R^n) be the set of
all lines in the Euclidean space R^n, and define ℜ^n := (Li(R^n); ⊥), where ⊥ is the
Euclidean orthogonality relation. By using the method of Padoa (i.e. finding
an automorphism of ℜ^n which fails to preserve incidence) we can obtain the
following.


PROPOSITION 7.5 For n ≥ 3, the relation of line incidence is not definable
in ℜ^n.




Orthogonality pencils can be obtained from orthogonality frames by factoring
over parallel classes: if 𝔄 is any orthogonality frame, then 𝔄/ıı, the quotient
structure of 𝔄 induced by the parallelism relation defined by (7.1), is an
orthogonality pencil; parallelism reduces to equality in orthogonality pencils.

We will call an n-dimensional orthogonality frame real if it can be
isomorphically embedded in R^n with the Euclidean orthogonality relation, where two
lines are orthogonal when the dot product of their direction vectors is 0.


4.2 Planar orthogonality frames 


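DEFINITION 7.6 A 2-dimensional model of orthogonality, or simply a planar
model of orthogonality, is an orthogonality frame (Li, ⊥) with Li non-empty
and subject to the following axioms:

\[ \mathrm{Irr}_{\perp}:\quad \forall x\, \neg (x \perp x) \]

\[ \mathrm{Sym}_{\perp}:\quad \forall x \forall y\, (x \perp y \to y \perp x) \]

\[ \mathrm{prd}^2:\quad \forall x\, \exists y\, (y \perp x) \]

\[ \mathrm{Prd}^2:\quad \forall x \forall y_1 \forall y_2\, (y_1 \perp x \wedge y_2 \perp x \to y_1 \mathrel{ıı} y_2) \]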








The axioms Irr_⊥ and Sym_⊥ specify respectively the irreflexivity and symmetry
of ⊥, while prd^2 and Prd^2 combined state that, up to parallelism, every line
has a unique line orthogonal to it. It can easily be verified that the axioms
dim^2 and Dim^2 hold in these structures. It is useful to note that Goldblatt,
1987 studies orthogonality structures which admit self-orthogonal lines (lines
which lie orthogonal to themselves) as well as singular lines (lines which lie
orthogonal to all lines in the structure).
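
The axioms of a planar model of orthogonality can be tested directly on finite frames. The sketch below is an illustration under stated assumptions: parallelism is taken to be the defined relation "x_1 and x_2 have exactly the same orthogonal lines", and the function names and the two toy frames are hypothetical.

```python
def parallel(lines, orth, x1, x2):
    """Defined (weak) parallelism: x1 and x2 are orthogonal to the same lines."""
    return all(orth(y, x1) == orth(y, x2) for y in lines)

def is_planar_model(lines, orth):
    """Check irreflexivity, symmetry, and existence and uniqueness (up to
    parallelism) of an orthogonal line, on a finite frame."""
    irreflexive = not any(orth(x, x) for x in lines)
    symmetric = all(orth(y, x) for x in lines for y in lines if orth(x, y))
    exists = all(any(orth(y, x) for y in lines) for x in lines)
    unique = all(
        parallel(lines, orth, y1, y2)
        for x in lines for y1 in lines for y2 in lines
        if orth(y1, x) and orth(y2, x)
    )
    return bool(lines) and irreflexive and symmetric and exists and unique

# Toy planar frame: horizontal and vertical lines, tagged by direction.
grid = [("h", 0), ("h", 1), ("v", 0)]

def grid_orth(x, y):
    return x[0] != y[0]

# Three mutually orthogonal lines (like coordinate axes in space).
axes = ["x", "y", "z"]

def axes_orth(x, y):
    return x != y
```

The frame `grid` satisfies all four axioms, whereas `axes` fails the uniqueness axiom: the lines `"y"` and `"z"` are both orthogonal to `"x"` but are not parallel to each other.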
We will also make use of the following axiom schemes: 


■ Inf_⊥ := {λ_k}_{k ∈ N}, stating the existence of infinitely many lines, where
λ_k := ∃x_1 ... ∃x_k (⋀_{i≠j} x_i ≠ x_j);

■ Ser_⊥, stating that every parallel class has infinite cardinality, consisting
of the axioms ∀x ∃y_1 ... ∃y_k (Diff_k(y_1 ... y_k) ∧ ⋀_{i=1}^{k} y_i ıı x) for every
k ∈ N;

■ Par_⊥, stating that there are infinitely many parallel classes, consisting of
the axioms ∃x_1 ... ∃x_k (⋀_{i≠j} ¬(x_i ıı x_j)) for every k ∈ N.

















Using an approach similar to the proof of Proposition 7.1, we can map lines 
in an abstract planar orthogonality model to lines in the real plane, to obtain the 
following. 


PROPOSITION 7.7 (REPRESENTATION THEOREM) Every planar model of
line orthogonality with cardinality at most the continuum is isomorphic to a real
planar model of line orthogonality.

By bijectively associating pairs of mutually orthogonal parallel classes in
any two countable planar orthogonality models, we furthermore obtain the
following.


PROPOSITION 7.8 The theory of the class of planar orthogonality models
satisfying the schemes Ser_⊥ and Par_⊥ is ω-categorical.


COROLLARY 7.9 The theory of the class of planar orthogonality models
satisfying the schemes Ser_⊥ and Par_⊥ is complete and decidable.


4.3 Orthogonality frames in higher dimensions 


Given an n-dimensional orthogonality frame, a set of lines x_1, ..., x_k in that
frame (with n > 1 and k ≤ n) will be called independent when the formula

(7.2) LI^k(x_1, ..., x_k) := … [right-hand side not recoverable from the source]

is satisfied. Lines which are not independent will be called dependent. The
formula (7.2) takes k ≤ n arguments. For k > n define the lines x_1, ..., x_k to
be dependent and the formula LI^k(x_1, ..., x_k) to be false.


Say that y lies in the span of x_1, ..., x_k when the following formula holds:

\[ \mathrm{Span}(x_1, \dots, x_k, y) := \forall z \Big( \bigwedge_{i=1}^{k} z \perp x_i \to z \perp y \Big). \tag{7.3} \]

Independence and span of lines are abstractions of the notions of linear
independence and span of vectors in a vector space.


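DEFINITION 7.10 An orthogonality frame (Li, ⊥) with Li non-empty will be
called an n-dimensional model of orthogonality, where n ≥ 3, if it satisfies the
axioms Irr_⊥, Sym_⊥ and Dim^n, together with the axioms

\[ \mathrm{prd}^n:\quad \forall x_1 \dots \forall x_{n-1}\, \exists y \Big( \bigwedge_{i=1}^{n-1} y \perp x_i \Big) \]

\[ \mathrm{Prd}^n:\quad \forall x_1 \dots \forall x_{n-1} \Big( \mathrm{LI}^{n-1}(x_1, \dots, x_{n-1}) \to \forall y_1 \forall y_2 \Big( \bigwedge_{i,j} y_i \perp x_j \to y_1 \mathrel{ıı} y_2 \Big) \Big) \]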


From the axiom prd^n it is immediate that the axiom dim^n will hold in all
n-dimensional orthogonality models, and in the case where n = 3 it can also be
shown that the axiom Dim^3 may be dropped. The axioms prd^n and Prd^n
imply that every n-dimensional orthogonality model has the property

\[ \forall x_1 \dots \forall x_{n-1} \Big( \mathrm{LI}^{n-1}(x_1, \dots, x_{n-1}) \to \exists y \Big( \bigwedge_{i=1}^{n-1} y \perp x_i \wedge \forall z \Big( \bigwedge_{i=1}^{n-1} z \perp x_i \to z \mathrel{ıı} y \Big) \Big) \Big), \tag{7.4} \]

i.e. every n − 1 independent lines x_1, ..., x_{n−1} have a unique parallel class
that lies orthogonal to all of the x_i, which we shall call the product of
x_1, ..., x_{n−1} and denote as x_1 × ··· × x_{n−1}. Line products are an abstraction
of the vector cross product in R³. In the context of pencils, x_1 × ··· × x_{n−1} will
not be a parallel class, but simply a single line. Note that the operation × is not
total, but only defined on tuples of independent lines. In the 3-dimensional case,
the formula (7.4) reduces to

\[ \forall x_1 \forall x_2 \big( \neg (x_1 \mathrel{ıı} x_2) \to \exists y\, (y \perp x_1, x_2 \wedge \forall z\, (z \perp x_1, x_2 \to z \mathrel{ıı} y)) \big), \]

i.e. every two non-parallel lines have a unique parallel class orthogonal to both
of them.
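
In the real 3-dimensional case the line product can be computed on direction vectors with the ordinary cross product. The sketch below is an illustration, not the chapter's construction: lines are represented only by integer direction vectors (an assumption made to keep the arithmetic exact), and uniqueness up to parallelism is merely spot-checked on a small grid of candidate directions.

```python
from itertools import product

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def cross(u, v):
    """Vector cross product in R^3."""
    return (
        u[1] * v[2] - u[2] * v[1],
        u[2] * v[0] - u[0] * v[2],
        u[0] * v[1] - u[1] * v[0],
    )

def orthogonal(u, v):
    """Two lines are orthogonal when their direction vectors have dot product 0."""
    return dot(u, v) == 0

def parallel(u, v):
    """Weak parallelism of directions: the vectors are linearly dependent."""
    return cross(u, v) == (0, 0, 0)

# Directions of two non-parallel lines.
x1, x2 = (1, 0, 0), (1, 1, 0)
y = cross(x1, x2)  # direction of the product class x1 x x2

# Every sampled non-zero direction orthogonal to both x1 and x2 is parallel to y.
candidates = [z for z in product(range(-2, 3), repeat=3) if z != (0, 0, 0)]
unique_up_to_parallelism = all(
    parallel(z, y)
    for z in candidates
    if orthogonal(z, x1) and orthogonal(z, x2)
)
```

For parallel input directions the cross product is the zero vector, which is not a direction at all; this matches the remark that × is only defined on independent lines.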

The axioms used above for the formalization of orthogonality in higher
dimensions illustrate the novel expressive power of orthogonality, but they do
not constitute a complete first-order axiomatization of the class of orthogonality
structures in dimension n ≥ 3. To our knowledge, the complete axiomatization
of the first-order theory of orthogonality in these dimensions has not
been established yet. Unlike the case for planar orthogonality models, it can be
shown that the theory of line orthogonality in Euclidean n-space is not countably
categorical for n ≥ 3, and this negative result indicates that the problem
of identifying this theory is presumably difficult.

Since orthogonality is a metric notion - arguably the simplest and most intuitive
of all metric line notions - it has great expressive power, and one anticipates
that its theory will capture a non-trivial and substantial fragment of that of the
full Euclidean geometry, as witnessed by the fact that notions like linear
independence and span of vectors can be abstracted and expressed in the language
of orthogonality.





5. Two-sorted point-line incidence spaces 


In this section, we consider point-line incidence structures, described by a 
two-sorted first-order language with equality, equipped with sorts for points 
and lines and the intersort relation of incidence. 


5.1 Point-line incidence frames 


A point-line incidence frame is a two-sorted structure (Po, Li, I) where Po
and Li are non-empty sets and I ⊆ Po × Li is a symmetric incidence relation
between them. The elements of Po are called points, and the elements of Li
are called lines. If the relation I holds for a point X and a line x then we use
expressions like X is incident with x, X lies on x, X belongs to x, x passes
through X, x contains X etc. When X I z and Y I z we also say that the line
z connects the points X and Y, while saying that the point X is in the intersection
of the lines y and z will mean that X I y and X I z.

We say that the lines x and y are incident, denoted x Inc y, if they are
incident with a common point, formally

\[ x \;\mathrm{Inc}\; y := \exists Z\, (Z \mathrel{I} x \wedge Z \mathrel{I} y). \]

Further, we say that the lines x and y are intersecting, denoted x Int y, if
they are incident and different. Formally

\[ x \;\mathrm{Int}\; y := x \neq y \wedge x \;\mathrm{Inc}\; y. \]

Given an arbitrary incidence frame (Po, Li, I), we also introduce the relation
of collinearity of three points

\[ \mathrm{Col}(XYZ) := \exists x\, (X \mathrel{I} x \wedge Y \mathrel{I} x \wedge Z \mathrel{I} x) \]

and that of co-punctuality of lines

\[ \mathrm{Cop}(x_1 \dots x_n) := \exists X \Big( \bigwedge_{i=1}^{n} X \mathrel{I} x_i \Big). \]

Thus incidence of lines is a special case of co-punctuality of lines. 
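
The defined relations Inc, Int, Col and Cop translate directly into code over a finite incidence frame. The class below is a minimal sketch: the class name `IncidenceFrame`, its method names, and the triangle example are illustrative assumptions, with the incidence relation stored as a set of (point, line) pairs.

```python
class IncidenceFrame:
    """A finite two-sorted structure (Po, Li, I), with I given as (point, line) pairs."""

    def __init__(self, points, lines, incidence):
        self.points = set(points)
        self.lines = set(lines)
        self.I = set(incidence)

    def inc(self, x, y):
        """x Inc y: some point is incident with both lines."""
        return any((Z, x) in self.I and (Z, y) in self.I for Z in self.points)

    def intersect(self, x, y):
        """x Int y: the lines are incident and different."""
        return x != y and self.inc(x, y)

    def col(self, X, Y, Z):
        """Col(XYZ): some line is incident with all three points."""
        return any(all((P, x) in self.I for P in (X, Y, Z)) for x in self.lines)

    def cop(self, *lines):
        """Cop(x1 ... xn): some point is incident with all the given lines."""
        return any(all((X, x) in self.I for x in lines) for X in self.points)

# Example: a triangle with vertices A, B, C and sides ab, bc, ca.
triangle = IncidenceFrame(
    points="ABC",
    lines=["ab", "bc", "ca"],
    incidence={
        ("A", "ab"), ("B", "ab"),
        ("B", "bc"), ("C", "bc"),
        ("C", "ca"), ("A", "ca"),
    },
)
```

In this frame any two sides intersect, the three vertices are not collinear, and the three sides are not co-punctual, although any two of them are.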





5.2 Linear spaces of incidence 


Linear spaces (not in the sense of vector spaces) are the most general inci- 
dence structures which are geometrically meaningful. It is instructive to note 
that a number of fundamental concepts in vector spaces, such as independence, 
basis and dimension can be generalized to linear spaces. The following defini- 
tion reflects Hilbert’s axioms for incidence (see Karzel et al., 1973). 


DEFINITION 7.11 A linear space (aka incidence geometry or incidence basis 
in Mihalek, 1972) is an incidence frame (Po, Li, I) in which the following 
axioms hold: 


LS1 Every two distinct points are incident with a unique common line. 
LS2 Every line passes through at least two points. 


Given distinct points X and Y in a linear space, the unique line incident with
both of them will be denoted by l(X, Y). Thus, the expression l(X, Y) assumes
that X and Y are distinct. Furthermore, if two lines x and y in a linear space
intersect, then by LS1 they have a unique common point, hereafter denoted as
P(x, y) and called the intersection of x and y. We will only use this notation
in the case of intersecting lines.
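
Axioms LS1 and LS2 are easy to test on finite examples. The sketch below assumes each line is represented by the set of points incident with it, which is harmless in a linear space because LS1 and LS2 make a line determined by its points. The function name and the two examples (the tetrahedron and the seven-point Fano plane) are illustrative choices.

```python
from itertools import combinations

def is_linear_space(points, lines):
    """Check LS1 and LS2 for a finite frame whose lines are sets of points."""
    points = set(points)
    if not points or not lines or not all(l <= points for l in lines):
        return False
    # LS1: every two distinct points lie on exactly one common line.
    ls1 = all(
        sum(1 for l in lines if X in l and Y in l) == 1
        for X, Y in combinations(sorted(points), 2)
    )
    # LS2: every line passes through at least two points.
    ls2 = all(len(l) >= 2 for l in lines)
    return ls1 and ls2

# Tetrahedron: four vertices, lines are the six two-point subsets.
TETRA_POINTS = range(4)
TETRA_LINES = [frozenset(p) for p in combinations(range(4), 2)]

# Fano plane: seven points, seven lines of three points each.
FANO_POINTS = range(7)
FANO_LINES = [frozenset(l) for l in (
    (0, 1, 2), (0, 3, 4), (0, 5, 6), (1, 3, 5), (1, 4, 6), (2, 3, 6), (2, 4, 5)
)]

# Removing one line from the Fano plane breaks LS1.
BROKEN_LINES = FANO_LINES[:-1]
```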


5.3 Linear subspaces, independence, bases and dimension 


To begin with, linear spaces can be regarded as two-sorted algebraic structures,
with two partial operations: one applied to two different points produces
the unique line passing through them, and the other applied to two intersecting
lines produces their intersection point. Thus, a subspace of a linear space could
be defined as a non-empty substructure which is closed under these partial
operations. However, this definition also allows subspaces consisting of just one
line from the space, and two or more, but not all, points on that line. It is more
natural to require that a subspace contains with every line in it all points on
that line. Therefore, by a (linear) subspace (linear variety in Gemignani, 1971)
of a linear space ℒ we will mean every substructure ℒ' = (Po', Li', I) of the
incidence frame ℒ which is itself a linear space and in which all points lying on
lines in Li' are in Po'. Note that any non-empty intersection of a family of subspaces
(i.e. the incidence structure in which the sets of lines and points are the respective
non-empty intersections of the families of lines and points of the spaces in the
family) is a subspace itself. The subspace ℒ' of ℒ is generated by the pair of sets
of points and lines (P, L) in ℒ, denoted here ℒ' = [P, L], if it is the smallest
subspace of ℒ containing the points in P and the lines in L (i.e., the intersection
of all such subspaces). Alternatively, ℒ' can be obtained from (P, L) by a finite
number of successive steps of adding the line passing through two given points
and adding all points lying on a given line. Clearly, it suffices to generate
subspaces starting from sets of points only; then we write simply ℒ' = [P].

A set of points P in a linear space is independent if none of the points of 
P belongs to the subspace generated by the rest of P. An independent set of 
points which generates (paired with the empty set of lines) a given subspace 
is called a basis of that subspace. The reader is also referred to Ch. 12 for a 
discussion of matroids and independent sets. 

Note that, unlike vector spaces, not all bases in a linear space need to have 
the same cardinality; for an example, see e.g. Batten, 1986, Sec. 2.1. However, 
as shown there: 


PROPOSITION 7.12 All bases of a linear space ℒ = (Po, Li, I) have the same
cardinality provided that the space satisfies the following exchange property:
for any P ⊆ Po and X, Y ∈ Po, if X ∉ [P] and X ∈ [P ∪ {Y}] then
Y ∈ [P ∪ {X}].


The dimension of a subspace ℒ' of a linear space ℒ is the least number n such
that ℒ' can be generated by a (clearly independent) set of n + 1 points. Thus,
every subspace containing just one line has dimension 1; a subspace containing
three non-collinear points has dimension at least 2. A subspace with dimension
2 of a linear space ℒ is a (linear) plane in ℒ.

Examples of linear spaces include the usual Euclidean plane and 3D-space,
as well as the points in any open disc in the Euclidean plane or space, where
lines are the intersections of the usual lines in the plane (space) with that disc.
Besides, there is a huge variety of finite linear spaces (see e.g. Batten, 1986,
Mihalek, 1972). For instance, take the vertices of a tetrahedron as points, and
the edges of the tetrahedron as lines, where incidence is standard. Note that this
is the simplest example of a linear space of dimension greater than 2.
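
Generated subspaces and independence can be computed for finite linear spaces by a simple closure procedure. This is a sketch under the assumption that lines are represented as sets of points; `generate` returns the point set of the subspace generated by a set of points, and the names `generate` and `is_independent` are illustrative.

```python
from itertools import combinations

def generate(lines, P):
    """Points of the subspace generated by P: repeatedly add all points of
    any line that already contains two points of the current set."""
    closed = set(P)
    changed = True
    while changed:
        changed = False
        for l in lines:
            if len(l & closed) >= 2 and not l <= closed:
                closed |= l
                changed = True
    return closed

def is_independent(lines, P):
    """No point of P lies in the subspace generated by the rest of P."""
    P = set(P)
    return all(X not in generate(lines, P - {X}) for X in P)

# Tetrahedron: every line has exactly two points.
TETRA = [frozenset(p) for p in combinations(range(4), 2)]

# Fano plane: seven lines of three points each.
FANO = [frozenset(l) for l in (
    (0, 1, 2), (0, 3, 4), (0, 5, 6), (1, 3, 5), (1, 4, 6), (2, 3, 6), (2, 4, 5)
)]
```

In the tetrahedron all four vertices are independent and no three of them generate the fourth, so its dimension is 3. In the Fano plane three non-collinear points such as 0, 1, 3 already generate all seven points, so its dimension is 2.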


5.4 Linear transformations and collineations 


Given linear spaces ℒ = (Po, Li, I) and ℒ' = (Po', Li', I'), a mapping f
from ℒ to ℒ' is a pair of mappings f_Po : Po → Po' and f_Li : Li → Li'.
Such a mapping is a linear transformation if it preserves incidence both ways, i.e.
X I x iff f_Po(X) I' f_Li(x). Thus, the action of a linear transformation on a line is
determined by its action on any two distinct points of the line, and therefore it
suffices to consider linear transformations as mappings on the set of points of a
linear space. It is immediate from the definition that if ℒ has dimension
greater than 1 then every linear transformation on ℒ is injective on the set of
points, and on the set of lines, of ℒ. Thus the notion of linear transformation
is the natural notion of a mapping between linear spaces that preserves their
structure.

An isomorphism between linear spaces is a bijective (on each of the sets of
points and lines) linear transformation. A collineation is an automorphism of
a linear space, i.e. an isomorphism of a linear space onto itself. With ι as the
identity and function inverse and composition as basic operations, the set of all
collineations of a linear space ℒ is a group, called the group of collineations
Aut(ℒ) of ℒ. Many properties of a linear space can be determined by its group
of collineations; for more detail see e.g. Gemignani, 1971, Behnke et al., 1974,
Coxeter, 1969, Hughes and Piper, 1973.


5.5 Parallelism and planarity in linear spaces 


Given a linear space ℒ = (Po, Li, I), one way to define parallel lines in it
is to take intuition from the Euclidean space, where two lines are parallel if
they are co-planar but do not intersect. Thus, we call two lines in ℒ (strictly)
quasi-parallel if they have no common incident point and belong to a subspace
of dimension 2. Note that this relation need not be transitive. For technical
reasons, however, we consider separately the case where every line in the space
is incident with exactly 2 points. We will call such spaces meagre. In meagre
spaces, by strict quasi-parallelism we will mean simply non-incidence.

An alternative definition is based on another intuition from “real Euclidean
geometry”: two lines are parallel if they are not incident, but the diagonals
of every quadrilateral with a pair of opposite sides lying on these lines must
intersect. Formally, we define the relation || of (strictly) parallel lines in ℒ as
follows:

\[ x \parallel y := \neg (x \;\mathrm{Inc}\; y) \wedge \forall X_1 \forall X_2 \forall Y_1 \forall Y_2 \big( (x = l(X_1, X_2) \wedge y = l(Y_1, Y_2)) \to (l(X_1, Y_1) \;\mathrm{Int}\; l(X_2, Y_2) \vee l(X_1, Y_2) \;\mathrm{Int}\; l(X_2, Y_1)) \big). \]

Then we define weak parallelism:

\[ x \mathrel{ıı} y := x \parallel y \vee x = y. \]


Again, in the special case of meagre linear spaces, by strict parallelism we 
mean non-incidence. 

The relation || is irreflexive and symmetric but not necessarily transitive 
either. Still, if two lines in a linear space are parallel, then they are quasi- 
parallel too: take any two pairs of distinct points, one on each of the lines, 
and take the intersection point of the respective “diagonals”; that point together 
with the pair of points on any of the lines generates a subspace of dimension 2 
containing both lines. The converse need not hold, which can be shown by an 
example from Batten, 1986, Sec. 2.1, of a linear space of dimension 2 which 
contains a set of 4 independent points, mentioned in Sec. 5.3. 

Given a linear space ℒ = (Po, Li, I), two lines x, y ∈ Li are called
co-planar, denoted Pl(xy), if they are incident or parallel:

\[ \mathrm{Pl}(xy) := x \;\mathrm{Inc}\; y \vee x \parallel y. \]

A linear space is planar if every two lines in it are co-planar.

Thus, by convention, every meagre linear space is planar.

It is easy to see that a non-meagre linear space ℒ is planar iff it satisfies the
following property:

\[ \forall X_1 \forall X_2 \forall X_3 \forall X_4 \big( l(X_1, X_2) \;\mathrm{Inc}\; l(X_3, X_4) \vee l(X_1, X_3) \;\mathrm{Inc}\; l(X_2, X_4) \vee l(X_1, X_4) \;\mathrm{Inc}\; l(X_2, X_3) \big), \tag{7.5} \]

saying that the diagonals of every quadrilateral must intersect.


We now have two different notions of a “plane” in a linear space, one based 
on dimension, the other on planarity. Note that the tetrahedron is a meagre (and 
hence planar) space of dimension 3. However, if a non-meagre linear space $\mathfrak{L}$ is
planar, then it has dimension 2. Indeed, let A, B, C be any three non-collinear
points in it. Then for any point D in the space, at least one of the pairs of lines
(l(A, B), l(C, D)), (l(A, C), l(B, D)) and (l(A, D), l(B, C)) is incident, say
l(A, B) Inc l(C, D); let X = P(l(A, B), l(C, D)). Then X belongs to
the line l(A, B) and D belongs to the line l(C, X), so D lies in the subspace
generated by A, B and C.

The converse of the claim above need not hold, again by the example from 
Batten, 1986 mentioned above; note that any independent set of 4 points in a 
non-meagre space violates the planarity condition above. 


5.6 Projective spaces and planes 


DEFINITION 7.13 A projective space is a linear space satisfying the following
additional axioms:

PS1 If A, B, C are distinct points and a line l intersects AB and AC in two
distinct points, then it intersects BC as well.
PS2 There are at least four points, no three of which are collinear.


(See also Lenz, 1954.) 

Projective spaces satisfy the exchange property (see Batten, 1986, Sec. 3.9), 
and hence, by Proposition 7.12, every two bases in a projective space have the 
same cardinality, called the rank of the space. The dimension of the space is 
thus defined as 1 less than its rank. 

A projective plane is a projective space of rank 3, i.e. dimension 2. Equiva- 
lently, a projective plane is a projective space in which every two lines intersect; 
in particular, projective planes contain no parallel lines. 

Conversely, one can re-define projective spaces of higher dimension in terms 
of the sub-planes that they contain. For instance, the projective 3D-space can 
be defined (see Hartshorne, 1967, Mihalek, 1972) as a projective space with 
the following additional axioms: 


PS3 There exist at least 4 non-coplanar points. 

PS4 Every three non-collinear points lie on a unique sub-plane. 
PS5 Every line meets every sub-plane in at least one point. 

PS6 Every two sub-planes have at least one common line. 


Since every line in a projective plane intersects all other lines in different 
points, and every point is line-connected with every other point in the plane, 
it follows that in a finite projective plane every line is incident with the same 
number of points, and every point is incident with the same number of lines. 

If φ is any statement about a projective plane formulated in terms of “point”,
“line” and “incidence” then the statement φ*, formed from φ by interchanging
the words “point” and “line”, is called the dual statement (with respect to
“point” and “line”) of φ. A statement φ is self-dual if φ = φ*. A theorem
about projective planes formulated in terms of the notions “point”, “line” and 
“incidence” is a projective validity if it is true in the class of all projective 
planes, i.e. it is derivable from the axioms for projective planes. One reason 
why projective planes are interesting is the following “two for the price of one” 
result (see e.g., Mihalek, 1972, Hughes and Piper, 1973, Batten, 1986): 


THEOREM 7.14 (DUALITY PRINCIPLE FOR PROJECTIVE PLANES) Let
φ be a projective validity. Then the dual φ* of φ is also a projective validity.


To prove the duality principle it suffices to note that the duals of the axioms 
for projective planes provide an equivalent axiomatization, and hence the “dual” 
of every proof in the (first-order) theory of projective planes is a proof in that 
theory, too. For instance, it follows from the duality principle that in every 
projective plane there are at least four lines, no three of them incident with the
same point. The reader is also referred to the self-dual axiomatizations of Esser,
1951, Esser, 1973, Kordos, 1982, Menger, 1948, Menger, 1950.
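
The duality principle can be tried out on a small finite model. The following is a minimal sketch, assuming one standard labelling of the seven-point plane (the names `POINTS`, `LINES`, `dual` and the two checker functions are illustrative and not taken from the text): it swaps the roles of points and lines and checks that the result is again a projective plane, and that the dual of axiom PS2 holds.

```python
from itertools import combinations

# Assumed labelling of the 7-point projective plane: points 0..6, lines as point sets.
POINTS = set(range(7))
LINES = [frozenset(l) for l in
         [(0, 1, 2), (0, 3, 4), (0, 5, 6), (1, 3, 5), (1, 4, 6), (2, 3, 6), (2, 4, 5)]]


def dual(points, lines):
    """Swap points and lines: old line number i becomes the new point i,
    and each old point p becomes the new line {i : p lies on old line i}."""
    new_points = set(range(len(lines)))
    new_lines = [frozenset(i for i, l in enumerate(lines) if p in l)
                 for p in sorted(points)]
    return new_points, new_lines


def is_projective_plane(points, lines):
    # Any two distinct points lie on exactly one common line.
    two_points = all(sum(1 for l in lines if p in l and q in l) == 1
                     for p, q in combinations(points, 2))
    # Any two distinct lines meet in exactly one point.
    two_lines = all(len(l & m) == 1 for l, m in combinations(lines, 2))

    # PS2: there are four points, no three of which are collinear.
    def no_three_collinear(quad):
        return all(not any(set(t) <= l for l in lines)
                   for t in combinations(quad, 3))

    ps2 = any(no_three_collinear(q) for q in combinations(points, 4))
    return two_points and two_lines and ps2


def four_lines_no_three_concurrent(lines):
    # Dual of PS2: four lines, no three of them through a common point.
    return any(all(not (a & b & c) for a, b, c in combinations(quad, 3))
               for quad in combinations(lines, 4))


DUAL_POINTS, DUAL_LINES = dual(POINTS, LINES)
```

Here `is_projective_plane(DUAL_POINTS, DUAL_LINES)` and `four_lines_no_three_concurrent(LINES)` both evaluate to true, which is what the duality principle predicts for this particular model; it is of course an illustration on one plane, not a proof.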

To illustrate the power of the duality principle, consider the following
combinatorial example. Suppose we have some projective plane with the property
that every line contains n points. Now fix any line l and any point P not on l.
Then every point X on l determines a line PX, and since l contains n points
there must be n distinct lines of the form PX. Furthermore, note that every
point in the plane other than P lies on exactly one of these lines PX, while P
lies on all of them. If all these lines were disjoint (as sets of points) then
there would be n² points in total, but since the point P is counted n times, the
total number of points in the plane is n² − (n − 1) = n² − n + 1. Thus, we have
just shown that the following is a projective validity:


■ If every line is incident with n points, then there are n² − n + 1 points
in the entire plane.


By the duality principle, we can conclude the dual of this result: 


■ If every point is incident with n lines, then there are n² − n + 1 lines in
the entire plane.


Finally, note that axiom PS2 implies that every line in a projective plane
is incident with at least 3 points, and therefore, by the result above, the smallest
projective plane, known as the Fano plane and shown in Fig. 7.1, has 7 points and 7
lines.








Figure 7.1. 
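
The counting argument can be replayed on the Fano plane itself. The sketch below assumes one standard labelling of its points and lines (the names `LINES`, `line_through` and `pencil` are illustrative, not from the text); it fixes a line l and a point P off l, forms the lines PX for X on l, and checks that they cover every point and that the count n² − (n − 1) comes out right for n = 3.

```python
# Assumed labelling of the Fano plane: points 0..6, lines as 3-element point sets.
POINTS = range(7)
LINES = [frozenset(l) for l in
         [(0, 1, 2), (0, 3, 4), (0, 5, 6), (1, 3, 5), (1, 4, 6), (2, 3, 6), (2, 4, 5)]]


def line_through(p, q):
    """The unique line containing the distinct points p and q."""
    (l,) = [l for l in LINES if p in l and q in l]
    return l


n = len(LINES[0])                          # number of points on a line
l = LINES[0]                               # the fixed line l
P = next(p for p in POINTS if p not in l)  # some point not on l

# The lines PX for X on l: n distinct lines through P.
pencil = {line_through(P, X) for X in l}

# Every point of the plane lies on one of them.
covered = set().union(*pencil)

# Counting with multiplicity gives n*n; P was counted n times instead of once.
total_points = sum(len(m) for m in pencil) - (n - 1)
```

With this labelling `n` is 3, `pencil` contains 3 lines, and `total_points` equals both `len(covered)` and n² − n + 1.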


5.7 Affine spaces and planes 


DEFINITION 7.15 An affine structure is a linear space $\mathfrak{L}$ = (Po, Li, I) in
which the relation of weak parallelism $\mathrel{/\!/}$ is an equivalence relation.

An affine space is an affine structure with at least 3 non-collinear points, in 
which the following axiom (Euclid’s Fifth postulate) holds: 


V: Given a line x and a point X not on x, there is a unique line through X
that is (strictly) parallel to x, denoted by p(X, x).

(See also Lenz, 1954, Lenz, 1989.)
A slightly more general, but essentially equivalent, definition is: 


V’ : Given a line x and a point X, there is a unique line through X that is 
weakly parallel to x, denoted by p(X, x). 


An (affine) subspace of an affine space $\mathfrak{L}$ is any linear subspace of $\mathfrak{L}$
containing 3 non-collinear points and satisfying V itself, i.e. closed under the
operation p.

Examples of affine spaces include the usual Euclidean plane and 3D-space, 
but not the open disc in the Euclidean plane or space, as the axiom V fails there. 
An example of a finite affine space is the tetrahedron. For other examples, see 
e.g. Batten, 1986 and Coxeter, 1969. 

Note that, usually, the literature on affine and projective geometry deals only 
with affine planes, and only occasionally introduces higher-dimensional affine 
spaces. Thus, our definition is somewhat more general, as it is based neither on 
coordinatization of the space, nor on the earlier-defined notion of affine plane. 
A small price to pay for that generality was the adjustment of the definitions of 
parallelism and planarity in the special case of meagre spaces. 

Planes and planarity can be re-defined in non-meagre affine spaces using the 
following observations. Any “triangle” (three non-collinear points) together 
with the three lines determined by these points, must define a plane. By the 
Fifth Postulate, every line in that plane must be incident with at least two of 
these three lines, i.e. every line is determined by a pair of different points, 
each incident with some of these lines. Furthermore, every point in that plane 
belongs to at least one line constructed in this way, e.g. any line determined by 
that point and a vertex of the original triangle, which intersects the side opposite 
to that vertex (there will be at least one, by planarity). Turning these observations
around, we obtain the following definition: a plane in an affine space $\mathfrak{L}$ is an
incidence structure G constructed as follows: take three non-collinear points
$P_1, P_2, P_3$ in $\mathfrak{L}$ and the lines in $\mathfrak{L}$ determined by them, say $x_1, x_2, x_3$, where
$x_i = l(P_j, P_k)$, for i, j, k pairwise different. Let P be the set of points in $\mathfrak{L}$
incident with at least one of the lines $x_1, x_2, x_3$. Then the lines in G are exactly
those lines in $\mathfrak{L}$ incident with at least two different points from P, and the points
in G are those points in $\mathfrak{L}$ incident with at least one of these lines. It is not
difficult to see that every such plane is an affine subspace of $\mathfrak{L}$, which will be
called the (affine) plane in the space $\mathfrak{L}$ generated by the points $P_1, P_2, P_3$. In
particular, every plane in an affine space is closed under the affine operations of 
taking lines through two points, intersections of lines, and construction of lines 
passing through a given point and parallel to a given line, based on the axiom V. 

Now, an affine space is called planar, or an affine plane, if it coincides with
some plane in it. It is easy to show (see e.g. Batten, 1986) that every affine space
satisfies the exchange property, and hence all bases in an affine space have the
same cardinality. Rank and dimension are introduced as in projective spaces.
As for the tetrahedron, despite being of dimension 3, we have a good excuse (to 
become clear further) to consider it an affine plane as well, and that is the main 
reason to adjust the definition of parallelism and planarity for meagre spaces. 
In fact (see Batten, 1986, Sec. 4.1), it is the only affine plane of dimension more 
than 2. Moreover, as shown there, every line in a finite affine space is incident 
with the same number of points, and vice versa. 

Thus, if an affine space is planar then the relation of non-incidence of lines 
is pseudo-transitive, hence it is a relation of strict parallelism. To summarize: 


PROPOSITION 7.16 A non-meagre affine space $\mathfrak{L}$ is an affine plane iff it has
dimension 2 iff the relations of non-incidence and strict parallelism between
lines in $\mathfrak{L}$ coincide.


5.8 Relationship between affine and projective planes 


Affine planes do not have the duality property. For example, the dual of 
LS1 does not hold as it violates Euclid’s Parallel Postulate V. Still, there is an 
intimate relationship between affine planes and projective planes, given by the 
following two theorems, the proofs of which can be found e.g. in Gemignani, 
1971, Hughes and Piper, 1973, Mihalek, 1972. 


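THEOREM 7.17 Let $\mathfrak{P}$ = (Po, Li, I) be a projective plane and let $l^*$ be a
line in Li. Define $\mathrm{Po}^- := \{X \in \mathrm{Po} : X \notin l^*\}$, $\mathrm{Li}^- := \mathrm{Li} \setminus \{l^*\}$ and
$I^- := I|_{\mathrm{Po}^- \times \mathrm{Li}^-}$. Then the structure $\mathfrak{P}^- = (\mathrm{Po}^-, \mathrm{Li}^-, I^-)$ is an affine
plane, called the (deletion) affine subgeometry of $\mathfrak{P}$ induced by $l^*$.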


THEOREM 7.18 Let $\mathfrak{A}$ = (Po, Li, I) be any affine plane and let $l^*$ be any set,
disjoint with Po and Li, and of the same cardinality as the number of parallel
classes in $\mathfrak{A}$. To every parallel class $[l]$ in $\mathfrak{A}$, assign some distinct element
$P_{[l]} \in l^*$. Define


1. $\mathrm{Po}^+ := \mathrm{Po} \cup l^*$;
2. $\mathrm{Li}^+ := \mathrm{Li} \cup \{l^*\}$;
3. $I^+ := I \cup \{(P_{[l]}, x) : x \in [l]\} \cup \{(P_{[l]}, l^*) : P_{[l]} \in l^*\}$.


Then the structure $\mathfrak{A}^+ = (\mathrm{Po}^+, \mathrm{Li}^+, I^+)$, which we will call the projective
extension of $\mathfrak{A}$, is a projective plane.


Thus, affine and projective planes are separated by a single line, the so-called 
“line at infinity”. The tetrahedron and Fano plane (Fig. 7.1) illustrate the latter 
two results. It is because of the Fano plane that we insisted the tetrahedron, 
being its deletion subgeometry, should be an affine plane. 
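
The two constructions can be carried out mechanically on the Fano plane. The following is a minimal sketch under stated assumptions: lines are represented as sets of points, the labelling of the Fano plane is one standard choice, and the names `delete_line`, `parallel_classes` and `projective_extension` are illustrative rather than taken from the text.

```python
from itertools import combinations

# Assumed labelling of the Fano plane: points 0..6, lines as point sets.
FANO_LINES = [frozenset(l) for l in
              [(0, 1, 2), (0, 3, 4), (0, 5, 6), (1, 3, 5), (1, 4, 6), (2, 3, 6), (2, 4, 5)]]


def delete_line(lines, l_star):
    """Construction of Theorem 7.17: drop the line l* together with all its points."""
    points = set().union(*lines) - l_star
    new_lines = [l - l_star for l in lines if l != l_star]
    return points, new_lines


def parallel_classes(lines):
    """Group lines by weak parallelism (equal or disjoint).
    Assumes the input is an affine plane, so that this is an equivalence relation."""
    classes = []
    for l in lines:
        for cls in classes:
            if all(l == m or not (l & m) for m in cls):
                cls.append(l)
                break
        else:
            classes.append([l])
    return classes


def projective_extension(points, lines):
    """Construction of Theorem 7.18: one new point per parallel class, plus the
    new line l* consisting of exactly those new points."""
    classes = parallel_classes(lines)
    new_pts = [("inf", i) for i in range(len(classes))]
    ext_lines = [l | {new_pts[i]} for i, cls in enumerate(classes) for l in cls]
    ext_lines.append(frozenset(new_pts))
    return set(points) | set(new_pts), ext_lines


tetra_points, tetra_lines = delete_line(FANO_LINES, FANO_LINES[0])
ext_points, ext_lines = projective_extension(tetra_points, tetra_lines)
```

Deleting a line leaves 4 points and 6 two-point lines (the tetrahedron), falling into 3 parallel classes; extending it again yields 7 points and 7 lines, any two of which meet in exactly one point.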
Note that the constructions between affine and projective planes described
above are mutually inverse, up to isomorphism. These constructions can be
described in logical terms, relating the first-order theories of the affine and
projective planes. On the one hand, every affine plane is first-order interpretable
into its projective extension in an obvious way; on the other hand, the first-order
theory of a projective plane can be reduced to the first-order theory of its affine
subgeometry. Consequently, given a class of projective planes, its elementary
theory is decidable iff the elementary theory of the class of respective affine
subgeometries is decidable, too. Likewise, the elementary theory of a class of
affine planes is decidable iff the elementary theory of the class of respective
projective extensions is decidable, too.


6. Coordinatization 


In this section we give an overview of the coordinatization and subsequent
algebraization of affine planes. We will introduce a special class of algebraic 
structures called “ternary rings”, the elements of which can serve as coordi- 
nates of points in the plane. It will turn out that affine planes and ternary 
rings are inter-definable in the sense that from every affine plane one can 
extract a ternary ring while every ternary ring gives rise to an affine plane. 
In fact, these constructions are essentially logical (first-order) interpretations, 
which thus relate their first-order theories. In particular, we will see that nat- 
ural and important geometric properties of affine planes, viz. Desargues’ and 
Pappus’ properties, correspond to natural algebraic properties in these ternary 
rings. Furthermore, we will demonstrate the interaction between special di- 
lations of affine planes and the properties of Desargues and Pappus, and will 
discuss the logical consequences of the coordinatization. In particular, we 
will extract the axiomatizations and (un)decidability of the first-order theories 
of some important affine planes and classes of planes from their associated 
coordinate rings. 

The method of coordinatization applies likewise to projective planes, and 
most of the results obtained below have close projective analogues. Since both 
constructions are very similar, we will only present here coordinatization of 
affine planes. For a more detailed account of the coordinatization of affine and 
projective planes and the relationships (with proofs) between geometric and 
algebraic properties, the reader is referred to Blumenthal, 1961, Artin, 1957, 
Heyting, 1963, Szmielew, 1983, Mihalek, 1972, Hughes and Piper, 1973, etc. 


6.1 Coordinate systems in affine planes 


We adopt weak parallelism in the discussion for the rest of this section. 
Let any affine plane $\mathfrak{A}$ = (Po; Li; I) be given. The following procedure
assigns coordinates to the plane.

■ Take any triplet of non-collinear points O, X and Y. The point O will
be called the origin and the triplet OXY will be called the coordinate
system.

■ Let I be the point of intersection of the line in the parallel class of OX
containing Y, with the line in the parallel class of OY containing X. The
point I will be called the unit point while the lines OX, OY and OI will
be called respectively the x-axis, y-axis and unit line.

■ Let Γ be any abstract set, containing elements 0 and 1, of the same
cardinality as the number of points on the unit line. In fact, since all
lines in an affine plane contain the same number of points, Γ can have
the cardinality of any line in the plane. We call the set Γ the coordinate
set.

■ Now let φ be any bijection between the points on the unit line and Γ such
that φ(O) = 0 and φ(I) = 1.


Points in the plane are assigned coordinates consisting of ordered pairs from
Γ² in the following manner:

■ If P is a point on the unit line and φ(P) = p then the coordinates of P
are (p, p).

■ Let P be any point not on the unit line. Suppose the line in the parallel
class of the y-axis, containing P, intersects the unit line in the point with
coordinates (a, a), and suppose that the line in the parallel class of the
x-axis, containing P, intersects the unit line in the point with coordinates
(b, b). Then the coordinates of P are (a, b) (refer to Fig. 7.2).


Figure 7.2. (Labels: x-axis, y-axis, unit line.)
Figure 7.3. (Labels: O, x-axis, y-axis, unit line, line of slopes.)


The point P with coordinates (x, y) will be denoted as P(x, y), and points
will be identified with their coordinates (P(x, y) may also refer to the point
of intersection of lines x and y, but the context should make it clear what the
intended meaning is). For example, the points O, X, Y and I can be given
simply as (0, 0), (1, 0), (0, 1) and (1, 1). The value x is called the abscissa of
P and y is called the ordinate of P. Any line not in the parallel class of the
y-axis will intersect the y-axis in some point (0, c). This value c is called the
y-intercept of the line.

Next we define the slope of a line, using what will be called the line of slopes,
which is the line in the parallel class of the y-axis passing through the unit point I.
Every point on the line of slopes will have coordinates (1, m) for some m ∈ Γ.
There are two types of line to consider:


1. If l is a line parallel to the y-axis, its slope is left undefined.


2. Consider any line l not in the parallel class of the y-axis. There will be a
unique line l′ parallel to l and incident with O. This line l′ will intersect
the line of slopes in some point (1, m). The slope of l is defined as the
value m (refer to Fig. 7.3).


Thus the slope of the x-axis is 0 and the slope of the unit line is 1.

The equation of a line is any equation formulated in terms of variables x 
and y such that all and only those points (x, y) belonging to the line satisfy 
the equation. For example, the line parallel to the x-axis with y-intercept b has 
equation y = b, and the line parallel to the y-axis intersecting the x-axis in the 
point (a, 0) has equation x = a. The unit line has equation y = x. But we
need further algebraic machinery to describe the equations of lines other than
these trivial examples. This is provided by the operation T : Γ³ → Γ, defined
as follows. Let the triple of values (m, a, c) ∈ Γ³ be given. To compute the
value of T(m, a, c) consider the line l of slope m and y-intercept c. Then l
intersects the line parallel to the y-axis, and containing the point (a, 0), in the
point (a, T(m, a, c)) (refer to Fig. 7.4).


Figure 7.4. (Labels: O, x-axis, y-axis, the points (a, 0) and (a, T(m, a, c)).)

T enables us to obtain an equation for any line in the plane. If the line
has undefined slope, it has an equation x = a for some a ∈ Γ, while the
line with slope m and y-intercept c has equation y = T(m, x, c). The above
definition of T uses a geometric construction, but T can also be described purely
algebraically using the structure of a ternary ring.


6.2 Ternary rings and coordinate systems 


DEFINITION 7.19 A ternary ring (also known as a Hall planar ternary ring,
in reference to Marshall Hall Jr.) is an algebraic structure $\mathfrak{T}$ = (F; 0, 1, T)
consisting of a set F together with distinguished elements 0, 1 ∈ F and a
ternary operation T on F subject to the following axioms:


T0: 0 ≠ 1

T1: T(a, 1, 0) = a

T2: T(1, b, 0) = b

T3: T(a, 0, c) = c

T4: T(0, b, c) = c

T5: T(a, b, x) = d has a solution for x

T6: If T(a, b, c) = T(a, b, d) then c = d

T7: If b ≠ b′ then the simultaneous equations T(x, b, y) = d and
T(x, b′, y) = d′ have a solution for x and y.

T8: If a ≠ a′ then T(a, x, c) = T(a′, x, c′) has a solution for x

T9: For b ≠ b′, if T(a, b, c) = T(a′, b, c′) and T(a, b′, c) = T(a′, b′, c′)
then a = a′ and c = c′


It is easy to see that the ternary operation T defined geometrically above
satisfies all the axioms T0 – T9, hence we have the following important result.


THEOREM 7.20 Let the affine plane $\mathfrak{A}$ be coordinatized with coordinate
system OXY and coordinate set Γ and let T be the resulting ternary operation on
Γ. Then the structure (Γ; 0, 1, T) is a ternary ring.


The ternary ring (Γ; 0, 1, T) above will be called a (coordinate) ring attached
to the plane $\mathfrak{A}$ (by means of the coordinate system OXY), denoted $\mathfrak{T}_{OXY}(\mathfrak{A})$.
Given the coordinate system OXY, there is only one, up to isomorphism,
ternary ring attached to the plane by means of OXY. A ternary ring $\mathfrak{T}$ is said
to be attached to the affine plane $\mathfrak{A}$ provided there is some coordinate system
OXY such that $\mathfrak{T} \cong \mathfrak{T}_{OXY}(\mathfrak{A})$.

A converse to the last theorem is also true.


THEOREM 7.21 For any ternary ring $\mathfrak{T}$ = (F; 0, 1, T), the plane $\mathfrak{A}(\mathfrak{T})$ with
point universe F², and line universe consisting of all sets of the form {(a, y) :
y ∈ F} and {(x, T(m, x, c)) : x ∈ F} for every a, m, c ∈ F, is an affine
plane, called the affine plane over the ternary ring $\mathfrak{T}$.
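
The construction of the plane over a ternary ring is easy to carry out for a small finite example. The sketch below is an illustration under an assumption not made in the theorem itself: it takes F to be the integers modulo 3 with T(m, x, c) = m·x + c computed modulo 3. The function names are hypothetical. It builds the point and line universes as in the theorem and then tests, by brute force, the defining properties of an affine plane (strict parallelism of lines is tested as disjointness, which is adequate in a plane).

```python
from itertools import combinations, product


def affine_plane(F, T):
    """Points F x F; lines {(a, y) : y in F} and {(x, T(m, x, c)) : x in F}."""
    points = set(product(F, repeat=2))
    vertical = [frozenset((a, y) for y in F) for a in F]
    sloped = [frozenset((x, T(m, x, c)) for x in F) for m in F for c in F]
    return points, vertical + sloped


def is_affine_plane(points, lines):
    # Any two distinct points lie on exactly one line.
    unique_line = all(sum(1 for l in lines if p in l and q in l) == 1
                      for p, q in combinations(points, 2))
    # Axiom V: through a point X off a line x passes exactly one line missing x.
    parallels = all(sum(1 for m in lines if X in m and not (m & x)) == 1
                    for x in lines for X in points - x)
    # There are three non-collinear points.
    triangle = any(not any({p, q, r} <= l for l in lines)
                   for p, q, r in combinations(points, 3))
    return unique_line and parallels and triangle


# Assumed example: integers modulo 3 with the linear operation m*x + c.
F3 = range(3)
plane_points, plane_lines = affine_plane(F3, lambda m, x, c: (m * x + c) % 3)
```

For this choice the plane has 9 points and 12 lines of 3 points each, and `is_affine_plane(plane_points, plane_lines)` holds.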


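The constructions given in the two theorems above are inverse in the
following sense. Let an affine plane $\mathfrak{A}$ be given and fix some coordinate system
OXY. Then $\mathfrak{A}(\mathfrak{T}_{OXY}(\mathfrak{A})) \cong \mathfrak{A}$. Let a ternary ring $\mathfrak{T}$ = (F; 0, 1, T) be given.
Then $\mathfrak{T}_{(0,0)(1,0)(0,1)}(\mathfrak{A}(\mathfrak{T})) \cong \mathfrak{T}$.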

If two ternary rings are isomorphic then so will be the affine planes over 
those ternary rings. But surprisingly, there are non-isomorphic ternary rings 
such that the affine planes over those ternary rings are still isomorphic. In 
particular, coordinatizing an affine plane with different coordinate systems may 
sometimes give rise to non-isomorphic ternary rings attached to the same plane, 
and later on we will give a sufficient condition for uniqueness of the coordinate 
rings. 

Given a ternary ring (F; 0, 1, T), addition + and multiplication · are defined
on F as follows:

$$a + b = T(1, a, b); \qquad a \cdot b = T(a, b, 0).$$

The structure (F; 0, 1, +, ·) thus formed is actually a double loop. Hence the
class of double loops contains the class of ternary rings.
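
As a small illustration of how + and · are read off from T, the sketch below assumes the ternary operation T(a, b, c) = a·b + c on the integers modulo 5 (an assumed example, not one from the text) and checks the two loop structures by brute force. The helper names are hypothetical, and "loop" is taken here in the usual sense of a set with an identity element whose operation table is a Latin square.

```python
def derived_operations(T):
    """Addition and multiplication induced by a ternary operation T."""
    def add(a, b):
        return T(1, a, b)

    def mul(a, b):
        return T(a, b, 0)

    return add, mul


def is_loop(elems, op, identity):
    """Identity element, and every row and column of the table is a permutation."""
    elems = sorted(elems)
    has_identity = all(op(identity, a) == a == op(a, identity) for a in elems)
    rows = all(sorted(op(a, x) for x in elems) == elems for a in elems)
    cols = all(sorted(op(y, a) for y in elems) == elems for a in elems)
    return has_identity and rows and cols


# Assumed example: T(a, b, c) = a*b + c on the integers modulo 5.
P = 5
add, mul = derived_operations(lambda a, b, c: (a * b + c) % P)

additive_loop = is_loop(range(P), add, 0)            # (F, +) with identity 0
multiplicative_loop = is_loop(range(1, P), mul, 1)   # (F \ {0}, .) with identity 1
```

In this example the derived operations are simply addition and multiplication modulo 5, so both checks succeed; for a non-linear ternary ring the same functions would extract its (possibly non-associative) loops.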

The geometric analogue to addition is the translation of a line in the plane,
that of multiplication is the rotation of a line in the plane. To calculate a + b
proceed as follows. Take the points A(a, a) and B(b, b) that lie on the unit line.
Intersect the line parallel to the x-axis containing B with the y-axis to obtain
the point Q(0, b). Then take the line parallel to the unit line containing the point
Q and intersect it with the line parallel to the y-axis containing the point A to
obtain the point P(a, c). The line parallel to the x-axis containing P intersects
the unit line in the point C(c, c). We define C = A + B and c = a + b (refer
to Fig. 7.5). To calculate a · b proceed as follows. Take the points A(a, a)
and B(b, b) that lie on the unit line. Intersect the line parallel to the x-axis
containing A with the line of slopes to obtain the point Q(1, a). Then the line
parallel to the y-axis containing the point B will intersect the line OQ in some
point P. Intersect the line parallel to the x-axis containing P with the unit line
to obtain the point C(c, c). We define C = A · B and c = a · b (refer to Fig. 7.6).

We are now able to give the linear equations of two more classes of lines.
The line of slope 1 with y-intercept c will have equation y = x + c while the
line of slope m and y-intercept 0 will have equation y = m · x.

Figure 7.5. Figure 7.6. (Labels in both: O, x-axis, y-axis, unit line.)

Call a left division ring strong if it satisfies the additional property

$$m_1 \neq m_2 \;\Rightarrow\; \forall c_1 \forall c_2 \exists x\, (m_1 \cdot x + c_1 = m_2 \cdot x + c_2);$$

informally, lines with different directions intersect. We can define a ternary
operation T in the strong left division ring (F; 0, 1, +, ·) as T(a, b, c) := a · b + c.
Then it turns out that (F; 0, 1, T) will be a ternary ring. Hence the class of
ternary rings contains the class of strong left division rings. It is easy to see that
every division ring (with full distributivity) is strong. Let $\mathcal{T}$, $\mathcal{LD}$, $\mathcal{D}$, $\mathcal{SF}$ and $\mathcal{F}$
denote respectively the classes of ternary rings, strong left division rings, division
rings, skew fields and fields. Then we have the following chain of inclusions:

$$\mathcal{T} \supset \mathcal{LD} \supset \mathcal{D} \supset \mathcal{SF} \supset \mathcal{F}.$$
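
For a finite carrier the ternary ring axioms can be checked exhaustively. The following is a minimal sketch, assuming the axioms T0 to T9 of Definition 7.19 in their usual reading (in particular T6 as uniqueness of the solution in T5, T8 with independent constants c and c′, and T9 as uniqueness of the solution in T7) and using the operation T(a, b, c) = a·b + c on integers modulo n as an assumed test case. The function name is hypothetical.

```python
from itertools import product


def is_ternary_ring(F, T, zero=0, one=1):
    F = sorted(F)
    t0 = zero != one
    # T1 - T4: behaviour of T on the distinguished elements.
    t1_4 = (all(T(a, one, zero) == a and T(one, a, zero) == a for a in F)
            and all(T(a, zero, c) == c and T(zero, a, c) == c
                    for a, c in product(F, repeat=2)))
    # T5 + T6: for fixed a, b the map x -> T(a, b, x) is a bijection of F.
    t5_6 = all(sorted(T(a, b, x) for x in F) == F
               for a, b in product(F, repeat=2))
    # T7 + T9: for b != b2 the map (x, y) -> (T(x, b, y), T(x, b2, y)) is a
    # bijection of F x F (on a finite set, injective iff surjective).
    t7_9 = all(len({(T(x, b, y), T(x, b2, y))
                    for x, y in product(F, repeat=2)}) == len(F) ** 2
               for b, b2 in product(F, repeat=2) if b != b2)
    # T8: for a != a2 the equation T(a, x, c) = T(a2, x, c2) has a solution x.
    t8 = all(any(T(a, x, c) == T(a2, x, c2) for x in F)
             for a, a2, c, c2 in product(F, repeat=4) if a != a2)
    return t0 and t1_4 and t5_6 and t7_9 and t8


def linear_T(n):
    """T(a, b, c) = a*b + c computed modulo n (an assumed example)."""
    return lambda a, b, c: (a * b + c) % n


ring_mod_5 = is_ternary_ring(range(5), linear_T(5))
ring_mod_4 = is_ternary_ring(range(4), linear_T(4))
```

Modulo the prime 5 the integers form a field, and the check succeeds; modulo 4 they do not, and the check fails (for instance 2·x = 1 has no solution), which matches the role of the "lines with different directions intersect" condition.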








6.3 The properties of Desargues and Pappus 


The theorems of Desargues and Pappus, known from Euclidean geometry, 
turn out to hold in a more general, affine setting. While retaining their ele- 
mentary nature, these properties of the Euclidean plane will lose their status as 
theorems when taken in arbitrary affine planes, as they may or may not hold 
true depending on the affine plane concerned. The Desargues and Pappus prop- 
erties deal with configurations of six points, in the former case lying in pairs on 
three lines, and in the latter case lying in triples on two lines. We distinguish 
cases where the lines (i) are parallel, or (ii) are mutually incident. The specific 
geometric properties thus described will correspond to specific classes of the 
algebraic structures described above. 


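DEFINITION 7.22 An affine plane satisfies the First Desargues Property D1,
if the following holds (see Fig. 7.7):

$$D_1 :\ (\neg\mathrm{Col}(AA'B) \wedge \neg\mathrm{Col}(AA'C) \wedge AA' \mathrel{/\!/} BB' \wedge AA' \mathrel{/\!/} CC' \wedge AB \mathrel{/\!/} A'B' \wedge AC \mathrel{/\!/} A'C') \rightarrow BC \mathrel{/\!/} B'C'.$$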


Figure 7.7.

The Euclidean plane satisfies D1. As an example (see Blumenthal, 1961) of
a plane that does not satisfy D1, consider the real plane R² with lines modified
as follows. Any line of either undefined slope or non-positive slope will be
left unaltered, but any line y = mx + c with strictly positive slope m > 0 is
changed to the union of the two rays

$$y = mx + c \ \text{ when } y \le 0, \qquad y = \tfrac{1}{2}mx + \tfrac{1}{2}c \ \text{ when } y \ge 0$$

(the two expressions agree on the x-axis, where mx + c = 0).

It is easy to see that this modified plane is affine. Fig. 7.8 gives a
configuration of points that falsifies D1.

Figure 7.8.
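
To make the modified lines concrete, the sketch below evaluates a bent line and finds the bent line through two points lying on opposite sides of the x-axis. It assumes the reading of the two rays given above (slope and intercept both halved in the upper half-plane); the function names are hypothetical, and exact rational arithmetic is used only for convenience.

```python
from fractions import Fraction


def bent_line_y(m, c, x):
    """y-value at x on the modified line obtained from y = m*x + c, with m > 0."""
    y = m * x + c
    return y if y <= 0 else y / 2


def bent_line_through(p, q):
    """Parameters (m, c) of the modified positive-slope line through p and q,
    where p lies strictly below the x-axis and q strictly above it, to the right of p."""
    (x1, y1), (x2, y2) = p, q
    assert y1 < 0 < y2 and x1 < x2
    # p satisfies y1 = m*x1 + c and q satisfies 2*y2 = m*x2 + c.
    m = Fraction(2 * y2 - y1, x2 - x1)
    c = y1 - m * x1
    return m, c


m, c = bent_line_through((0, -1), (3, 1))
```

For the points (0, −1) and (3, 1) this gives m = 1 and c = −1: the line runs along y = x − 1 up to the point (1, 0) on the x-axis and continues along y = ½x − ½ above it, whereas the ordinary Euclidean line through the same two points has slope 2/3.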


We have the following representation theorem. 


THEOREM 7.23 If an affine plane $\mathfrak{A}$ satisfies D1 then $\mathfrak{A} \cong \mathfrak{A}(\mathfrak{F})$ for some
strong left division ring $\mathfrak{F} \in \mathcal{LD}$. Conversely, if $\mathfrak{F} \in \mathcal{LD}$ is any strong left
division ring, then $\mathfrak{A}(\mathfrak{F})$ satisfies D1.


The First Desargues Property allows us to give the linear equation for any
line with slope m and y-intercept c. Say that a ternary ring (R; 0, 1, T) is linear
if T(a, b, c) = a · b + c for all a, b, c ∈ R (where + and · are interpreted in the
expanded structure (R; 0, 1, +, ·, T)).


THEOREM 7.24 An affine plane $\mathfrak{A}$ satisfies the property D1 if and only if every
ternary ring (Γ; 0, 1, T) attached to $\mathfrak{A}$ is linear.

If $\mathfrak{A}$ is an affine plane satisfying D1, then every line of undefined slope has
an equation of the form x = a while the line of slope m with y-intercept c has equation
y = m · x + c. This concludes the task of finding a linear equation for every
line of the plane.

Let P be the quaternary parallelogram relation defined by

$$P(ABCD) \Leftrightarrow AB \parallel CD \wedge AC \parallel BD.$$

The property D1 guarantees that P will be transitive in the sense

$$P(ABCD) \wedge P(ABEF) \Rightarrow P(CDEF)$$

where the points are taken so as to exclude obvious degenerate cases.


DEFINITION 7.25 An affine plane is said to satisfy the Second Desargues
Property D2 if the following holds (see Fig. 7.9):

$$D_2 :\ (\mathrm{Diff}_7(OABCA'B'C') \wedge \neg\mathrm{Col}(ABC) \wedge \neg\mathrm{Col}(A'B'C') \wedge \mathrm{Col}(OAA') \wedge \mathrm{Col}(OBB') \wedge \mathrm{Col}(OCC') \wedge BC \mathrel{/\!/} B'C' \wedge AC \mathrel{/\!/} A'C' \wedge AC \mathrel{/\!/} OB) \rightarrow AB \mathrel{/\!/} A'B'.$$

Figure 7.9. Figure 7.10.

D2 (also known in the German-language literature as Trapezdesargues)
endows the attached ternary ring with right distributivity.


THEOREM 7.26 Let an affine plane $\mathfrak{A}$ satisfy both the properties D1 and D2.
Then every ternary ring attached to $\mathfrak{A}$ is a division ring.

DEFINITION 7.27 An affine plane is said to satisfy the Third Desargues
Property D3 if the following holds (see Fig. 7.9 and Fig. 7.10):

$$D_3 :\ ((O \neq A, B, C, A', B', C') \wedge \neg\mathrm{Col}(AA'B) \wedge \neg\mathrm{Col}(AA'C) \wedge \mathrm{Col}(OAA') \wedge \mathrm{Col}(OBB') \wedge \mathrm{Col}(OCC') \wedge AB \mathrel{/\!/} A'B' \wedge AC \mathrel{/\!/} A'C') \rightarrow BC \mathrel{/\!/} B'C'.$$

Clearly D3 ⇒ D2 and it can also be shown that D3 ⇒ D1.
D3 endows the ternary ring attached to a plane with associative
multiplication. We have the following representation theorem.


THEOREM 7.28 If an affine plane $\mathfrak{A}$ satisfies D3 then $\mathfrak{A} \cong \mathfrak{A}(\mathfrak{F})$ for some
skew field $\mathfrak{F} \in \mathcal{SF}$. Conversely, if $\mathfrak{F} \in \mathcal{SF}$ is any skew field, then $\mathfrak{A}(\mathfrak{F})$
satisfies D3.


For any point O, let $T_O$ be the quaternary trapezium relation:

$$T_O(ABCD) \Leftrightarrow \mathrm{Col}(OAB) \wedge \mathrm{Col}(OCD) \wedge AC \parallel BD.$$

The property D3 guarantees that $T_O$ will be transitive in the sense

$$T_O(ABCD) \wedge T_O(ABEF) \Rightarrow T_O(CDEF)$$

where the points are taken so as to exclude obvious degenerate cases.

In particular, note that in any affine plane satisfying the Third Desargues
Property the midpoint of a line segment AB, being the intersection point of the
diagonals AB and CD of any parallelogram ACBD, is definable in terms of
A and B.


As shown in Szmielew, 1983, if the plane satisfies D3, then the coordinate
ternary ring attached to the plane is independent, up to isomorphism, of the
coordinate system used; equivalently, every skew field can be restored uniquely from
the affine plane over it. Formally:


THEOREM 7.29 (SZMIELEW, 1983, SEC. 4.5) If $\mathfrak{F}$, $\mathfrak{T}$ are skew fields such
that $\mathfrak{A}(\mathfrak{F}) \cong \mathfrak{A}(\mathfrak{T})$ then $\mathfrak{F} \cong \mathfrak{T}$.


COROLLARY 7.30 If $\mathfrak{A}$ satisfies D3 and OXY, O′X′Y′ are two coordinate
systems in $\mathfrak{A}$ then $\mathfrak{T}_{OXY}(\mathfrak{A}) \cong \mathfrak{T}_{O'X'Y'}(\mathfrak{A})$.

Hereafter, whenever $\mathfrak{A}$ satisfies D3 we will denote the unique coordinate ring
attached to $\mathfrak{A}$ by $\mathfrak{T}(\mathfrak{A})$.


DEFINITION 7.31 An affine plane satisfies the First Pappus Property P1 if the
following holds (see Fig. 7.11):

$$P_1 :\ (\mathrm{Col}(ABC) \wedge \mathrm{Col}(A'B'C') \wedge AB \parallel A'B' \wedge AB' \mathrel{/\!/} A'B \wedge AC' \mathrel{/\!/} A'C) \rightarrow BC' \mathrel{/\!/} B'C.$$


DEFINITION 7.32 An affine plane satisfies the Second Pappus Property P2 if
the following holds (see Fig. 7.12):

$$P_2 :\ ((O \neq A, B, C, A', B', C') \wedge \mathrm{Col}(OABC) \wedge \mathrm{Col}(OA'B'C') \wedge AB \neq A'B' \wedge AB' \mathrel{/\!/} A'B \wedge AC' \mathrel{/\!/} A'C) \rightarrow BC' \mathrel{/\!/} B'C.$$

Figure 7.11. Figure 7.12.


It can be shown that P2 ⇒ P1. A famous theorem by Hessenberg establishes
the implication P2 ⇒ D3. In fact, the following string of implications holds:

$$P_2 \Rightarrow D_3 \Rightarrow D_1 \Rightarrow P_1.$$

From algebraic considerations one can also derive

$$D_1 \nRightarrow D_3 \nRightarrow P_2.$$

For instance, the affine plane over the skew field of quaternions satisfies D3, but
not P2, because of Theorem 7.33 below. It remains an open problem whether
P1 implies (and hence is equivalent to) D1. See Szmielew, 1983 and Menghini,
1991 for further details.
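
The algebraic fact behind the quaternion example is that quaternion multiplication is associative but not commutative. A minimal sketch of the Hamilton product, with quaternions represented as 4-tuples (a representation chosen here only for illustration), makes the failure of commutativity explicit:

```python
def qmul(p, q):
    """Hamilton product of quaternions given as tuples (w, x, y, z) = w + xi + yj + zk."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,
            a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,
            a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,
            a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2)


I = (0, 1, 0, 0)
J = (0, 0, 1, 0)
K = (0, 0, 0, 1)

ij = qmul(I, J)   # equals k
ji = qmul(J, I)   # equals -k
```

Since i·j = k while j·i = −k, the coordinate ring is a skew field that is not a field.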

The property P2 endows the attached ternary ring with commutative
multiplication, and the following representation theorem holds.


THEOREM 7.33 If an affine plane $\mathfrak{A}$ satisfies P2 then $\mathfrak{A} \cong \mathfrak{A}(\mathfrak{F})$ for some
field $\mathfrak{F}$. Conversely, if $\mathfrak{F}$ is a field, then $\mathfrak{A}(\mathfrak{F})$ satisfies P2.


In this section we have followed the terminology from Blumenthal, 1961. 
Other names for the First and Third Desargues Properties, used in the literature 
are respectively the Minor, or Weak, and Major, or Strong, Desargues Prop- 
erties; likewise for the First and Second Pappus Properties (Szmielew, 1983). 
Hereafter, by “the Desargues Property” we will mean the Third Desargues Prop- 
erty, and by “the Pappus Property” we will mean the Second Pappus Property. 
Accordingly, we will speak about Desarguesian and Pappian affine planes. 

Finally, we note that the Desargues and Pappus properties of affine planes 
have precise analogues for projective planes, satisfying the same relationships 
with their algebraic counterparts. In fact, the projective versions of Desargues 
and Pappus properties are simpler, since they need not take into account the cases 
of parallel vs intersecting lines. For instance, all affine Desargues’ properties 
turn out to be particular cases in projective extensions of affine planes of the 
projective Desargues’ property which simply states that “If two triangles are
perspective from a point (meaning that the lines determined by the three pairs of
corresponding vertices are co-punctual), then they are perspective from a line 
(meaning that the three intersection points of the pairs of corresponding sides 
of the two triangles are collinear).” Actually, this property holds in a projective 
plane iff it can be embedded into a projective 3D-space. Likewise, the two 
affine Pappus properties are combined in one projective Pappus property. For 
more details, see e.g. Blumenthal, 1961, Hartshorne, 1967, Mihalek, 1972, 
Blumenthal and Menger, 1970, Hughes and Piper, 1973. 


6.4 Analytic geometry and affine transformations of 
affine planes over a field 


Affine planes with the Pappus Property are close enough to the real affine 
plane that one can introduce not only coordinatization, but even develop analytic 
geometry of points and lines in them. In fact, for most of what follows it suffices
to assume the Desargues Property, i.e. to consider planes $\mathfrak{A}(\mathfrak{F})$, where $\mathfrak{F}$ is a
skew field, but to avoid having to deal with the non-commutative multiplication,
we assume that $\mathfrak{F}$ is a field. Recall from Sec. 6.3 (see also e.g. Gemignani, 1971,
Sec. 3 or Blumenthal, 1961, Sec. V.9) that, given a coordinate system OXY in
such an affine plane $\mathfrak{A}(\mathfrak{F})$, any line l is determined by an equation y = ax + m
if not parallel to the line OY, otherwise by x = c, where a, m, c ∈ $\mathfrak{F}$ are fixed
parameters. In either case, the line has a general equation ax + by + c = 0
where at least one of a, b is not 0, and conversely, every such equation represents
a line in $\mathfrak{A}(\mathfrak{F})$ in the standard analytic geometric sense. Furthermore, a change
of the coordinate system to a new one O' X’Y", with coordinate axes O'X’ and 
O'Y” having equations in the old system respectively ajax + byy + c1 = 0 and 
a£ + boy + c2 = 0, leads to change of the coordinates (x, y) of a given point 
in the plane according to the following equations: 


\[ x' = u(a_1 x + b_1 y + c_1), \qquad y' = v(a_2 x + b_2 y + c_2), \]


where u = (a₁e_x + b₁e_y + c₁)⁻¹ and v = (a₂e_x + b₂e_y + c₂)⁻¹, and (e_x, e_y) 
are the coordinates of the new unit point I′ in the old coordinate system OXY. 
The non-parallelism of the new coordinate axes is analytically expressed by the 
condition 

\[ \begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix} = a_1 b_2 - a_2 b_1 \neq 0. \]


Thus, a change of the coordinate system in A(𝔉) can be represented (after 
multiplying out in the equations above) as a transformation of the plane, determined 
by affine equations: 


\[ \alpha(x) = ax + by + c, \qquad \alpha(y) = a'x + b'y + c', \]

where a, b, c, a′, b′, c′ ∈ 𝔉 are such that ab′ − a′b ≠ 0. Such a transformation is 
called an affine transformation, or an affinity. Conversely, we will see further 
that every affine transformation can be viewed as a change of the coordinate 
system. 
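
As an illustration of the affine equations above, here is a minimal Python sketch over the rational numbers (the choice of field, and the names `affine_map` and `collinear`, are assumptions made for this example and do not come from the text). It builds the map from its six coefficients and lets one check on sample points that collinearity is preserved.

```python
from fractions import Fraction as Fr

def affine_map(a, b, c, a1, b1, c1):
    """Return the map (x, y) -> (a*x + b*y + c, a1*x + b1*y + c1).

    a1, b1, c1 play the role of a', b', c'; the determinant a*b1 - a1*b
    must be non-zero for the map to be an affinity.
    """
    if a * b1 - a1 * b == 0:
        raise ValueError("a*b' - a'*b must be non-zero")
    return lambda p: (a * p[0] + b * p[1] + c, a1 * p[0] + b1 * p[1] + c1)

def collinear(p, q, r):
    """Three points are collinear iff the 2x2 determinant of q-p and r-p vanishes."""
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0]) == 0

# Sample data (hypothetical): one affinity and four points.
alpha = affine_map(Fr(2), Fr(1), Fr(3), Fr(1), Fr(1), Fr(-1))
P, Q, R = (Fr(0), Fr(0)), (Fr(1), Fr(2)), (Fr(3), Fr(6))   # collinear
S = (Fr(1), Fr(0))                                          # not on line PQ
```

Exact rational arithmetic is used so that the determinant test is not affected by rounding.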

It is easy to see that every affine transformation is a collineation on the 
plane. Moreover, using coordinatization, we can characterize explicitly all 
collineations in an affine plane A(𝔉) over a field. First, recall that collineations 
of linear spaces preserve incidence, and therefore parallelism, of lines, and 
collinearity of points. Actually, a bijection α of the points in the plane is a 
collineation iff it preserves line parallelism, i.e. if α maps the point P to the 
point P^α then: 


\[ AB \parallel CD \;\Rightarrow\; A^\alpha B^\alpha \parallel C^\alpha D^\alpha. \]


Now, consider a collineation α in A(𝔉) and let OXY be any coordinate 
system in A(𝔉). Since α preserves every line (as a set of points), in particular 
the unit line, it determines a bijection h : 𝔉 → 𝔉 by sending the point 
(x, x) of the unit line to the point (h(x), h(x)). Recall that addition and 
multiplication in 𝔉 were geometrically defined in Sec. 6.2 on the unit line by 
means of the “affine operations” (Sec. 5.7) of taking the intersection point of 
two lines, producing the line through two points, and producing the line parallel 
to a given line through a given point. These constructions are preserved by 
collineations, and therefore h is an automorphism of 𝔉. Now, for any point P 
with coordinates (x, y) in the system OXY, its image under α is the point P^α 
with coordinates (h(x), h(y)) in the system O′X′Y′, because the point P can 
be obtained (see Sec. 6.1) from the points (x, x) and (y, y) by affine operations. 

We can now obtain an explicit algebraic characterization of the collineation α. 
Suppose the images of O, X, Y under α are O′(c, c′), X′(a + c, a′ + c′), Y′(b + 
c, b′ + c′). (Note that such a, b, c, a′, b′, c′ ∈ 𝔉 always exist.) It is easy to check 
that O′, X′, Y′ are non-collinear iff ab′ ≠ a′b. Now, following the construction 
on Fig. 7.2 (or, as a standard exercise in linear algebra) one can compute the 
coordinates of α(P) in OXY: 


\[ \alpha(x) = a\,h(x) + b\,h(y) + c, \qquad \alpha(y) = a'h(x) + b'h(y) + c'. \]


Conversely, it is immediate to check that every mapping defined by such 
equations in a given coordinate system, where h is an automorphism of 𝔉 and 
ab′ − a′b ≠ 0, is a collineation. 

Thus, we have obtained the following (see Gemignani, 1971, Sec. 3): 


THEOREM 7.34 A mapping α in the plane A(𝔉), where 𝔉 is a field, is a 
collineation iff it can be defined in some coordinate system by equations 


\[ \alpha(x) = a\,h(x) + b\,h(y) + c, \qquad \alpha(y) = a'h(x) + b'h(y) + c', \]

where h is an automorphism of 𝔉 and a, b, c, a′, b′, c′ ∈ 𝔉 are such that 
ab′ ≠ a′b. 

Therefore, every collineation of A(𝔉) with a given associated automorphism h 
is uniquely determined by its action on any three non-collinear points O, X, Y 
in the plane, i.e. by their images O′, X′, Y′, and any mapping in A(𝔉) that 
sends the three non-collinear points O, X, Y respectively to three non-collinear 
points O′(c, c′), X′(a + c, a′ + c′), Y′(b + c, b′ + c′) can be extended, uniquely 
for each automorphism h, to a collineation of A(𝔉) defined by the equations above. 
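
The recipe behind this remark can be sketched in Python for the special case where h is the identity automorphism, so that the collineation is an affinity. The sketch assumes rational (or integer) coordinates, and the function name `affinity_from_images` is hypothetical. The coefficients are read off from the images O′(c, c′), X′(a + c, a′ + c′), Y′(b + c, b′ + c′) of the points O(0, 0), X(1, 0), Y(0, 1).

```python
from fractions import Fraction as Fr

def affinity_from_images(O1, X1, Y1):
    """Build the affinity sending O(0,0), X(1,0), Y(0,1) to O1, X1, Y1.

    Assumes the identity automorphism h; a1, b1, c1 stand for a', b', c'.
    """
    c, c1 = O1
    a, a1 = X1[0] - c, X1[1] - c1      # from X' = (a + c, a' + c')
    b, b1 = Y1[0] - c, Y1[1] - c1      # from Y' = (b + c, b' + c')
    if a * b1 - a1 * b == 0:
        raise ValueError("the three image points are collinear")

    def alpha(p):
        x, y = p
        return (a * x + b * y + c, a1 * x + b1 * y + c1)

    return alpha

# Sample (hypothetical) images of O, X, Y.
alpha = affinity_from_images((Fr(1), Fr(1)), (Fr(3), Fr(2)), (Fr(0), Fr(4)))
```

For a field with non-trivial automorphisms the same coefficients would be combined with h(x) and h(y), as in the equations of Theorem 7.34.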


We now see that affine transformations form a special case of collineations, 
corresponding to the identity automorphism of 𝔉. 

Note that the affinities of a plane form a subgroup of its group of collineations. 
In the case when 𝔉 is rigid, i.e. has no non-trivial automorphisms, as is the field 
of reals R, every collineation in A(𝔉) is an affinity, but in general this need 
not be the case, e.g. (see Gemignani, 1971, Sec. 3.2) the complex conjugation 
mapping h(z) = z̄ is an automorphism of the field of complex numbers C, and 
therefore any collineation of A(C) associated with h, e.g. (x, y) ↦ (x̄, ȳ), is 
not an affinity. 
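
The conjugation example can be probed numerically. The following Python sketch (the names `conj_map` and `collinear` and the sample points are assumptions of this illustration) uses complex numbers with small integer parts, for which the arithmetic is exact. The map preserves collinearity on the sample line and fixes O(0, 0), X(1, 0), Y(0, 1), yet it moves the point (i, 0), whereas the only affinity fixing O, X, Y is the identity.

```python
def conj_map(p):
    """The map (x, y) -> (conj(x), conj(y)) on the complex affine plane."""
    return (p[0].conjugate(), p[1].conjugate())

def collinear(p, q, r):
    """Determinant test for collinearity, valid over any field."""
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0]) == 0

# Three points on the line y = (1 + i) x + 2i (hypothetical sample data).
P = (complex(0, 0), complex(0, 2))
Q = (complex(1, 0), complex(1, 3))
R = (complex(0, 1), complex(-1, 3))

# The coordinate triangle and a point with a non-real coordinate.
O = (complex(0, 0), complex(0, 0))
X = (complex(1, 0), complex(0, 0))
Y = (complex(0, 0), complex(1, 0))
W = (complex(0, 1), complex(0, 0))
```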


A particular case of affine transformations is dilation (or dilatation). This is 
a collineation δ which sends every line to a parallel one, i.e., 


\[ A^\delta B^\delta \parallel AB. \]


The set of dilations of an affine plane 𝔄 will be denoted as Dil(𝔄). It can be easily 
shown (see Gemignani, 1971, Behnke et al., 1974, Coxeter, 1969, Hughes and 
Piper, 1973) that every dilation of A(𝔉) can be defined in a suitable coordinate 
system by equations 


\[ \alpha(x) = ax + c, \qquad \alpha(y) = ay + c', \]


for some a, c, c′ ∈ 𝔉 such that a ≠ 0. Therefore, if a dilation is different 
from the identity dilation ι (for which every point is a fixed point), then it 
has either no fixed points (if a = 1 and (c, c′) ≠ (0, 0)) or exactly one fixed 
point C((1 − a)⁻¹c, (1 − a)⁻¹c′) (if a ≠ 1); accordingly, it will be called respectively 
a translation or a homothety with center C. The set of all translations of 𝔄 will 
be denoted Tr(𝔄) while the set of homotheties with center C will be denoted 
Ht_C(𝔄); the set of all homotheties will be denoted Ht(𝔄). When the plane is 
fixed, we will sometimes omit it from these notations. The identity dilation is 
taken by definition as both a translation and a homothety with any point 
taken as its center. 
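
The case distinction above is easy to check by direct computation. The sketch below works over the rationals (an assumption of this example; `dilation` and `classify` are hypothetical helper names) and returns the center ((1 − a)⁻¹c, (1 − a)⁻¹c′) when the dilation is a homothety.

```python
from fractions import Fraction as Fr

def dilation(a, c, c1):
    """Return the dilation (x, y) -> (a*x + c, a*y + c1), with a != 0 (c1 stands for c')."""
    if a == 0:
        raise ValueError("a must be non-zero")
    return lambda p: (a * p[0] + c, a * p[1] + c1)

def classify(a, c, c1):
    """Classify a dilation and, for a proper homothety, compute its unique fixed point."""
    if a == 1:
        if (c, c1) == (0, 0):
            return ("identity", None)      # every point is fixed
        return ("translation", None)       # no fixed points
    inv = Fr(1) / (1 - a)                  # (1 - a)^(-1), exact
    return ("homothety", (inv * c, inv * c1))
```

For instance, with a = 3, c = 2, c′ = −4 the computed center is (−1, 2), and applying the dilation to that point returns it unchanged.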

If α is any non-identity translation and P, Q are any points in the plane then 
PP^α ∥ QQ^α. Hence, all lines PP^α lie in the same “direction”, which will be 
called the direction of the translation α. Given any line l, we define the set of 
translations with direction l: 

\[ \mathrm{Tr}_l := \{\alpha \in \mathrm{Tr} : \alpha \text{ has direction } l\} \cup \{\iota\}. \]


Another important class of affine transformations is the class of rotations, 
given by equations 


\[ \alpha(x) = ax + by, \qquad \alpha(y) = a'x + b'y, \]


for some a, b, a′, b′ ∈ 𝔉 such that ab′ − a′b = 1. Rotations are not dilations, 
but have a fixed point, viz. O(0, 0). 


For any line l and any point O in a given affine plane, we have: 

\[ \mathrm{Tr}_l \le \mathrm{Tr} \le \mathrm{Dil} \le \mathrm{Aut}, \qquad \mathrm{Ht}_O \le \mathrm{Dil} \le \mathrm{Aut}, \]


where ≤ means subgroup. 

A set of dilations D is transitive on a set of points S if, for every A, B ∈ S 
the equation A^δ = B has a solution for δ in D. Given any line l, the set Tr_l 
will be called transitive if it is transitive on the set of points in the line l. The 
set Tr will be called transitive if it is transitive on the entire universe of points. 
The set Ht_O will be called transitive when, for every line l containing O, the set 
Ht_O is transitive on the set of points l \ {O}. Finally Ht will be called transitive 
when Ht_O is transitive for every point O. 

A set of dilations D is called commutative when compositions of dilations in 
D commute. It can be shown that (i) the set Tr is commutative if and only 
if the set Tr_l is commutative for every l, and (ii) if Tr is transitive then it is also 
commutative. 

Transitivity and commutativity of the dilations of a given affine plane are 
closely related to the Desargues and Pappus properties satisfied in that plane. 
For a more comprehensive discussion on these relations, the reader is referred 
to Szmielew, 1983, from where we cite the following. 


THEOREM 7.35 An affine plane satisfies 
i) D₁ iff the set Tr is transitive; 
ii) D₃ iff the set Ht is transitive; 
iii) P₂ iff the set Ht_O is transitive and commutative for every point O. 


7. On the first-order theories of affine and projective 
spaces 


Here we will discuss some logical results about definability in affine spaces 
and axiomatization and decidability of the first-order theories of affine and 
projective spaces, that can be obtained as consequences from the method of 
coordinatization. 


7.1 On affine relations in affine spaces 


Two lines x and y in an affine structure are called crossing or skew, denoted 
x X y, if they are not incident and not parallel. Thus, each of the relations 
Int, ∥ and X in affine spaces is definable in terms of incidence between a 
point and a line. Therefore, affine spaces and planes can be defined with these 
relations taken as primitives, but that would not enhance the expressiveness of 
the language. 

Note that every relation in an affine plane definable in terms of incidence 
alone is preserved under collineations. Therefore, using collineations one can 
show e.g. that orthogonality of lines in Rⁿ is not definable in terms of the 
relation of incidence alone, for any n ≥ 2. Indeed, the mapping in Rⁿ that 
halves the first coordinate of a point is clearly a collineation, but it does not 
preserve orthogonality. 
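
The counterexample can be verified on concrete points. The following Python sketch works in the rational plane (an assumption made so that the arithmetic is exact; the function names are illustrative): halving the first coordinate keeps a collinear triple collinear but destroys the orthogonality of the two diagonals through the origin.

```python
from fractions import Fraction as Fr

def halve_first(p):
    """The collineation (x, y) -> (x/2, y)."""
    return (Fr(p[0]) / 2, Fr(p[1]))

def collinear(p, q, r):
    return (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0]) == 0

def orthogonal(p, q, r, s):
    """Lines pq and rs are orthogonal iff their direction vectors have dot product 0."""
    return (q[0] - p[0]) * (s[0] - r[0]) + (q[1] - p[1]) * (s[1] - r[1]) == 0

O, A, A2 = (0, 0), (1, 1), (2, 2)    # three points on the line y = x
B = (1, -1)                          # OB is orthogonal to OA
images = [halve_first(p) for p in (O, A, A2, B)]
```

Here the images of A and B are (1/2, 1) and (1/2, −1), whose dot product is 1/4 − 1, which is non-zero.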

Further, we can define an affine relation in affine planes as one which is 
preserved under affine transformations. Thus, incidence and parallelism are 
affine relations, while orthogonality is not. 

Note, on the other hand, that many not obviously affine concepts can be 
defined in affine terms, or constructed with purely affine means, in affine planes 
satisfying special additional properties, e.g. in A(R). For example, the 
equidistance relation on strictly parallel line segments, denoted here by ≡₁, is given 
by the formula 


\[ X_1X_2 \equiv_1 Y_1Y_2 \;:\Leftrightarrow\; (X_1 = X_2 \wedge Y_1 = Y_2) \vee \big( l(X_1,X_2) \parallel l(Y_1,Y_2) \wedge ( l(X_1,Y_1) \parallel l(X_2,Y_2) \vee l(X_1,Y_2) \parallel l(X_2,Y_1) ) \big) \]


(see Fig. 7.13), while equidistance on arbitrary parallel line segments, denoted 
here by ≡₂, is given by the formula 


\[ X_1X_2 \equiv_2 Y_1Y_2 \;:\Leftrightarrow\; \exists Z_1 \exists Z_2\, ( X_1X_2 \equiv_1 Z_1Z_2 \wedge Y_1Y_2 \equiv_1 Z_1Z_2 ) \]


(see Fig. 7.14). In particular, the midpoint operation between two points, ⊕, is 
given by the formula 


\[ X = Y_1 \oplus Y_2 \;:\Leftrightarrow\; XY_1 \equiv_2 XY_2. \]


As we will note in Sec. 8, betweenness of points in Rⁿ for n > 1 is an affine 
relation, but is not definable in terms of incidence alone, because it may not be 
preserved by collineations which are not affinities. 


7.2 Coordinatization as logical interpretation 


Figure 7.13.    Figure 7.14. 


Every coordinatization of an affine plane 𝔄 defines an interpretation of 𝔄 in 
the corresponding ternary ring 𝔗 = (T; 0, 1, T) attached to it. Indeed, if we treat 
the plane as a collinearity structure (Po, Col), then 𝔄 can be 2-dimensionally 
interpreted in 𝔗 as follows: 


i) The domain of 𝔄 interpreted in 𝔗 is given by the formula 
ψ(x₁, x₂) := (x₁ = x₁). 
ii) The pair (a, b) in T² is mapped to the point P(a, b) in 𝔄. 
iii) Collinearity of points in 𝔄 is given by the formula 


\[ \psi_{Col}(x_1, y_1, x_2, y_2, x_3, y_3) := \bigwedge_{i,j=1,2,3} x_i = x_j \;\vee\; \exists m \exists c \Big( \bigwedge_{i=1,2,3} y_i = T(m, x_i, c) \Big). \]
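
The collinearity formula in item iii) can be evaluated by brute force over a small finite ternary ring. The sketch below is only an illustration under stated assumptions: it takes the ternary operation T(m, x, c) = m·x + c of the prime field GF(5), and the names `P_MOD`, `T` and `col` are hypothetical.

```python
P_MOD = 5  # assumed prime; the ternary ring is the field GF(5)

def T(m, x, c):
    """Linear ternary operation of a field: T(m, x, c) = m*x + c."""
    return (m * x + c) % P_MOD

def col(p1, p2, p3):
    """Collinearity as in item iii): equal first coordinates, or a common slope m and intercept c."""
    pts = (p1, p2, p3)
    if p1[0] == p2[0] == p3[0]:
        return True                     # all three lie on a line x = const
    return any(
        all(y == T(m, x, c) for (x, y) in pts)
        for m in range(P_MOD)
        for c in range(P_MOD)
    )
```

The two existential quantifiers over m and c become the two loops, and the first disjunct covers the lines parallel to OY.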




















If the plane is regarded as a two-sorted incidence structure (Po, Li, I) then 
𝔄 can be 4-dimensionally interpreted in 𝔗 as follows: 


i) The domain of 𝔄 interpreted in 𝔗 is given by the formula 
ψ(x₁, x₂, x₃, x₄) := (x₁ = x₁). 


ii) The quadruple (a, b, c, d) in T⁴ is mapped to the point P(a, b) if (a, b) = 
(c, d), and to the line determined by the points P(a, b) and Q(c, d) 
otherwise. 


iii) Incidence of a point and a line in 𝔄 is given by the formula 


\[ \psi_I(x_1, x_2, x_3, x_4, y_1, y_2, y_3, y_4) := \big( (x_1 = x_3 \wedge x_2 = x_4) \wedge (y_1 \neq y_3 \vee y_2 \neq y_4) \big) \wedge \big( \exists m \exists c\, ( x_2 = T(m, x_1, c) \wedge y_2 = T(m, y_1, c) \wedge y_4 = T(m, y_3, c) ) \vee (x_1 = y_1 \wedge y_1 = y_3) \big) \]


(informally: x̄ is a point, ȳ is a line and x̄ lies on ȳ). 

Alternatively, lines in planes satisfying D₁ can be interpreted in the coordinate 
ring as triples of coefficients, by their general equations. 

Conversely, any ternary ring can be interpreted in the affine plane over it, by 
taking the points on the unit line as a domain of the interpretation, and defining 
addition and multiplication by means of first-order formulae in the language of 
incidence, constructed following the geometric description of these operations, 
as described above and illustrated by Fig. 7.5 and Fig. 7.6. 


7.3 Decidability and undecidability of affine and 
projective theories 


The interpretations between affine planes and coordinate rings enable ef- 
fective translation of first-order formulae from one to the other language and 
transfer of various logical properties between the first order theories of these 
classes of structures. 


THEOREM 7.36 For every affine plane 𝔄 and every ternary ring 𝔗: 


1. If the first order theory of A(𝔗) is decidable then the first order theory 
of 𝔗 is decidable, too. 


2. If the first order theory of 𝔗_OXY(𝔄) is decidable for some coordinate 
system OXY in 𝔄, then the first order theory of 𝔄, expanded with 
point-constants for O, X, Y, is decidable, too. 


Given a class of ternary rings 𝒯 we denote by A(𝒯) the class of affine planes 
over these rings; likewise, given a class of affine planes 𝒜 with the Desargues 
property, we denote by T(𝒜) the class of ternary rings attached to the planes 
in 𝒜. 


THEOREM 7.37 For every class of ternary rings 𝒯 and every class of affine 
planes 𝒜 satisfying the Desargues property the following holds: 


1. If the first order theory of A(𝒯) is decidable then the first order theory 
of 𝒯 is decidable, too. 


2. If the first order theory of T(𝒜) is decidable, then the first order theory 
of 𝒜 is decidable, too. 


As shown in Tarski, 1949a, Tarski and Mostowski, 1949, Tarski, 1949b and 
Tarski et al., 1953, the first order theory of all fields and that of the field of 
rationals are undecidable, while the first order theory of real closed fields, being 
the same as the first order theory of the field of reals, is complete and decidable. 
Therefore, we obtain the following. 


COROLLARY 7.38 


1. The first order theories of all Pappian affine planes, and of the rational 
affine plane are undecidable. 


2. The first order theories of all affine planes over real closed fields, and of 
the real affine plane are decidable. 


A simple argument shows that if a first-order theory T has an undecidable 
extension T′ by means of finitely many axioms, then T is itself undecidable. 
Indeed, let φ be the conjunction of all axioms extending T to T′. Then for any 
sentence ψ, T′ ⊢ ψ iff T ⊢ φ → ψ, hence any decision method for T yields a 
decision method for T′. Thus, we obtain the following results: 


COROLLARY 7.39 The following first-order theories are undecidable: the 
theory of all Desarguesian planes; the theory of all affine planes; the theory 
of all affine spaces; the theory of all linear spaces; the theory of all incidence 
structures. 


Analogous results hold for first-order theories of projective planes and spaces 
(see Ziegler, 1982). 


7.4 On the axiomatizations of the first-order theories of 
the real projective and affine planes 


The real affine plane A(R) is simply the Euclidean plane with the standard 
points, lines and incidence relation. The real projective plane P(R) can be 
obtained from A(R) by the extension construction described earlier, but also 
e.g. by the well-known central projection of the affine plane onto a sphere 
touching that plane (see Coxeter, 1969). 

Here we briefly discuss the question: what are the first-order 
axiomatizations of the real projective and affine planes P(R) and A(R) in the language 
with incidence? 

The first-order theory of R has a well-known axiomatization (the theory of 
real-closed fields, see e.g. Tarski, 1967, Chang and Keisler, 1973). It extends 
the axioms for fields with the following axiom schemes: 


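RealFields : −1 is not a sum of squares: 
\[ \forall x_1 \ldots \forall x_n\, \neg ( x_1^2 + \ldots + x_n^2 = -1 ) \]
for every integer n > 0. 


RealClosedFields : Every polynomial of odd degree has a zero: 
\[ \forall a_0 \ldots \forall a_n\, ( \neg\, a_n = 0 \rightarrow \exists x\, ( a_0 + a_1 x + \ldots + a_n x^n = 0 ) ) \]
for every odd integer n > 0. 


PythagoreanFields : 
\[ \forall x \exists y\, ( y^2 = x \vee y^2 = -x ). \]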


In view of the mutual interpretability between R and each of P(R) and 
A(R), and the uniqueness of the coordinate field for each of these planes, 
translating the axioms above to the geometric language should in principle 
suffice to axiomatize their first-order theories. Still, it is natural to search for 
explicit and geometrically meaningful axiomatizations of the real projective 
and affine planes, rather than a translation of the axioms of real closed fields to 
the geometric language. 

When betweenness is added to the language, such a complete axiomatization 
(involving an infinite axiom scheme of continuity) for the real affine plane has 
been obtained by Szczerba and Tarski, 1965 and Szczerba and Tarski, 1979, 
and will be presented in Sec. 8. The language with betweenness, however, is 
substantially more expressive, so the question is: what affine properties of R² 
can be expressed in terms of incidence alone, in projective and affine settings? 

We already know that there are rather non-trivial universal properties true 
in P(R) and A(R), such as the Pappus property which guarantees that their 
coordinate ring is a field. 

Furthermore, there is a geometrically natural axiom, known as the Fano ax- 
iom, which is true in P(R) but does not follow from the Pappus property. In 
order to state the Fano axiom, we define a complete quadrangle in a projective 
plane to be a configuration of 7 points and 6 lines obtained as follows: take 4 
points A, B, C, D, no three of which are collinear, consider the 6 lines deter- 
mined by pairs of these points, and add the 3 “diagonal points” of intersection 
P(AB,CD), P(AC, BD) and P(AD, BC). 


Fano: The three diagonal points in any complete quadrangle are never collinear. 


The Fano axiom is true in every projective plane over a field of characteristic 
different from 2 (see e.g. Coxeter, 1969), but it fails in the Fano plane. An 
affine version of the Fano axiom can be formulated, too. In the particular case 
where the denied collinearity is along the infinite line, it claims precisely that 
the diagonals of every parallelogram in the plane must intersect. 
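
The dependence of the Fano axiom on the characteristic can be checked computationally. The Python sketch below is an illustration under assumptions not taken from the text: points and lines of the projective plane over GF(p) are represented by homogeneous coordinate triples, joins and meets are computed by cross products, and only the standard quadrangle A(1,0,0), B(0,1,0), C(0,0,1), D(1,1,1) is tested. The function names are hypothetical.

```python
def cross(u, v, p):
    """Cross product mod p: the line through two points, or the meet of two lines."""
    return ((u[1] * v[2] - u[2] * v[1]) % p,
            (u[2] * v[0] - u[0] * v[2]) % p,
            (u[0] * v[1] - u[1] * v[0]) % p)

def det3(u, v, w, p):
    """3x3 determinant mod p; it vanishes iff the three points are collinear."""
    return (u[0] * (v[1] * w[2] - v[2] * w[1])
            - u[1] * (v[0] * w[2] - v[2] * w[0])
            + u[2] * (v[0] * w[1] - v[1] * w[0])) % p

def diagonal_points_collinear(p):
    """Are the three diagonal points of the standard quadrangle collinear over GF(p)?"""
    A, B, C, D = (1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 1)
    d1 = cross(cross(A, B, p), cross(C, D, p), p)   # P(AB, CD)
    d2 = cross(cross(A, C, p), cross(B, D, p), p)   # P(AC, BD)
    d3 = cross(cross(A, D, p), cross(B, C, p), p)   # P(AD, BC)
    return det3(d1, d2, d3, p) == 0
```

Up to scalar factors the diagonal points are (1,1,0), (1,0,1) and (0,1,1), whose determinant is ±2, so the test succeeds exactly when p = 2.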

The Pappus property and Fano axiom are the only additional axioms to those 
for projective planes offered in Coxeter, 1969, Sec. 14.1 for the real projective 
plane. However, as shown in an exercise following Coxeter, 1969, Sec. 14.1, 
for every odd prime p there is a finite projective plane PG(2, p) of p² + p + 1 
points and as many lines satisfying all these axioms, so this system is far from 
complete. 

In fact, there are infinitely many other geometric axioms which should be 
added to the theory of Pappian planes, in order to obtain the complete theory 
of A(R), because it follows from results in Szczerba and Tarski, 1979 that the 
latter theory is not finitely axiomatizable. For further discussion and results 
on this, see von Plato, 1995, Pambuccian, 2001b. Still, the question of finding 
explicit and geometrically natural axiomatizations for P(R) and A(R) 
remains, as far as we know, open. The same questions can be raised 
about the first-order theories of the n-dimensional real affine spaces A(Rⁿ) 
(with n ≥ 3) generated over the field of reals. 


8. Betweenness structures and ordered affine planes 


We now consider the geometric language in which the only primitive relation 
is the ternary relation B of betweenness on points. B(XYZ) means that the 
points X, Y and Z are collinear and Y lies between X and Z (with possibly Y 
coinciding with X or Z). The language consisting of the betweenness relation 
is significantly more expressive than the language with collinearity, and yet, as 
we will see later, betweenness is very much an affine notion so that it makes 
sense to add it to the language of affine geometries, as was done by Tarski. 

From the results in Sec. 6.4 it is easy to see that collineations on the real affine 
plane preserve ratios of parallel line segments and consequently also between- 
ness on points, so that axiomatizing the real affine plane using betweenness 
does not leave the realm of affine geometry. 


8.1 Betweenness structures and ordered geometry 


A structure (S; B) consisting of a non-empty set S and a ternary relation 
B on S will be called a linear betweenness structure provided the following 
axioms are satisfied: 


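B1: $\forall X \forall Y \forall Z\,(B(XYZ) \vee B(YZX) \vee B(ZXY))$ (connectivity) 
B2: $\forall X \forall Y\,(B(XYX) \rightarrow X = Y)$ 
B3: $\forall X \forall Y \forall Z\,(B(XYZ) \rightarrow B(ZYX))$ (symmetry) 
B4: $\forall U \forall X \forall Y \forall Z\,(B(UXY) \wedge B(UYZ) \rightarrow B(XYZ))$ 
(inner transitivity) 
B5: $\forall U \forall X \forall Y \forall Z\,(X \neq Y \wedge B(UXY) \wedge B(XYZ) \rightarrow B(UYZ))$ 
(outer transitivity) 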


The relation B is called the betweenness relation of the linear betweenness 
structure. In Szmielew, 1983 it is shown that these axioms are independent, 
although when the cardinality of the set S is different from 4, the axiom B5 
becomes redundant. 

Linear orderings and linear betweenness structures are closely related as 
follows. Let a linear ordering (S; ≤) be given. Then the structure (S; B_≤) 
with B_≤ defined as 


\[ B_{\le}(XYZ) \;:\Leftrightarrow\; X \le Y \le Z \,\vee\, Z \le Y \le X \]


is a linear betweenness structure. Conversely, let a linear betweenness structure 
(S; B) be given and take any distinct A, B, C ∈ S such that B(ABC). Then 
the structure (S; ≤_B) with ≤_B defined as 


\[ X \le_B Y \;:\Leftrightarrow\; (B(XYB) \wedge B(XBC)) \vee (B(XBC) \wedge B(ABY)) \vee (B(ABY) \wedge B(BXY)) \]


is a linear ordering. The purpose of the parameters A, B, and C is to fix the 
direction of ≤_B, since clearly every linear betweenness structure gives rise to 
a pair of mutually converse linear orderings. In fact, the parameters A, B, C 
are inessential in the sense that if A_i, B_i, C_i ∈ S are distinct with B(A_iB_iC_i) 
(i = 1, 2) then the two pairs of linear orderings determined by these two triples 
of parameters will be identical. Betweenness structures and linear orderings 
related as above will be called adjoint. Thus, every linear betweenness structure 
has a pair of mutually converse linear orderings adjoint to it, and every linear 
ordering has a linear betweenness structure adjoint to it. 
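
The first half of this correspondence can be tested mechanically on a finite linear ordering. The sketch below is a brute-force check and not a proof; it assumes the ordering is the usual one on a finite set of integers, reads the defining clause as X ≤ Y ≤ Z or Z ≤ Y ≤ X, and uses the hypothetical names `B` and `check_axioms`.

```python
from itertools import product

def B(x, y, z):
    """Betweenness adjoint to the usual ordering: y lies (weakly) between x and z."""
    return x <= y <= z or z <= y <= x

def check_axioms(S):
    """Evaluate axioms B1-B5 by exhaustive search over the finite set S."""
    S = list(S)
    return {
        "B1": all(B(x, y, z) or B(y, z, x) or B(z, x, y)
                  for x, y, z in product(S, repeat=3)),
        "B2": all(not B(x, y, x) or x == y
                  for x, y in product(S, repeat=2)),
        "B3": all(not B(x, y, z) or B(z, y, x)
                  for x, y, z in product(S, repeat=3)),
        "B4": all(not (B(u, x, y) and B(u, y, z)) or B(x, y, z)
                  for u, x, y, z in product(S, repeat=4)),
        "B5": all(not (x != y and B(u, x, y) and B(x, y, z)) or B(u, y, z)
                  for u, x, y, z in product(S, repeat=4)),
    }
```

Calling `check_axioms(range(5))` evaluates each axiom on a five-element chain; each implication is encoded as "not premise or conclusion".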

A linear betweenness structure (S; B) is dense if it satisfies the axiom 


\[ \forall X_1 \forall X_2\, ( X_1 \neq X_2 \rightarrow \exists Y\, ( B(X_1 Y X_2) \wedge X_1 \neq Y \wedge X_2 \neq Y ) ). \]


A linear betweenness structure (S; B) is called Dedekind complete if it 
satisfies the second-order axiom 


\[ \forall P_1 \forall P_2\, ( \exists Y\, B(Y P_1 P_2) \rightarrow \exists Z\, B(P_1 Z P_2) ), \]


stating that if all points of the set P₁ precede all points of the set P₂, i.e. if 
for some point Y we have B(Y X₁ X₂) for all X₁ ∈ P₁ and X₂ ∈ P₂, then there 
is a point which separates P₁ and P₂. Accordingly, a linear ordering (S; ≤) is 
called Dedekind complete if the following second-order axiom is satisfied: 


\[ \forall P_1 \forall P_2 \Big( \bigwedge_{i=1,2} P_i \neq \emptyset \,\wedge\, P_1 \cup P_2 = S \,\wedge\, P_1 \cap P_2 = \emptyset \,\wedge\, P_1 \le P_2 \;\rightarrow\; \exists X\, ( P_1 \le X \le P_2 ) \Big). \]


This axiom states that if P₁ and P₂ are non-empty disjoint sets that cover the 
entire set S and if all elements in P₁ are dominated by all elements in P₂, then 
there is a point that separates P₁ and P₂. Szmielew, 1983 shows that if a linear 
betweenness structure (S; B) is adjoint to a linear ordering (S; ≤) then (S; B) 
will be Dedekind complete if and only if (S; ≤) is Dedekind complete. 

A linear betweenness structure (respectively a linear ordering) is called con- 
tinuous if it is dense and Dedekind complete. 


Now, a betweenness relation is defined on a collinearity structure (Po, Col) 
by defining it on every line in that structure. Thus we deal with a geometric 
structure 𝔈 = (Po, Col, B) such that 


1. B is a linear ternary relation on Po, i.e. B ⊆ Col; 


2. (l(X, Y); B) is a linear betweenness structure for every line l(X, Y) 
from Li(𝔈). 

Point collinearity can be defined in terms of betweenness: 


\[ Col(X_1X_2X_3) := \bigvee_{\neq(i,j,k)} B(X_iX_jX_k), \]

where the disjunction ranges over all pairwise distinct i, j, k ∈ {1, 2, 3}. 


Then, the axioms for betweenness in a collinearity structure are adjusted by 
adding the axiom B6 and replacing the axiom B1 with the axiom B7: 


B6: $\forall X \forall Y \forall Z\,(B(XYZ) \rightarrow Col(XYZ))$ (linearity) 
B7: $\forall X_1 \forall X_2 \forall X_3\,(Col(X_1X_2X_3) \rightarrow \bigvee_{\neq(i,j,k)} B(X_iX_jX_k))$ 
(connectivity on lines) 

Consequently, betweenness can serve as the only primitive relation in ordered 
collinearity structures and their axioms can be phrased exclusively in terms of 
betweenness. A collinearity structure with a betweenness relation imposed on it 
will be called an ordered collinearity geometry. In case the collinearity structure 
has dimension ≥ 2, and in particular when dealing with the real collinearity 
plane, it turns out that the axiom B2 becomes redundant. 

The betweenness relation has great expressive power; as will be seen in the 
next section, betweenness was both Veblen’s and Tarski’s primitive of choice 
for formalizing affine notions in first-order logic. For example, given points 
X and Y one can define the following types of line segment: closed intervals 
[X, Y] := {Z : B(XZY)}; open intervals (X, Y) := [X, Y] \ {X, Y}; the 
ray from X away from Y (when X ≠ Y) X/Y := {Z : B(YXZ)}; the 
line containing X and Y (when X ≠ Y) XY := {Z : B(ZXY)} ∪ {Z : 
B(XZY)} ∪ {Z : B(XYZ)}, etc. 


8.2 Definability of betweenness and order in affine planes 


Note that even if a linear betweenness structure is defined on every line in 
a collinearity structure, that may not suffice to have a “global” betweenness 
relation on the entire structure, satisfying the axioms B2-B7, because the 
linear betweenness relations may not be synchronizable across the structure. 
To ensure that, we must require that betweenness is preserved under 
parallel projections between lines. This property is formalized by the following 
three axioms: 


Pasch : (Invariance; see Fig. 7.15.) 


\[ \Big( \neg Col(X_1X_2X_3Y_1Y_2Y_3) \wedge Col(X_1X_2X_3) \wedge Col(Y_1Y_2Y_3) \wedge B(X_1X_2X_3) \wedge \bigwedge_{i,j=1,2,3} X_iY_i \parallel X_jY_j \Big) \rightarrow B(Y_1Y_2Y_3) \]


oPasch : (Outer invariance; see Fig. 7.16.) 

\[ \big( \neg Col(X_1X_2X_3Y_2Y_3) \wedge Col(X_1X_2X_3) \wedge Col(X_1Y_2Y_3) \wedge B(X_1X_2X_3) \wedge X_2Y_2 \parallel X_3Y_3 \big) \rightarrow B(X_1Y_2Y_3) \]

iPasch : (Inner invariance; see Fig. 7.17.) 


\[ \big( \neg Col(X_1X_2X_3Y_1Y_3) \wedge Col(X_1X_2X_3) \wedge Col(Y_1X_2Y_3) \wedge B(X_1X_2X_3) \wedge X_1Y_1 \parallel X_3Y_3 \big) \rightarrow B(Y_1X_2Y_3) \]


Figure 7.15.    Figure 7.16.    Figure 7.17. 


It turns out (see Szmielew, 1983) that in an ordered affine plane, not only are 
the three Pasch axioms pairwise equivalent, but they are also all equivalent to 
the Pasch axiom Ax.B5 to be described below. 

We now briefly investigate the relationship between ordered affine planes and 
ordered division rings (see Szmielew, 1983 for details). Let an ordered strong 
left division ring 𝔉 = (F; 0, 1, +, ·, ≤) be given, let 𝔉⁻ = (F; 0, 1, +, ·) 
be the unordered reduct of 𝔉, and let B be the betweenness relation on F adjoint to 
≤. Then A(𝔉⁻) is an affine plane which satisfies the axiom D₁. We can treat 
A(𝔉⁻) as a collinearity structure (F², Col). Put A(𝔉) = (F², Col, B_𝔉), 
where B_𝔉 is a ternary relation on the points in A(𝔉⁻) defined by stipulating 
that for any A = (x_A, y_A), B = (x_B, y_B), C = (x_C, y_C) ∈ F², 


\[ B_{\mathfrak{F}}(ABC) \text{ iff } Col(ABC) \;\&\; B(x_A x_B x_C) \;\&\; B(y_A y_B y_C). \]

On the other hand, let 𝔄 = (Po, Col, B) be any ordered affine plane 
satisfying the axiom D₁ and let 𝔄⁻ = (Po, Col) be the unordered reduct of 𝔄. 
Fixing some coordinate system OXY in 𝔄, the ternary ring 𝔉_OXY(𝔄⁻) = 
(F; 0, 1, +, ·) attached to 𝔄⁻ will be a strong left division ring. Since B is 
a betweenness relation in 𝔄, B will be a betweenness relation on every 
line in 𝔄, and hence also on the set F. Thus (F; B) is a linear betweenness 
structure and it will have a pair of mutually converse linear orderings adjoint 
to it. Suppose ≤ is the linear ordering on F adjoint to B and such that 0 ≤ 1. 
Put 𝔉_OXY(𝔄) = (F; 0, 1, +, ·, ≤). 
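
The coordinatewise definition of betweenness on the plane over an ordered ring, given earlier in this subsection (collinearity together with betweenness of the first and of the second coordinates), can be sketched as follows. The example assumes the ordered field of rationals, and the function names are hypothetical.

```python
from fractions import Fraction as Fr

def between_1d(x, y, z):
    """Betweenness on the coordinate line, adjoint to the usual ordering."""
    return x <= y <= z or z <= y <= x

def collinear(a, b, c):
    return (b[0] - a[0]) * (c[1] - a[1]) - (b[1] - a[1]) * (c[0] - a[0]) == 0

def between_plane(a, b, c):
    """B(abc) iff a, b, c are collinear and b is between a and c in each coordinate."""
    return (collinear(a, b, c)
            and between_1d(a[0], b[0], c[0])
            and between_1d(a[1], b[1], c[1]))

A0 = (Fr(0), Fr(0))
B0 = (Fr(1), Fr(2))
C0 = (Fr(3), Fr(6))
D0 = (Fr(2), Fr(2))   # coordinates of B0 lie between those of A0 and D0, but the points are not collinear
```

The last sample point shows why the collinearity conjunct cannot be dropped: both coordinate conditions hold for (A0, B0, D0), yet the three points do not lie on one line.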

The property that every line in a plane contains at least three points can be 
axiomatized as follows: 


B8: $\forall X \forall Y \exists Z\,(Z \neq X \wedge Z \neq Y \wedge Col(XYZ))$ 





Szmielew, 1983 gives the following representation results. 


THEOREM 7.40 Let 𝔄 be an ordered affine plane satisfying Pasch and B8. 
If 𝔄 also satisfies D₁ (respectively, D₃; P₂) then 𝔄 ≅ A(𝔉) for some ordered 
strong left division ring (respectively, skew field; field) 𝔉. 


THEOREM 7.41 If 𝔉 is an ordered strong left division ring (respectively, skew 
field; field) then A(𝔉) is an ordered affine plane satisfying the axioms Pasch, 
B8, and D₁ (respectively, D₃; P₂). 





8.3 Axiomatizing betweenness in R² 


Szczerba and Tarski, 1965 and Szczerba and Tarski, 1979 study the affine 
fragment AF₂, called the elementary affine Euclidean geometry, of the 
Euclidean plane. AF₂ is the elementary geometry formalized in the language 
with only the betweenness relation B, where a sentence is valid in AF₂ if and 
only if it is valid in the Euclidean plane E₂. They give a complete 
axiomatization of AF₂ which will be outlined below (all axioms below are implicitly 
universally quantified over all occurring free variables). 


Ax.B1: IDENTITY AXIOM 
$B(XYX) \rightarrow X = Y$ 

Ax.B2: TRANSITIVITY AXIOM 
$Y \neq Z \wedge B(XYZ) \wedge B(YZW) \rightarrow B(XYW)$ 

Ax.B3: CONNECTIVITY AXIOM 
$V \neq W \wedge B(VWX) \wedge B(VWY) \rightarrow (B(VXY) \vee B(VYX))$ 


Ax.B4: EXTENSION AXIOM 
$\exists X\,(X \neq Y \wedge B(XYZ))$ 


Ax.B5: (OUTER FORM OF) PASCH AXIOM 
$B(XY'Z) \wedge B(YZ'Y') \rightarrow \exists X'\,(B(ZX'Y) \wedge B(XZ'X'))$ 

(given a triangle YY′Z, a point X on the extension of the side Y′Z and a 
point Z′ on the inner side (with respect to X) of the triangle, the line XZ′ 
must intersect the triangle in its outer side (with respect to X) |YZ|; see 
Fig. 7.18.) 


Ax.B6: DESARGUES AXIOM 
$\neg Col(TXY) \wedge \neg Col(TXZ) \wedge \neg Col(TYZ) \wedge B(TXX') \wedge B(TYY')$ 
$\wedge\, B(TZZ') \wedge B(YXU) \wedge B(Y'X'U) \wedge B(XZW) \wedge B(X'Z'W)$ 
$\wedge\, B(YZV) \wedge B(Y'Z'V) \rightarrow B(UVW)$ 


(triangles perspective from a point are perspective from a line; see Fig. 
7.19.) 


Ax.B7: LOWER 2-DIMENSIONAL AXIOM 
$\exists X \exists Y \exists Z\,(\neg B(XYZ) \wedge \neg B(YZX) \wedge \neg B(ZXY))$ 


Ax.B8: UPPER 2-DIMENSIONAL AXIOM (See Fig. 7.20.) 
$\exists V\,((B(YVZ) \wedge Col(XVW)) \vee (B(XVZ) \wedge Col(YVW))$ 
$\vee\, (B(XVY) \wedge Col(ZVW)))$ 

As.B9: ELEMENTARY CONTINUITY AXIOM SCHEMA 
$\forall \overline{W}\,\big(\exists U \forall X \forall Y\,(\varphi(X, \overline{W}) \wedge \psi(Y, \overline{W}) \rightarrow B(UXY)) \rightarrow$ 
$\exists V \forall X \forall Y\,(\varphi(X, \overline{W}) \wedge \psi(Y, \overline{W}) \rightarrow B(XVY))\big)$ 


The variables W̄ are distinct from U, V, X, Y, and φ(X, W̄) and ψ(Y, W̄) 
are first-order formulae over B with free variables only amongst X, W̄ 
in the case of φ, and Y, W̄ in the case of ψ. This schema comprises 
the parametrically first-order definable instances of the full second-order 
continuity axiom (see Ax.11 further). Note that As.B9 is an infinite 
schema, and it cannot be replaced by a finite one, as Tarski has shown. 


The axiom system given so far is denoted $GA_2$ and the geometry it describes
is called by Szczerba and Tarski the general affine geometry. It does not reflect
Euclid's parallel postulate at all and it is shown in Szczerba and Tarski, 1979
that:

Figure 7.18.

Figure 7.19.

Figure 7.20.


• $GA_2$ is incomplete and has continuum many complete extensions.

• In particular, $GA_2$ is a proper subtheory of the elementary affine absolute
geometry $AA_2$ (the affine fragment in the language of $B$ of the absolute
geometry $A_2$, which is the reduct of the Euclidean geometry obtained by
dropping Euclid's parallel postulate).

• However, $GA_2$ is complete with respect to universal sentences, i.e. if a
universal sentence $\varphi$ is true in some model of $GA_2$ then $\varphi$ is true in every
model of $GA_2$.

• $GA_2$ is not finitely axiomatizable.

• $GA_2$ is hereditarily undecidable, meaning that $GA_2$, as well as all
its subtheories, is undecidable.

• $GA_2$ is decidable with respect to inductive sentences. Therefore, $GA_2$
is not an inductive theory.


Here is a form of Euclid's postulate in the language of $B$:

Ax.E: EUCLID'S AXIOM
$Z \neq V \wedge B(ZVT) \wedge B(UVW) \to \exists X \exists Y (B(ZUX) \wedge B(ZWY) \wedge B(YTX))$

















The axiom Ax.E says that through any point T in the interior of an angle there 
is a line intersecting both sides of that angle (see Fig. 7.21). 


Figure 7.21.


Szczerba and Tarski show that adding the axiom Ax.E to $GA_2$ yields a
complete axiomatization of the elementary affine Euclidean geometry $AE_2$,
and hence also a complete axiomatization of the real affine plane.

Finally, here is a representation result for models of $GA_2$. Let
$\mathfrak{F} = (F; 0, 1, +, \cdot, \le)$ be any ordered field and define $\oplus$ and $\odot$ as

$(x_1, y_1) \oplus (x_2, y_2) := (x_1 + x_2, y_1 + y_2)$,  $(x_1, y_1) \odot \alpha := (x_1 \cdot \alpha, y_1 \cdot \alpha)$,

for any $(x_1, y_1), (x_2, y_2) \in F^2$ and $\alpha \in F$. We can define a betweenness
relation $B_{\mathfrak{F}}$ in the Cartesian square $F^2$ over the field $\mathfrak{F}$ as follows: given
$a, b, c \in F^2$, we stipulate that

$B_{\mathfrak{F}}(abc) \Leftrightarrow b = [a \odot (1 - \lambda)] \oplus [c \odot \lambda]$

for some $\lambda \in F$ with $0 \le \lambda \le 1$. The structure $\mathfrak{A}(\mathfrak{F}) = (F^2; B_{\mathfrak{F}})$ thus
formed will be called the affine plane over the ordered field $\mathfrak{F}$. Using the
class of all interiors of triangles as a basis, we define a topology on the set
$F^2$. Now let $S$ be any non-empty, convex, open subset of $F^2$. The structure
$\mathfrak{A}(\mathfrak{F}; S) = (F^2|_S; B_{\mathfrak{F}}|_S)$ will be called the S-restricted affine plane over $\mathfrak{F}$. A
plane over some field will simply be called a restricted affine plane if it is an
S-restricted affine plane for some $S$.
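The construction above can be tried out on a concrete ordered field. The following is a minimal sketch, assuming the field is the rationals (Python's `Fraction`), reading the bounds on λ as non-strict, and taking S to be the open unit disc; the names `pt`, `between`, `in_disc` and `between_in_S` are illustrative and do not come from the text.

```python
from fractions import Fraction

def pt(x, y):
    """A point of F^2 for F = Q, stored as a pair of Fractions."""
    return (Fraction(x), Fraction(y))

def between(a, b, c):
    """B(abc): b = a*(1 - lam) + c*lam for some lam in Q with 0 <= lam <= 1."""
    if a == c:
        return b == a
    dx, dy = c[0] - a[0], c[1] - a[1]
    ex, ey = b[0] - a[0], b[1] - a[1]
    if dx * ey - dy * ex != 0:      # b is not on the line through a and c
        return False
    t = dx * ex + dy * ey           # equals lam * |c - a|^2 when b is on that line
    return 0 <= t <= dx * dx + dy * dy

def in_disc(p):
    """Membership in S, here the open unit disc (non-empty, convex, open)."""
    return p[0] ** 2 + p[1] ** 2 < 1

def between_in_S(a, b, c, in_S=in_disc):
    """Betweenness of the S-restricted plane: B restricted to points of S."""
    return in_S(a) and in_S(b) and in_S(c) and between(a, b, c)

a, b, c = pt(0, 0), pt("1/4", "1/4"), pt("1/2", "1/2")
print(between(a, b, c))                      # b lies on the segment from a to c
print(between(a, c, b))                      # c lies beyond b, so this fails
print(between_in_S(a, b, c))                 # all three points lie in the disc
print(between_in_S(a, pt(1, 1), pt(2, 2)))   # (1, 1) is outside the disc
```

The test avoids solving for λ explicitly: once b is known to lie on the line through a and c, comparing the scalar product with the squared length of the segment decides whether 0 ≤ λ ≤ 1.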


THEOREM 7.42 1. Every model of $GA_2$ is isomorphic to a restricted affine
plane over some ordered real closed field.

2. Every restricted affine plane over the ordered field of reals is a model of
$GA_2$.

3. If $\mathfrak{F}$ is an ordered real closed field not isomorphic to the field of reals,
then there is a restricted affine plane over $\mathfrak{F}$ which is not a model of $GA_2$.


9. Rich languages and structures for elementary geometry 


We will call a geometric language rich if the whole of elementary geometry
in $\mathbb{R}^n$ is definable in that language. Perhaps the first study on rich primitive
notions in elementary geometry is Pieri, 1908, where the Pieri relation $\Delta$ is
introduced, defined as

$\Delta(XYZ) \Leftrightarrow \|XY\| = \|XZ\|$,

meaning that the configuration of points $XYZ$ forms an isosceles triangle with
base $|YZ|$ (in the degenerate cases either $Y = Z$ or $X$ is the midpoint of $|YZ|$).
Pieri showed that $\Delta$ can be used as the only primitive relation in $\mathbb{R}^n$ for $n \ge 2$.
This result easily implies the richness of many other relations in terms of which
$\Delta$ is definable, for example the ternary relation of closer-than

$\|XY\| \le \|XZ\|$,

which states that either the point $Y$ is closer to $X$ than $Z$ is, or that
$Y$ and $Z$ lie equally far from $X$. This furthermore implies that the quaternary
relation shorter-than

$\|XY\| \le \|ZU\|$,

which states that the line segment $|XY|$ is shorter than the segment $|ZU|$, or
that they are of equal length, is also rich.

Veblen, 1904 considered the two primitive relations of betweenness $B$ and
equidistance $\equiv$ (or $\delta$), which are the same primitives that Tarski later used.
Veblen showed that these primitives are sufficient for the elementary geometry,
although he mistakenly believed he had proved that the relation of equidistance is
definable in terms of the relation of betweenness (see Tarski and Givant, 1999).
In fact, using the coordinatization of the Euclidean plane, and applying Padoa's
method, it is easy to see that the equidistance relation $\equiv$ is not definable in terms
of betweenness, not only in first-order languages, but even in higher-order logic.
Indeed, the linear transformation $f : \mathbb{R}^2 \to \mathbb{R}^2$, defined by $f(x, y) = (x, 2y)$,
preserves betweenness, but not equidistance.
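The Padoa-style argument can be spot-checked numerically. The sketch below is only an illustration on a small integer grid (a finite sample, not a proof); the helper names `between`, `sqdist` and `f` are chosen here for the example.

```python
from itertools import product

def between(a, b, c):
    """B(abc) in the coordinate plane: b lies on the closed segment from a to c."""
    if a == c:
        return b == a
    dx, dy = c[0] - a[0], c[1] - a[1]
    ex, ey = b[0] - a[0], b[1] - a[1]
    if dx * ey - dy * ex != 0:
        return False
    return 0 <= dx * ex + dy * ey <= dx * dx + dy * dy

def sqdist(p, q):
    """Squared Euclidean distance; enough for comparing lengths."""
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2

def f(p):
    """The linear map f(x, y) = (x, 2y)."""
    return (p[0], 2 * p[1])

grid = list(product(range(-2, 3), repeat=2))

# f maps betweenness triples to betweenness triples and back.
betweenness_preserved = all(
    between(a, b, c) == between(f(a), f(b), f(c))
    for a, b, c in product(grid, repeat=3)
)

# A pair of equidistant segments that f pulls apart.
x, y, z = (0, 0), (1, 0), (0, 1)
equidistant_before = sqdist(x, y) == sqdist(x, z)
equidistant_after = sqdist(f(x), f(y)) == sqdist(f(x), f(z))

print(betweenness_preserved, equidistant_before, equidistant_after)
```

Since f is a bijection of the plane that respects B in both directions but breaks equidistance, any relation definable from B alone would have to be preserved by f, which equidistance is not.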

On the other hand, Pieri showed that $B$ is first-order definable in terms of
the ternary closer-than relation $\le$ defined above:

$B(XYZ) \Leftrightarrow \forall U ((\|XU\| \le \|XY\| \wedge \|ZU\| \le \|ZY\|) \to U = Y)$,

meaning that if $U$ and $Y$ are intersection points of spheres with centers $X$ and
$Z$ then they must coincide (see Fig. 7.22).
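The definition of betweenness from closer-than can be illustrated on rational points. This is a sketch under stated assumptions: the universal quantifier over U is only tested against a finite grid in one direction, and in the other direction a witness U is constructed as the point of the segment XZ nearest to Y. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import product

def sqdist(p, q):
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2

def between(a, b, c):
    """B(abc): b lies on the closed segment from a to c."""
    if a == c:
        return b == a
    dx, dy = c[0] - a[0], c[1] - a[1]
    ex, ey = b[0] - a[0], b[1] - a[1]
    if dx * ey - dy * ex != 0:
        return False
    return 0 <= dx * ex + dy * ey <= dx * dx + dy * dy

def closer_eq(x, u, y):
    """The ternary closer-than relation: |XU| <= |XY|."""
    return sqdist(x, u) <= sqdist(x, y)

def pieri_between(x, y, z, universe):
    """Right-hand side of the definition, with U ranging over a finite universe."""
    return all(u == y for u in universe
               if closer_eq(x, u, y) and closer_eq(z, u, y))

def witness(x, y, z):
    """If Y is not between X and Z, return a point U != Y inside both balls."""
    if between(x, y, z):
        return None
    if x == z:
        return x
    dx, dy = z[0] - x[0], z[1] - x[1]
    t = Fraction((y[0] - x[0]) * dx + (y[1] - x[1]) * dy, dx * dx + dy * dy)
    t = min(max(t, Fraction(0)), Fraction(1))   # clamp to the segment XZ
    return (x[0] + t * dx, x[1] + t * dy)

def check(grid):
    for x, y, z in product(grid, repeat=3):
        if between(x, y, z):
            if not pieri_between(x, y, z, grid):
                return False
        else:
            u = witness(x, y, z)
            if u == y or not (closer_eq(x, u, y) and closer_eq(z, u, y)):
                return False
    return True

grid = list(product(range(3), repeat=2))
definition_agrees = check(grid)
print(definition_agrees)
```

The witness works because projecting onto a segment never increases the distance to either endpoint, so the nearest point of XZ to Y lies in both closed balls while differing from Y.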





Figure 7.22.

Figure 7.23.

Furthermore, the quaternary shorter-than relation $\le$ is definable in terms
of $\equiv$ as follows:

$\|XY\| \le \|ZU\| \Leftrightarrow \forall V (ZV \equiv UV \to \exists W (XW \equiv YW \wedge YW \equiv UV))$,

meaning that if there is an isosceles triangle with base $|ZU|$ and a given side,
then an isosceles triangle with the same side and base $|XY|$ exists too (see Fig.
7.23). It follows that $\equiv$ can be taken as the only primitive for the elementary
geometry.

Regarding primitive relations with smaller arities, for every $n \ge 2$, no binary
relation can be rich for $\mathbb{R}^n$ (see Beth and Tarski, 1956). Also, as noted earlier,
the relation $B$ alone is not sufficient for the whole of elementary geometry in
$\mathbb{R}^2$. Still, Beth and Tarski, 1956 show that the ternary relation
$E$, where $E(XYZ)$ means that the configuration $XYZ$ forms an equilateral
triangle (or, in the degenerate case, that the points $X$, $Y$ and $Z$ coincide), is rich for
every $\mathbb{R}^n$ with $n \ge 3$, by expressing Pieri's relation in terms of $E$ in the system
$\mathbb{R}^n$. However, they also show that the relation $E$ alone is not sufficient for
the geometries $\mathbb{R}^n$ with $n = 1$ or $n = 2$. Similar results hold for the ternary
relation specifying that three points determine a right-angled isosceles triangle,
and the quaternary relation specifying that four points are the vertices of a
square. However, it was shown (see Beth and Tarski, 1956, Scott, 1956) that
the completely symmetric ternary relation $R$, where $R(XYZ)$ means that the
points $X$, $Y$ and $Z$ are distinct and form, in some order, a right-angled triangle,
is rich for every $\mathbb{R}^n$ with $n \ge 2$.

Schwabhäuser and Szczerba, 1975 investigate line relations which can be
taken as primitives for the elementary Euclidean geometry. They establish
simple rich systems of such relations for every $\mathbb{R}^n$, $n \ge 2$. For the dimension-free
Euclidean geometry they show that the binary relation $\perp$ of perpendicularity
together with the ternary relation Cop of co-punctuality suffice. For
dimensions higher than 3 perpendicularity alone suffices, while for $\mathbb{R}^2$ it does not.
For $\mathbb{R}^2$ perpendicularity and the binary relation Cop of co-punctuality suffice.
Later, Kramer, 1993 proves that $\perp$ alone is not sufficient as a primitive for $\mathbb{R}^3$,
because co-punctuality is not definable there in terms of it. Finally, the question
of primitive geometric relations and definability in the case of $\mathbb{R}^1$ turns out to
be rather more complicated; the reader is referred to Tarski and Givant, 1999.


9.1 Tarski's system of elementary geometry based on
$B$ and $\delta$


In the mid 20th century Tarski systematically developed an axiomatic system
for the elementary geometry based on the only primitive concept of point, and
the two primitive relations betweenness $B$ and equidistance $\equiv$ (or $\delta$). Over
many years, Tarski and his students refined, simplified, and minimized that
system, and a detailed account of that development can be found in Tarski and
Givant, 1999, which we follow here for the choice of axioms and notation.
Again, all axioms are implicitly universally quantified over all occurring free
variables.


Ax.1: REFLEXIVITY OF EQUIDISTANCE.

$X_1X_2 \equiv X_2X_1$

Ax.2: TRANSITIVITY OF EQUIDISTANCE.

$(X_1X_2 \equiv Y_1Y_2 \wedge X_1X_2 \equiv Z_1Z_2) \to Y_1Y_2 \equiv Z_1Z_2$

Ax.3: IDENTITY OF EQUIDISTANCE.

$XY \equiv ZZ \to X = Y$

Ax.4: EQUAL SEGMENTS CONSTRUCTION. (See Fig. 7.24) There is a
segment of length $\|Y_1Y_2\|$ beginning at $X_1$ in the direction of $\overrightarrow{ZX_1}$:

$\exists X_2 (B(ZX_1X_2) \wedge X_1X_2 \equiv Y_1Y_2)$





Ax.5: FIVE-SEGMENT AXIOM. (See Fig. 7.25) The corresponding line
segments built on two congruent triangles are equal:

\[
\begin{aligned}
& (X \neq Y \wedge B(XYZ) \wedge B(X'Y'Z') \wedge XY \equiv X'Y' \wedge YZ \equiv Y'Z' \\
& \wedge\, XW \equiv X'W' \wedge YW \equiv Y'W') \to ZW \equiv Z'W'
\end{aligned}
\]

Ax.$7_1$: (OUTER FORM OF) PASCH AXIOM. (See Ax.B5 above.)

$B(XY'Z) \wedge B(YZ'Y') \to \exists X' (B(ZX'Y) \wedge B(XZ'X'))$





Ax.$8^{(2)}$: LOWER 2-DIMENSIONAL AXIOM

$\exists X \exists Y \exists Z (\neg B(XYZ) \wedge \neg B(YZX) \wedge \neg B(ZXY))$





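Ax.$8^{(n)}$: LOWER n-DIMENSIONAL AXIOM FOR $n \ge 3$

\[
\begin{aligned}
\exists U \exists V \exists W \exists X_1 \ldots \exists X_{n-1} \Big( & \mathrm{Diff}_{n-1}(X_1, \ldots, X_{n-1})
\wedge \neg B(UVW) \wedge \neg B(VWU) \wedge \neg B(WUV) \\
& \wedge \bigwedge_{i=2}^{n-1} UX_1 \equiv UX_i \wedge \bigwedge_{i=2}^{n-1} VX_1 \equiv VX_i \wedge \bigwedge_{i=2}^{n-1} WX_1 \equiv WX_i \Big)
\end{aligned}
\]

The axiom Ax.$8^{(n)}$ claims that there exist $n - 1$ distinct points $X_1, \ldots,
X_{n-1}$, and three non-collinear points $U$, $V$, $W$, each of them equidistant
from $X_1, \ldots, X_{n-1}$, which implies that the dimension of the space is at
least $n$. Using these axioms, one can express that the dimension of the
space is $n$:

$\mathrm{Dim}_n := \mathrm{Ax.8}^{(n)} \wedge \neg \mathrm{Ax.8}^{(n+1)}$.

Ax.$10_1$: (FORM OF) EUCLID'S AXIOM. (See the axiom Ax.E above.)

$Z \neq V \wedge B(ZVT) \wedge B(UVW) \to \exists X \exists Y (B(ZUX) \wedge B(ZWY) \wedge B(YTX))$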

















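Ax.11: SECOND-ORDER CONTINUITY AXIOM. (See also Sec. 8 above.)

\[
\exists Y\, \forall X_1 \in \mathcal{X}_1\, \forall X_2 \in \mathcal{X}_2\, B(Y X_1 X_2)
\to \exists Z\, \forall X_1 \in \mathcal{X}_1\, \forall X_2 \in \mathcal{X}_2\, B(X_1 Z X_2)
\]

This axiom says that if all elements of the set $\mathcal{X}_1$ precede all elements of
the set $\mathcal{X}_2$ on a line, then there is a point on that line which separates $\mathcal{X}_1$
and $\mathcal{X}_2$. This property is not definable in the first-order language of $B$
and $\equiv$. Its first-order approximation is the corresponding axiom schema.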


As.11: CONTINUITY AXIOM SCHEMA. See As.B9 above.

Ax.15: INNER TRANSITIVITY AXIOM FOR BETWEENNESS.

$B(XYW) \wedge B(YZW) \to B(XYZ)$

Figure 7.24.

Figure 7.25.


THEOREM 7.43 (TARSKI, 1967) The set of axioms Ax.1 to Ax.5, Ax.$7_1$,
Ax.$10_1$, Ax.15 plus $\mathrm{Dim}_n$, taken with the second-order axiom Ax.11,
characterizes up to isomorphism the full n-dimensional Euclidean geometry $FG^{(n)}$
for every $n \ge 2$. Furthermore, if As.11 is taken instead of Ax.11, the resulting
first-order axiomatic system is a complete axiomatization of the first-order
theory of the full n-dimensional Euclidean elementary geometry $EG^{(n)}$ for any
$n \ge 2$. If, moreover, the dimension axiom $\mathrm{Dim}_n$ is omitted, then,
according to Scott, 1959, a complete axiomatization of the dimension-free Euclidean
geometry is obtained.
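The five-segment axiom Ax.5 listed above can be evaluated on concrete rational points. The sketch below assumes the standard coordinate reading of the primitives (equidistance as equality of squared distances, betweenness as lying on a closed segment); the function names and the sample configuration are chosen for illustration only. It also shows why the hypothesis X ≠ Y is there: dropping it admits a counterexample.

```python
from fractions import Fraction

def sqdist(p, q):
    return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2

def eqd(a, b, c, d):
    """Equidistance: segment ab is as long as segment cd."""
    return sqdist(a, b) == sqdist(c, d)

def between(a, b, c):
    if a == c:
        return b == a
    dx, dy = c[0] - a[0], c[1] - a[1]
    ex, ey = b[0] - a[0], b[1] - a[1]
    if dx * ey - dy * ex != 0:
        return False
    return 0 <= dx * ex + dy * ey <= dx * dx + dy * dy

def hypotheses(x, y, z, w, x1, y1, z1, w1, require_distinct=True):
    """Antecedent of Ax.5; require_distinct toggles the conjunct X != Y."""
    return ((x != y or not require_distinct)
            and between(x, y, z) and between(x1, y1, z1)
            and eqd(x, y, x1, y1) and eqd(y, z, y1, z1)
            and eqd(x, w, x1, w1) and eqd(y, w, y1, w1))

def five_segment(x, y, z, w, x1, y1, z1, w1, require_distinct=True):
    """Truth value of the instance of Ax.5 at the given eight points."""
    hyp = hypotheses(x, y, z, w, x1, y1, z1, w1, require_distinct)
    return (not hyp) or eqd(z, w, z1, w1)

def move(p):
    """A rigid motion of Q^2: rotation with cos = 3/5, sin = 4/5, then a shift."""
    c, s = Fraction(3, 5), Fraction(4, 5)
    return (c * p[0] - s * p[1] + 1, s * p[0] + c * p[1] - 2)

cfg = ((0, 0), (1, 0), (3, 0), (2, 5))
copy = tuple(move(p) for p in cfg)
nonvacuous = hypotheses(*cfg, *copy)
holds_for_copy = five_segment(*cfg, *copy)

# With X = Y the remaining hypotheses no longer pin down the position of Z.
o = (0, 0)
degenerate = (o, o, (1, 0), (1, 0), o, o, (0, 1), (1, 0))
fails_without_guard = not five_segment(*degenerate, require_distinct=False)
holds_with_guard = five_segment(*degenerate)
print(nonvacuous, holds_for_copy, fails_without_guard, holds_with_guard)
```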


9.2 Decision methods and automated reasoning for 
elementary geometry 


The algebraic approach in geometry goes back at least to Descartes, who
introduced the coordinate method in his study of geometry. The first modern
development of general algebraic methods used in constructive solutions to
classes of geometric problems in affine geometry is due to Hilbert, 1950.
However, the first explicit decision method for elementary Euclidean geometry, i.e.
a general method for deciding the truth of any first-order sentence in this
geometry, was developed in Tarski, 1951. Tarski's decision method is based on a
decision procedure for the first-order theory of the field of real numbers, which
is also the first-order theory of the class of real-closed fields. For that theory
Tarski established quantifier elimination, i.e. he proved that every first-order
formula in the language of real-closed fields is equivalent, over
the class of real-closed fields, to a boolean combination of algebraic equations
and inequalities. Equations are simply conjunctions of inequalities, and every
inequality can be expressed in the form $t \ge 0$ for some term $t$ in the first-order
language of rings (i.e. $t$ will be a polynomial). Therefore, eventually, every
first-order formula of that language is equivalent, over the class of real-closed
fields, to a boolean combination of formulae of the type $\exists x (x^2 = t)$, where $t$ is
a term not containing $x$. Subsets of $\mathbb{R}^n$ definable by such formulae are called
semi-algebraic sets. In particular, Tarski's result implies that the parametrically
first-order definable relations in $\mathbb{R}^n$ are precisely the semi-algebraic sets of $\mathbb{R}^n$.
For a sketch of an algebraic proof of this result based on Sturm's theorem, see
e.g. Hodges, 1993.
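As a small worked illustration of the reductions just described (the symbols $s$, $t$, $b$, $c$ below are generic terms and parameters chosen for the example, not notation from the source), equations and strict inequalities reduce to the single pattern $\exists x (x^2 = t)$, and the solvability of a quadratic is a familiar instance of quantifier elimination over real-closed fields:

```latex
\begin{align*}
s = t \;&\Longleftrightarrow\; (s - t \ge 0) \wedge (t - s \ge 0), \\
t \ge 0 \;&\Longleftrightarrow\; \exists x\, (x^2 = t), \\
t > 0 \;&\Longleftrightarrow\; (t \ge 0) \wedge \neg(-t \ge 0), \\
\exists x\, (x^2 + bx + c = 0) \;&\Longleftrightarrow\; b^2 - 4c \ge 0 .
\end{align*}
```

In the last line the quantified variable $x$ has disappeared: the right-hand side is a polynomial inequality in the parameters $b$ and $c$ alone, so the set of pairs $(b, c)$ it defines is semi-algebraic.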

Tarski’s decision procedure is practically inefficient as it has non-elementary 
complexity. More efficient, elementary decision procedures were developed 
later, first by Monk, followed by Solovay, Collins, and others. 

Currently there are several well-developed and applied automated theorem 
proving decision methods for the first-order theory of the field of reals and the 
theory of elementary geometry. 

Probably the most popular decision method for the theory of real closed fields,
and the first one amenable to practical automation (it has in fact been
implemented), is Collins' method of Cylindrical Algebraic Decompositions (CAD),
based on quantifier elimination (see Caviness and Johnson, 1998, Collins, 1975,
Collins, 1998). Given a boolean combination $\beta$ of algebraic equations and
inequalities, this algorithm computes a so-called cylindrical algebraic
decomposition of the solution set of $\beta$. This cylindrical algebraic decomposition
partitions the solution set of $\beta$ into a finite disjoint union of spatial regions
called cells, which have the property that all polynomials occurring in $\beta$
have constant sign on each of these cells. Cells in $\mathbb{R}^n$ can be defined inductively on $n$ as
follows: a cell in $\mathbb{R}$ is an open interval or a singleton; a cell in $\mathbb{R}^{n+1}$ is either the
graph of a continuous function defined over a cell in $\mathbb{R}^n$, or a region bounded
between the graphs of two such functions $f$ and $g$ defined on the
same cell in $\mathbb{R}^n$, and such that $f < g$ on that cell. In particular, each of these
functions can be taken as $-\infty$ or $+\infty$.
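The base case of the inductive definition of cells can be made concrete. The following is a deliberately simplified sketch of the one-dimensional situation only, not Collins' algorithm: it assumes the real roots of the polynomials are already known (here they are rational and supplied by hand), builds the induced cells of the real line, and reads off the constant sign of each polynomial on each cell from one sample point. All names are illustrative.

```python
from fractions import Fraction as Q

def cells_1d(roots):
    """Cells of R cut out by finitely many points: singletons and open intervals.
    None stands for an infinite endpoint."""
    cells, lo = [], None
    for r in sorted(set(roots)):
        cells.append(("interval", lo, r))
        cells.append(("point", r, r))
        lo = r
    cells.append(("interval", lo, None))
    return cells

def sample(cell):
    """One sample point inside the cell."""
    kind, lo, hi = cell
    if kind == "point":
        return lo
    if lo is None and hi is None:
        return Q(0)
    if lo is None:
        return hi - 1
    if hi is None:
        return lo + 1
    return (lo + hi) / 2

def sign(v):
    return (v > 0) - (v < 0)

# Polynomials p1 = x^2 - 1 and p2 = x - 1/2, with their roots given as an assumption.
polys = [lambda x: x * x - 1, lambda x: x - Q(1, 2)]
roots = [Q(-1), Q(1), Q(1, 2)]

cells = cells_1d(roots)
signs = [[sign(p(sample(c))) for p in polys] for c in cells]

# Deciding "exists x (x^2 - 1 < 0 and x - 1/2 > 0)" reduces to a finite search.
exists_solution = any(s == [-1, 1] for s in signs)
for c, s in zip(cells, signs):
    print(c, s)
print(exists_solution)
```

Because each polynomial keeps its sign on every cell, an existential statement about the polynomials only needs to be checked at the finitely many sample points; the full method lifts this idea dimension by dimension.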

Collins' algorithm has a double exponential worst-case time complexity as a
function of the number of variables in $\beta$. Later, Heintz et al., 1990 and Renegar,
1992 constructed algorithms for quantifier elimination that are double
exponential only in the number of quantifier alternations; see also Davenport and
Heintz, 1988. Currently the most efficient algorithms for quantifier elimination
known to us can be found in Basu et al., 1996 and Basu, 1999; the latter employs
uniform quantifier elimination.

The Characteristic Set method, rooted in work by Ritt and later developed
independently by Wu (see Chou, 1984, Chou, 1988, Chou, 1990, Chou and
Gao, 1990, Wu, 1984, Wu, 1986), and the Gröbner Basis method, developed
by Buchberger (see Buchberger, 1985, Buchberger et al., 1988, Chou, 1990),
work only on problems that can be formalized by systems of equations, and
are only complete for algebraically closed fields. A related alternative method,
based on Hilbert's Nullstellensatz, has been proposed by Kapur, 1986.

Another method, based on ideas coming from quantifier elimination in linear
and quadratic formulae over the reals, has been proposed in Dolzmann et al.,
1998. Unlike the Characteristic Set and Gröbner Basis methods, it is also
applicable to geometric problems in the Euclidean plane and Euclidean n-space
whose complex analogues may fail.

Chou, 1984 shows how the Wu-Ritt method of characteristic sets can be
applied to finding locus equations, and in Chou, 1987 it is also shown how this
method can be used for the mechanical derivation of formulae in elementary
geometry. All of these methods require large computational resources and can
easily become infeasible for more complex formulae. For an overview of
automated reasoning in geometry see Chou and Gao, 1990.


This concludes our discussion of first-order theories of geometry. 


10. Modal logic and spatial logic 


In the remainder of this chapter we will survey some modal logics related to 
classical geometric structures, and from now on we assume some familiarity 
with basic modal logic and Kripke semantics. 

Modal logics related to spatial structures are also considered in Ch. 5 and 
Ch. 9. The former is mainly oriented towards the topological interpretation of 
modal logic, whereas the latter deals with the combination of spatial logics and 
temporal logics. 


Basic modal logic. In order to fix the notation and terminology of basic
modal logic we give a short list of definitions and facts. For all notions
mentioned without definition the reader is invited to consult Blackburn et al.,
2001 or Hughes and Cresswell, 1996.

Let $C$ be a class of relational structures of the form $F = (W, R_1, \ldots, R_n)$,
where $W$ is a nonempty set whose elements are usually called possible worlds,
and $R_1, \ldots, R_n$ are binary relations on $W$ called accessibility relations. In
modal logic such relational structures are called Kripke frames. We associate
with $C$ a modal language $L$ which is an extension of the standard language of
propositional logic with unary connectives $[R_i]$, $i = 1, \ldots, n$, called modal box
operators, with the standard definition of a formula, given by the rule:

$A ::= p \mid \neg A \mid (A \vee B) \mid [R_1]A \mid \ldots \mid [R_n]A.$


We use the classical abbreviations for "true" ($\top$), "false" ($\bot$), conjunction
($\wedge$), implication ($\to$), and equivalence ($\leftrightarrow$). We also use the dual, diamond
operators $\langle R_i \rangle$, defined by $\langle R_i \rangle A := \neg [R_i] \neg A$. Standard modal logic has
only one box-modality $\Box$ called "necessity", and the corresponding diamond
modality $\Diamond$ is called "possibility".

The semantics of $L$ in a given frame $F = (W, R_1, \ldots, R_n)$ is based on the
notion of valuation on $F$, which is a function $V$ assigning to each proposition
letter $p$ a subset $V(p)$ of $W$. Intuitively, we think of $V(p)$ as the set of possible
worlds in which $p$ is true. A pair $M = (F, V)$ where $V$ is a valuation on
$F$ is called a Kripke model based on $F$. We define the satisfiability relation
$M, w \models A$ (in words, the formula $A$ is satisfied at the possible world $w$ of
the model $M$) as in Ch. 5. In particular we have:

$M, w \models [R_i]A$ iff $M, w' \models A$ for all $w' \in W$ such that $w R_i w'$.


A formula $A$ is true in a Kripke model $M$ iff $M, w \models A$ for all possible worlds
$w$ in $M$. We say that $A$ is valid in a Kripke frame $F$ iff $A$ is true in all models
defined over $F$; $A$ is valid in a class $C$ of Kripke frames iff $A$ is valid in all
Kripke frames of $C$. The set $L(C)$ of all formulas which are valid in $C$ is called
the logic of $C$ and the formulas from $L(C)$ are called the modal laws of $C$. If
$A \in L(C)$ then we write $\vdash_{L(C)} A$.
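The satisfaction clauses above translate directly into a recursive evaluator over a finite Kripke model. This is a minimal sketch: the tuple encoding of formulas and the names `holds`, `dia` and `true_in_model` are choices made for this example.

```python
def holds(model, w, f):
    """M, w |= f for a finite model M = (W, R, V); R maps an index i to a set of pairs."""
    W, R, V = model
    op = f[0]
    if op == "var":
        return w in V[f[1]]
    if op == "not":
        return not holds(model, w, f[1])
    if op == "or":
        return holds(model, w, f[1]) or holds(model, w, f[2])
    if op == "box":
        i, body = f[1], f[2]
        return all(holds(model, v, body) for v in W if (w, v) in R[i])
    raise ValueError(f"unknown connective: {op}")

def dia(i, a):
    """The dual operator, defined as not-box-not."""
    return ("not", ("box", i, ("not", a)))

def true_in_model(model, f):
    """Truth in a model: satisfaction at every possible world."""
    return all(holds(model, w, f) for w in model[0])

W = {0, 1, 2}
R = {1: {(0, 1), (1, 2)}}      # one accessibility relation: 0 -> 1 -> 2
V = {"p": {1, 2}}
M = (W, R, V)

p = ("var", "p")
box_p = ("box", 1, p)
print(holds(M, 0, box_p))        # the only successor of 0 satisfies p
print(holds(M, 2, box_p))        # world 2 has no successors, so the box holds vacuously
print(holds(M, 2, dia(1, p)))    # and the diamond fails there
print(true_in_model(M, box_p))
```

Worlds without successors satisfy every box formula and no diamond formula, which the example at world 2 makes visible.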

The above is a semantic definition of a modal logic related to a class C of 
frames. Note that one and the same modal logic may be the logic of different 
classes of frames. 

A class $C$ of frames of the form $(W, R_1, \ldots, R_n)$ is modally definable if there
exists a formula $A$ such that for every frame $F$ of the form $(W, R_1, \ldots, R_n)$,
$F$ is in $C$ iff $A$ is valid in $F$.

If this is true then we also say that $C$ is modally definable by $A$. If the class
$C$ is definable by a first-order condition $\varphi$ on the relations $R_i$ then we also say
that $\varphi$ is modally definable by $A$. For instance, reflexivity of a relation $R$ is
definable by $[R]p \to p$, symmetry of $R$ is definable by $p \to [R]\langle R \rangle p$, and
transitivity of $R$ is definable by $[R]p \to [R][R]p$, where $p$ is a propositional
variable. So modal definability is in some sense a way to talk about properties
of Kripke frames by means of a propositional language. Let us note that not
all first-order properties of Kripke frames are modally definable and that not all
modal formulas define a first-order property.
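The three correspondences mentioned above can be confirmed by brute force on small frames. The sketch below enumerates every binary relation on a three-element set and every valuation of the single letter p; it treats the diamond as a primitive connective for brevity, and all function names are illustrative.

```python
from itertools import product

def holds(R, V, w, f):
    op = f[0]
    if op == "var":
        return w in V[f[1]]
    if op == "imp":
        return (not holds(R, V, w, f[1])) or holds(R, V, w, f[2])
    if op == "box":
        return all(holds(R, V, v, f[1]) for (u, v) in R if u == w)
    if op == "dia":
        return any(holds(R, V, v, f[1]) for (u, v) in R if u == w)
    raise ValueError(op)

def subsets(W):
    W = list(W)
    for bits in product([False, True], repeat=len(W)):
        yield {w for w, b in zip(W, bits) if b}

def valid(W, R, f):
    """Validity in the frame (W, R): truth at every world under every valuation of p."""
    return all(holds(R, {"p": S}, w, f) for S in subsets(W) for w in W)

def reflexive(W, R):
    return all((w, w) in R for w in W)

def symmetric(W, R):
    return all((v, u) in R for (u, v) in R)

def transitive(W, R):
    return all((u, x) in R for (u, v) in R for (v2, x) in R if v == v2)

p = ("var", "p")
T = ("imp", ("box", p), p)                       # [R]p -> p
B = ("imp", p, ("box", ("dia", p)))              # p -> [R]<R>p
FOUR = ("imp", ("box", p), ("box", ("box", p)))  # [R]p -> [R][R]p

W = [0, 1, 2]
pairs = list(product(W, repeat=2))
correspondence_ok = True
for bits in product([False, True], repeat=len(pairs)):
    R = {pr for pr, b in zip(pairs, bits) if b}
    correspondence_ok &= valid(W, R, T) == reflexive(W, R)
    correspondence_ok &= valid(W, R, B) == symmetric(W, R)
    correspondence_ok &= valid(W, R, FOUR) == transitive(W, R)
print(correspondence_ok)
```

Frame validity quantifies over all valuations, which is why the check loops over every subset of W; truth in a single model would not be enough to detect the frame property.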

Axiomatically, a modal logic is defined as the smallest set of formulas
containing a given set of axioms and closed with respect to a given set of inference
rules. The elements of a modal logic $L$ are called theorems of $L$. If $A$ is a
theorem of $L$ then we write $\vdash_L A$. For instance, the modal logic $K_n$ of the class
of all Kripke frames of the form $(W, R_1, \ldots, R_n)$ has the following axiomatic
definition:

Axioms: all substitution instances of classical tautologies, and all formulas
of the form $[R_i](A \to B) \to ([R_i]A \to [R_i]B)$ for $i = 1, \ldots, n$.

Inference rules of $K_n$: Modus ponens "given $A$ and $A \to B$, derive $B$"
and generalization "given $A$, derive $[R_i]A$".

Axiomatic definitions of other logics can be obtained by adding to the above
axiomatic system additional axioms and possibly additional rules of inference.
If the axiomatic system does not contain additional inference rules it is called
normal. For instance, the logic S4 is an extension of the logic $K_1$ with the axiom
schemes $\Box A \to A$ (defining reflexivity of $R$) and $\Box A \to \Box\Box A$ (defining the
transitivity of $R$). The logic S5 is an extension of S4 with the axiom scheme
$A \to \Box\Diamond A$ (defining the symmetry of $R$). The statement of the equivalence of
a given semantic definition of a modal logic with a given axiomatic definition is
called a completeness theorem with respect to the corresponding class of frames.
There are different methods for proving completeness theorems. One of them is
the so-called method of canonical models. An important theorem related to this
method is the famous Sahlqvist theorem saying that if the axioms of the logic are
of a given specified form, then these axioms define first-order conditions, and the
logic is canonically complete in the class of frames satisfying these conditions.
For instance, the axioms of the logics S4 and S5 are of Sahlqvist's type and
hence are "canonical", so S4 is complete in the class of all pre-orders and S5
is complete in the class of all equivalence relations. Furthermore, it is known
that S5 is also complete for the class of all frames with $R = W \times W$, the
universal relation in $W$. Another method for proving completeness theorems
is based on the notion of bounded morphisms. By means of this method one
can prove, for instance, that two different classes of frames, $C_1$ and $C_2$, define
equal logics $L(C_1) = L(C_2)$. If, by some method (for instance by the method
of canonical models), one can give a complete axiomatization of $L(C_2)$, then
one automatically obtains a completeness theorem with respect to the class $C_1$.
For the method of canonical models, bounded morphisms, Sahlqvist's theorem
and some other methods see Blackburn et al., 2001.

The modal logics described above contain only unary modal operations.
There are also modal logics with binary and, in general, n-ary modal
operations, called polyadic modalities, with Kripke semantics using relations of
arbitrary finite arity. If, for instance, $A \circ B$ is a binary modality then it can be
interpreted in frames with a ternary relation $R \subseteq W^3$ as follows:

$M, w \models A \circ B$ iff there exist $w'$ and $w''$ such that $Rww'w''$, $M, w' \models A$
and $M, w'' \models B$.


Modal logic and applied modal logic. A major aim of modal logic is to
study modal logics of different classes of frames, mainly with respect to modal
definability, axiomatization, decidability and complexity.

The broad applicability of modal logics rests, inter alia, on the fact that, while 
they are based on propositional languages, every modal formula corresponds 
in terms of frame validity to a universal monadic second-order formula, and 
thus can be used to express properties of relational structures. An important 
discipline of modal logic related to this issue—correspondence theory (see van 
Benthem, 1984)—is mostly about using modal formulas to define classes of 
relational structures. Thus, as noted in Blackburn et al., 2001, modal languages 
are simple yet expressive languages for talking about relational structures. 

Another reason for interest in some modal logics is that they represent 
tractable, decidable fragments of first- or second-order logic, which makes 
them computationally significant. The quest for computational efficiency has 
recently stimulated active research on the complexity of modal logic. 

An important feature of the modal approach is that modal logic presents
formal methods of reasoning based on modal operators specific to given
practical domains. Often, the linguistic meaning of these operators, coming from
their use in everyday language, is quite imprecise; however, giving the
exact semantics for the corresponding logic supplies these modalities with exact
meaning. The complete axiomatization with respect to a given formal semantics
presents a formal system for reasoning in the corresponding semantic domain,
and the completeness theorem with respect to the given interpretation can be
considered as a tool for establishing the adequacy of the proposed semantics.

In summary, applied modal logic is a general concept covering modal systems
naturally arising from various practical domains. The machinery of applied
modal logic contains all tools developed so far in modal logic. Very often the
analysis of a new area of application of modal logic requires the invention of
new methods, stimulating in this way the general development of the field.


Spatial modal logics. One of the origins of the classical modal logic
of necessity and possibility is the analysis of the meaning of these
modalities in natural languages. There are many other modes of truth which
can be treated as different kinds of modalities: time modalities, space
modalities, knowledge modalities, deontic modalities and so on. Examples of time
modalities are: always, sometimes, always in the future, always in the past, at
the next moment, tomorrow, since, until, etc. Examples of space modalities,
related to geometrical relations in the space, are: everywhere, everywhere else,
somewhere, somewhere else, near, far, on the left, on the right, on the top, in
the middle, between, parallel, etc. Although geometry, as a mathematical
theory of space, is one of the oldest branches of mathematics, and although the
theory of time is not even a mathematical discipline, the logic of time is a much
better-established branch of modal logic than the modal logic of space. One
explanation of this fact, as noted in Balbiani, 1998, Venema, 1999, is rooted in
the use of temporal logic in computer science, especially in program
verification and specification, concurrent programming and databases. Another reason
is probably the simpler mathematical structure of time, very suitable for a
modal treatment by Kripke semantics: a set of moments of time together with
a precedence relation between moments. In contrast, the structure of space is
much more complex. For instance, the structure of classical Euclidean
geometry consists of several sorts of objects (e.g. points, lines and planes) with
various relations involving them (e.g. collinearity and betweenness in
the set of points, parallelism, concurrence and orthogonality in the set of lines),
and the intersort relations of incidence between different sorts of objects. At
first sight, many-sorted mathematical structures are not suitable for modal
treatment in the sense described above, because the standard Kripke semantics is
based on one-sorted structures. But as we shall see later, the modal approach
has been extended to many-sorted geometric structures.

Recently, in connection with new directions in artificial intelligence and 
information science, such as geometrical information systems and qualitative 
spatial reasoning (Cohn and Hazarika, 2001), the application of logic and in 
particular of modal logic to the theory of space has become more popular, and 
this has stimulated the development of a new branch of applied modal logic, 
commonly called spatial modal logic. 

Lemon and Pratt, 1998 produced a criterion by which one can judge the
spatial character of a modal logic and observed in the light of their criterion that
several of the existing modal logics of space were not spatial at all. According
to Lemon and Pratt, a spatial modal logic is one whose models are based on 
mathematical models of space. Obviously, affine geometry and projective ge- 
ometry and some of their fragments constitute mathematical models of space 
par excellence. Some of these geometrical structures considered as first-order 
systems are studied in the first part of this chapter. In this second part we include 
some modal logics with semantics based on them. In this chapter we will neither 
consider spatial modal logics based on interpretations of modal languages in 
topology and metric spaces, nor logics based on the primitive notion of a spatial 
region and some spatial relations between regions like contact, part-of, overlap 
etc., nor modal logics related to the relativistic interpretation of 4-dimensional 
space-time. The reader can find treatments of such logics in other chapters of 
this book. 


11. Point-based spatial logics 


In this section we will consider modal logics related to structures based on a
set of points.


The logic of elsewhere and everywhere. One of the first modal logics
with explicit spatial interpretation is the logic of "elsewhere" introduced by
von Wright, 1979. Under the reading of box given by von Wright, $\Box A$ means
"everywhere else it is the case that $A$". Thus, the modal logic of "elsewhere"
may be formally identified with the validities in the class of all Kripke frames
$F = (W, R)$ in which $R$ is the difference relation: $\forall x \forall y (xRy \leftrightarrow x \neq y)$. That
is why the box and the diamond of von Wright's logic are usually written $[\neq]$ and
$\langle \neq \rangle$. This is not a typical spatial relation, because difference can be considered
in any set of objects. But what really made von Wright's box popular is the
observation that enriching modal languages with $[\neq]$ greatly increases their
expressive power, as shown in Goranko, 1990, de Rijke, 1992, Venema, 1993.

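The logic of elsewhere can be axiomatized by adding to K the following
axioms (von Wright, 1979):

• $A \to [\neq]\langle \neq \rangle A$

• $\langle \neq \rangle\langle \neq \rangle A \to (A \vee \langle \neq \rangle A)$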


Another example of a simple modal logic with a spatial interpretation is the
logic S5 considered by Carnap (see Carnap, 1947) as the logic of all structures
$(W, R)$ with universal relation $R$:

(U) $\forall x \forall y\, xRy$.

This gives the following spatial reading of $\Box A$: "everywhere $A$".

The condition (U) motivates writing the box of Carnap's logic
as $[U]$, and the diamond as $\langle U \rangle$. Note that $[U]A$ is definable in the logic
of elsewhere: $[U]A := A \wedge [\neq]A$. Carnap's reading of box has attracted
the attention of many logicians, including Goranko and Passy, 1992 and
Spaan, 1993.
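The relationship between "everywhere" and "everywhere else" is easy to check on a finite set of points. The sketch below works directly with truth sets (subsets of W) rather than formulas; the function names are illustrative. Besides the definability of the universal box, it confirms that two familiar principles, $A \to [\neq]\langle \neq \rangle A$ and $\langle \neq \rangle\langle \neq \rangle A \to (A \vee \langle \neq \rangle A)$, hold under every valuation on a difference frame.

```python
from itertools import combinations

def subsets(W):
    W = list(W)
    return [set(c) for r in range(len(W) + 1) for c in combinations(W, r)]

def box_diff(W, A):
    """Truth set of [!=]A: A holds at every other point."""
    return {w for w in W if all(v in A for v in W if v != w)}

def dia_diff(W, A):
    """Truth set of <!=>A: A holds at some other point."""
    return {w for w in W if any(v in A for v in W if v != w)}

def box_univ(W, A):
    """Truth set of [U]A: A holds at every point."""
    return {w for w in W if all(v in A for v in W)}

W = set(range(4))

# [U]A coincides with A and [!=]A, for every truth set A.
definable = all(box_univ(W, A) == (A & box_diff(W, A)) for A in subsets(W))

# A -> [!=]<!=>A and <!=><!=>A -> (A or <!=>A), checked as inclusions of truth sets.
symmetry_ok = all(A <= box_diff(W, dia_diff(W, A)) for A in subsets(W))
pseudo_transitivity_ok = all(
    dia_diff(W, dia_diff(W, A)) <= (A | dia_diff(W, A)) for A in subsets(W)
)
print(definable, symmetry_ok, pseudo_transitivity_ok)
```

The converse direction fails: "everywhere else" cannot be recovered from "everywhere" alone, since the universal box does not distinguish the current point from the others.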








Collinearity and qualitative distance. Collinearity of points is one of the
basic ternary relations between points. Stebletsova, 2000 considers the ternary
relation of collinearity between points in projective geometry: $\mathrm{Col}(X, Y, Z)$
iff $X$, $Y$ and $Z$ all lie on a single line. She studies the spatial logic based on
Col in any projective geometry of finite dimension $d \ge 2$. The ternary relation
Col is used to interpret the binary modality $\circ$ as follows:

$M, w \models A \circ B$ iff for some $w' \in W$ and for some $w'' \in W$ with
$\mathrm{Col}(w, w', w'')$ we have $M, w' \models A$ and $M, w'' \models B$.





In this setting, the following formulas are valid:

• $A \circ (B \circ C) \to (A \circ B) \circ C$,

• $A \circ B \to B \circ A$,

• $A \to A \circ A$, and

• $\langle U \rangle A \wedge \langle U \rangle B \to \langle U \rangle (A \circ B)$

where $\langle U \rangle$ is the existential modality between points defined by $\langle U \rangle A := \top \circ A$.
Given a finite dimension $d \ge 2$, validity in the class of all projective geometries
of dimension $d$ can be axiomatized with a Gabbay-type inference rule (see
Gabbay, 1981), but it is not known whether such rules can be replaced by a
finite set of additional axioms.

The modal logic of collinearity in projective geometry is rather expressive.
For instance, for all finite dimensions d > 2, there exists a formula in the
basic modal language defined above that characterizes exactly those projective
spaces of dimension d satisfying the property of Pappus (see Sec. 6.3).
Using the fact that Pappus’ theorem holds in any finite projective geometry of
finite dimension d > 3, Stebletsova, 2000 has shown that this logic lacks the
finite model property: there exist satisfiable formulas that cannot be satisfied
in finite models. What is more, for any finite dimension d > 3, the satisfiability
problem in the class of all projective geometries of dimension d is undecidable.
See Stebletsova, 2000 for further details.
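The semantics of the binary modality $\circ$ can be tried out on a concrete finite model. The sketch below is an illustration only, not taken from Stebletsova, 2000: it assumes the seven-point Fano plane as the underlying projective plane, identifies formulas with their truth sets, and tests the four valid formulas listed earlier on randomly sampled valuations. The names `col`, `circ`, `diamond_u` and `check_sample` are hypothetical.

```python
import random

POINTS = range(7)
# Lines of the Fano plane; the labels 0..6 are an arbitrary choice.
LINES = [{0, 1, 2}, {0, 3, 4}, {0, 5, 6}, {1, 3, 5}, {1, 4, 6}, {2, 3, 6}, {2, 4, 5}]


def col(x, y, z):
    """Col(x, y, z): some line contains x, y and z (repetitions allowed)."""
    return any({x, y, z} <= line for line in LINES)


def circ(a_set, b_set):
    """Truth set of A o B, given the truth sets of A and B."""
    return {w for w in POINTS if any(col(w, a, b) for a in a_set for b in b_set)}


def diamond_u(a_set):
    """Truth set of <U>A, defined as T o A."""
    return circ(set(POINTS), a_set)


def check_sample(trials=200, seed=0):
    """Test the four formulas on randomly sampled truth sets (not exhaustive)."""
    rng = random.Random(seed)

    def pick():
        return {p for p in POINTS if rng.random() < 0.4}

    for _ in range(trials):
        A, B, C = pick(), pick(), pick()
        assert circ(A, circ(B, C)) <= circ(circ(A, B), C)   # A o (B o C) -> (A o B) o C
        assert circ(A, B) <= circ(B, A)                      # A o B -> B o A
        assert A <= circ(A, A)                               # A -> A o A
        assert diamond_u(A) & diamond_u(B) <= diamond_u(circ(A, B))
    return True
```

A passing sample is of course no proof of validity; it only confirms that the reconstruction of the truth condition behaves as expected on this one plane.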

Another interesting ternary spatial relation is the relation $N(x, y, z)$ of
qualitative distance between points, with the intuitive reading “y is nearer to x than
z” (van Benthem, 1983). Its most obvious properties in the real plane may be
formulated as follows:


Transitivity: $\forall x\forall y\forall z\forall t\,(N(x,y,z) \wedge N(x,z,t) \to N(x,y,t))$,
Irreflexivity: $\forall x\forall y\,\neg N(x,y,y)$,
Almost-connectedness: $\forall x\forall y\forall z\forall t\,(N(x,y,z) \to N(x,y,t) \vee N(x,t,z))$,
Selfishness: $\forall x\forall y\,(x \neq y \to N(x,x,y))$,
Triangle inequality: $\forall x\forall y\forall z\,(N(x,y,z) \wedge N(z,x,y) \to N(y,x,z))$.


It is known that N can serve as the basis of elementary plane Euclidean geometry
(see Tarski, 1956). Nevertheless, no complete modal spatial logic has been
developed so far with Kripke semantics based on that relation (see Aiello and
van Benthem, 2002 for further discussion).
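The five properties of N listed above can be tested mechanically on sample points. The sketch below assumes points with integer coordinates in the plane and compares squared Euclidean distances, so that no rounding is involved; the function names `nearer` and `check_properties` and the sample points are illustrative choices only.

```python
from itertools import product


def nearer(x, y, z):
    """N(x, y, z): y is strictly nearer to x than z is (squared distances)."""
    def dist2(p, q):
        return (p[0] - q[0]) ** 2 + (p[1] - q[1]) ** 2
    return dist2(x, y) < dist2(x, z)


def check_properties(points):
    """Assert the five listed properties for all tuples drawn from `points`."""
    N = nearer
    for x, y, z, t in product(points, repeat=4):
        assert not (N(x, y, z) and N(x, z, t)) or N(x, y, t)   # transitivity
        assert not N(x, y, y)                                   # irreflexivity
        assert not N(x, y, z) or N(x, y, t) or N(x, t, z)       # almost-connectedness
        assert x == y or N(x, x, y)                             # selfishness
        assert not (N(x, y, z) and N(z, x, y)) or N(y, x, z)    # triangle inequality
    return True


sample = [(0, 0), (1, 0), (0, 2), (3, 1), (-2, 2), (1, 1)]
check_properties(sample)
```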


12. Line-based spatial logics 


In this section we examine some spatial logics based on lines and some 
standard relations between lines: parallelism, orthogonality and intersection 
of lines. 


The logic of parallelism. Recall that $\parallel$ denotes the relation of strict
parallelism. Parallelism frames are structures of the form $(Li, \parallel)$, where Li
is a non-empty set whose elements are called lines and $\parallel$ is the relation of
strict parallelism between lines. That relation satisfies the following first-order
conditions:


■ $\forall x$: $\neg(x \parallel x)$, i.e. no line is parallel to itself,
■ $\forall x, y$: $x \parallel y$ implies $y \parallel x$, i.e. the relation $\parallel$ is symmetric,
■ $\forall x, y, z$: $x \parallel y$ and $y \parallel z$ and $x \neq z$ implies $x \parallel z$, i.e. the relation $\parallel$ is
“pseudo-transitive”.


Frames satisfying all these conditions are called strict models of parallelism
and their class is denoted by $C_{SMP}$. The frames satisfying the second and the
third condition are called pre-models of parallelism and their class is denoted by
$C_{PreMP}$.

Balbiani and Goranko, 2002 consider the modal logic of strict parallelism,
where $[\parallel]A$ means “A is true at all parallel lines”. The semantics, based on
parallelism frames $(Li, \parallel)$, is as expected:

$M, w \models [\parallel]A$ iff for all $w' \in Li$ such that $w \parallel w'$, $M, w' \models A$.

Obviously, the following formulas modally define the class $C_{PreMP}$:


$A \to [\parallel]\langle\parallel\rangle A, \qquad A \wedge [\parallel]A \to [\parallel][\parallel]A.$


We denote by PAR the axiom system obtained by adding these formulas to
the minimal modal logic K. Let us note that these axioms are of Sahlqvist type
and just modally define the class of frames $C_{PreMP}$. Then, by the Sahlqvist
theorem, PAR is sound and complete with respect to $C_{PreMP}$.

Let us remark that PAR and the logic of elsewhere are the same. Hence,
repeating the completeness proof of the logic of elsewhere given in Segerberg,
1981, one can show that PAR is also complete with respect to the strict models
of parallelism $C_{SMP}$. The difficulty of working with strict models of parallelism
is that there is no formula corresponding to the irreflexivity of the relation $\parallel$.
This lack of expressive power of the modal language enables us to show that
the satisfiability problem SAT($C_{SMP}$) is NP-complete (Demri, 1996).

Adding to PAR the following Sahlqvist formulas for all n > 0, we obtain
the axiom system $PAR^{\infty}$:


$(\varphi_0)$ $\langle\parallel\rangle\top$,
$(\varphi_n)$ $\langle\parallel\rangle A_1 \wedge \dots \wedge \langle\parallel\rangle A_n \to \langle\parallel\rangle(\langle\parallel\rangle A_1 \wedge \dots \wedge \langle\parallel\rangle A_n)$.


The formula $\varphi_n$ corresponds to the following first-order property on parallelism
frames:


$(\Phi_n)$ $x \parallel y_1 \wedge \dots \wedge x \parallel y_n \to \exists z\,(x \parallel z \wedge y_1 \parallel z \wedge \dots \wedge y_n \parallel z)$.


Note that $\varphi_n$ is derivable from $\varphi_{n+1}$.
Since the model $F_{n+2} = (\{1, \dots, n+2\}, \neq)$ of strict parallelism consisting
of exactly n + 2 parallel lines validates $\varphi_n$ but does not validate $\varphi_{n+1}$, we infer
that $PAR^{\infty}$ is not finitely axiomatizable. Nevertheless, since $\varphi_n$ is a Sahlqvist
formula for all n > 0, $PAR^{\infty}$ is sound and complete with respect to the class
$C^{\infty}_{PreMP}$ of all pre-models of parallelism satisfying $(\Phi_n)$, for all n = 0, 1, ....
Repeating the line of reasoning suggested by Segerberg, 1981 within the context
of the logic of elsewhere, one can show that $PAR^{\infty}$ is also complete with respect
to the class $C^{\infty}_{SMP}$ consisting of all strict models of parallelism satisfying the
conditions $(\Phi_n)$ for all n = 0, 1, ....
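The role of the frames $F_{n+2}$ in the non-finite-axiomatizability argument can be illustrated at the level of the first-order conditions. The sketch below checks $(\Phi_n)$ directly on the frame whose lines are 1, ..., size and whose parallelism relation is inequality; it works with the first-order correspondents rather than with the modal formulas, and the function name `satisfies_phi` is a hypothetical choice.

```python
from itertools import product


def satisfies_phi(size, n):
    """Check (Phi_n) on the frame ({1, ..., size}, !=)."""
    lines = range(1, size + 1)
    for x in lines:
        for ys in product(lines, repeat=n):
            if all(x != y for y in ys):          # x || y_1, ..., x || y_n
                # look for z with x || z and y_i || z for every i
                if not any(z != x and all(z != y for y in ys) for z in lines):
                    return False
    return True


for n in range(4):
    assert satisfies_phi(n + 2, n)          # F_{n+2} satisfies (Phi_n)
    assert not satisfies_phi(n + 2, n + 1)  # ... but fails (Phi_{n+1})
```

The failure of $(\Phi_{n+1})$ is a simple counting fact: a line together with n + 1 distinct lines parallel to it already exhausts all n + 2 lines, so no further line z is available.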

A more interesting completeness result for the logic $PAR^{\infty}$ is that it is sound
and complete both in the standard parallelism frames in the real plane $\mathbb{R}^2$ and in real
3-dimensional space $\mathbb{R}^3$ with the usual relation of strict parallelism. This shows
that the language of strict parallelism is not expressive enough to distinguish the
standard parallelism frames in $C^{\infty}_{SMP}$. Despite this obvious lack of expressive
power of our modal language, the advantage of our modal approach is that the
decision problem SAT($C^{\infty}_{SMP}$) for satisfiability in $C^{\infty}_{SMP}$ is also NP-complete
(see Balbiani and Goranko, 2002 for the details).


The logic of orthogonality. The relation of orthogonality $\perp$ is another
typical binary relation between lines. We interpret $[\perp]A$ as “A is true at all lines
which lie orthogonal to the current line”. Let us note that in every orthogonality
frame $F = (Li, \perp)$, the binary relation $\parallel$, defined as follows, is an equivalence
relation:


$w \parallel w'$ iff for all lines $w''$, $w \perp w''$ iff $w' \perp w''$.


We consider the class $C_{PQMO}$ of all planar quasi-models of orthogonality
$F = (Li, \perp)$, where $\perp$ is symmetric and 3-transitive, i.e.:


■ $\forall w\forall w'\,(w \perp w' \to w' \perp w)$: symmetry of $\perp$,


■ $\forall w\forall w'\forall w''\forall w'''\,(w \perp w' \wedge w' \perp w'' \wedge w'' \perp w''' \to w \perp w''')$:
3-transitivity of $\perp$.


The class $C_{PMO}$ of planar models of line orthogonality is the class of all frames
$F = (Li, \perp)$ where $\perp$ is irreflexive, symmetric and 3-transitive. Let $C^{\infty}_{PQMO}$
be the class of all planar quasi-models of line orthogonality in which every
equivalence class modulo $\parallel$ is infinite. Similarly, let $C^{\infty}_{PMO}$ be the class of all
planar models of line orthogonality in which every equivalence class modulo $\parallel$
is infinite.

The modal logic ORT based on $C_{PQMO}$ is obtained by adding the following
axioms to K:


■ $A \to [\perp]\langle\perp\rangle A$, $\quad [\perp]A \to [\perp][\perp][\perp]A$.

These axioms are formulas of Sahlqvist type just defining the properties of
symmetry and 3-transitivity of the relation $\perp$, so by the Sahlqvist theorem
ORT is complete in $C_{PQMO}$. Using bounded morphisms one may prove that
the classes $C_{PQMO}$ and $C_{PMO}$ define the same logic, which shows that ORT
is also complete in the class $C_{PMO}$. See Balbiani and Goranko, 2002 for
details.

The logic which corresponds to the class of frames $C^{\infty}_{PQMO}$ is finitely
axiomatizable through the axiom system $ORT^{\infty}$ obtained by adding to ORT the
axiom $\langle\perp\rangle\top$. This axiom system is sound and complete also with respect to
validity in the Euclidean orthogonality plane consisting of all lines in the real
plane together with the usual orthogonality relation (see Balbiani and Goranko,
2002). So the language of orthogonality is not expressive enough to distinguish
the standard orthogonality frame in the class $C^{\infty}_{PMO}$.

Let us note that the formula $[\perp]A \to [\perp][\perp][\perp]A$ is not valid in the Euclidean
orthogonality space consisting of all lines in real 3-dimensional space together
with the usual orthogonality relation. Hence, in contrast with the modal logic of
parallelism, the modal logic of orthogonality is able to distinguish the Euclidean
orthogonality plane and the Euclidean orthogonality 3-dimensional space.
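The failure of 3-transitivity in three dimensions, which underlies this remark, can be seen on a handful of lines. The sketch below makes a simplifying assumption that is not in the text: lines are taken through the origin and represented only by an integer direction vector, so that orthogonality reduces to a vanishing dot product. The names `orth` and `three_transitive` and the sample directions are illustrative.

```python
from itertools import product


def orth(u, v):
    """Orthogonality of two lines through the origin, given direction vectors."""
    return sum(a * b for a, b in zip(u, v)) == 0


def three_transitive(directions):
    """w _|_ x, x _|_ y, y _|_ z implies w _|_ z, over the given sample."""
    return all(orth(w, z)
               for w, x, y, z in product(directions, repeat=4)
               if orth(w, x) and orth(x, y) and orth(y, z))


plane = [(1, 0), (0, 1), (1, 1), (1, -1), (2, 1), (-1, 2)]
space = [(1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 0, 1)]

assert three_transitive(plane)       # holds on the planar sample
assert not three_transitive(space)   # fails in 3-dimensional space
```

In the spatial sample the chain (1,0,0), (0,0,1), (0,1,0), (1,0,1) consists of consecutive orthogonal lines, yet the first and the last are not orthogonal.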





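A modal logic of parallelism and intersection of lines. Having outlined the
modal logics of parallelism and the modal logics of orthogonality, we are now
in a position to consider richer geometrical structures. Specifically, we discuss
the line-based modal logic based on the binary relations of parallelism and
intersection of lines, with the corresponding modal operators $[\parallel]$ and $[\times]$. Our
aim is to axiomatize the logic of the standard two-dimensional frame consisting
of all lines in the real affine plane (called SAP) with the strict parallelism
relation $\parallel$ and the standard relation of intersection: $a \times b$ iff a and b have only
one common point. Let us note that the following modal formulas are true in
SAP:


■ $\varphi \to [\parallel]\langle\parallel\rangle\varphi$,

■ $\varphi \wedge [\parallel]\varphi \to [\parallel][\parallel]\varphi$,

■ $\varphi \to [\times]\langle\times\rangle\varphi$,

■ $[\times]\varphi \to [\parallel][\times]\varphi$,

■ $\varphi \wedge [\parallel]\varphi \wedge [\times]\varphi \to [\times][\times]\varphi$,

■ $\langle\parallel\rangle\top$,

■ $\langle\parallel\rangle\varphi_1 \wedge \dots \wedge \langle\parallel\rangle\varphi_n \to \langle\parallel\rangle(\langle\parallel\rangle\varphi_1 \wedge \dots \wedge \langle\parallel\rangle\varphi_n)$, n = 1, 2, ...,

■ $\langle\times\rangle\top$,

■ $\langle\times\rangle\varphi_1 \wedge \dots \wedge \langle\times\rangle\varphi_n \to \langle\times\rangle(\langle\times\rangle\varphi_1 \wedge \dots \wedge \langle\times\rangle\varphi_n)$, n = 1, 2, ....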


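We denote by ML(SAP) the extension of the logic K with these axioms.
Since all of them are Sahlqvist formulas, they define the following class
C(PreSAP) of frames (called pre-standard affine planes) in which ML(SAP)
is complete:


■ $u \parallel v \to v \parallel u$,

■ $u \parallel v \wedge v \parallel w \wedge u \neq w \to u \parallel w$,

■ $u \times v \to v \times u$,

■ $u \parallel v \wedge v \times w \to u \times w$,

■ $u \times v \wedge v \times w \to u = w \vee u \parallel w \vee u \times w$,

■ $(\forall u \exists v)(u \parallel v)$,

■ $u \parallel v_1 \wedge \dots \wedge u \parallel v_n \to (\exists w)(u \parallel w \wedge w \parallel v_1 \wedge \dots \wedge w \parallel v_n)$, n = 1, 2, ...,

■ $(\forall u \exists v)(u \times v)$,

■ $u \times v_1 \wedge \dots \wedge u \times v_n \to (\exists w)(u \times w \wedge w \times v_1 \wedge \dots \wedge w \times v_n)$, n = 1, 2, ....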


Let us call a structure $(Li, \parallel, \times)$ a general affine plane if it satisfies all of the
above first-order conditions plus the conditions of irreflexivity of the relations
$\times$ and $\parallel$, and let us denote the class of all such structures by C(GAP). Note
that the standard affine plane SAP is in this class. Applying the method of
bounded morphisms it can be proved that ML(SAP) is also complete in this
class. However, this still does not prove that ML(SAP) is complete with
respect to SAP. Applying more complicated techniques from model theory,
it can be proved that any two frames from C(GAP) are modally equivalent,
i.e. determine equal logics. Since the standard affine plane is in C(GAP), this
implies that the logic ML(SAP) is complete for its standard semantics.

The classes C(PreSAP) and C(GAP) are quite different. Using
selective filtration techniques one can prove that the logic ML(SAP) has the finite
model property (fmp) with respect to C(PreSAP) and hence is decidable,
while with respect to C(GAP) it does not have the fmp, since all frames from C(GAP)
are infinite. These facts, however, help to prove that the satisfiability problem for
C(GAP) is NP-complete.

The completeness theorem for ML(SAP) implies that the modal language
of parallelism and intersection of lines is too weak to distinguish the standard
2-dimensional frame from the other frames in the class C(GAP). But the
language can distinguish the 2-dimensional standard frame from the 3-dimensional
standard frame, the latter consisting of all lines in real 3-dimensional space with
the standard relation of strict parallelism and the standard relation of
intersection of lines. For instance, the following axiom of ML(SAP) is not true in
3-dimensional space: $[\times]A \to [\parallel][\times]A$. The reason is that in 3-dimensional
space there are lines intersecting one of two parallel lines but not the other.


13. Tip spatial logics 


Projective geometry and affine geometry are among the most prominent 
mathematical models of space. They arise from the study of points and lines 
by means of properties stated in terms of incidence. In this section and in the 
following one, we will introduce modal logics for incidence between points 
and lines. There are two different approaches for defining a modal logic of 
incidence between points and lines. The standard semantics for modal logic 
assumes Kripke models with only one sort of possible worlds. Therefore, the 
first approach consists in the replacement of the two-sorted structures based 
on points and lines by one-sorted structures containing the same geometrical 
information. The second approach consists in the extension of the modal logic 
formalism allowing two sorts of formulas, point formulas and line formulas, and 
two sorts of possible worlds in Kripke models. The remainder of this section 
briefly describes the first approach (see Balbiani et al., 1997), while the second 
approach will be considered in Sec. 14. 


Tips. Let F = (Po, Li, I) be a point-line incidence plane, that is:


■ Po is a non-empty set of points with typical elements denoted by X, Y,
Z, T, etc., possibly with subscripts,


■ Li is a non-empty set of lines with typical elements denoted by x, y, z,
t, etc., possibly with subscripts,


■ I is a binary relation of incidence between points and lines.


The relationship X I x will be read “X is incident with x”, “X lies in x”, “x
is incident with X”, or “x passes through X”. We will always assume that
$Po \cap Li = \emptyset$, i.e. no point is a line and no line is a point. Hereafter, we will
assume in this section that the binary relation I satisfies the following first-order
conditions:





■ $\forall X\forall Y\exists z\,(X\,I\,z \wedge Y\,I\,z)$,
■ $\forall X\forall Y\forall z\forall t\,(X\,I\,z \wedge Y\,I\,z \wedge X\,I\,t \wedge Y\,I\,t \to X = Y \vee z = t)$,
■ $\forall x\exists Y\exists Z\,(Y\,I\,x \wedge Z\,I\,x \wedge Y \neq Z)$,
■ $\forall X\exists y\exists z\,(X\,I\,y \wedge X\,I\,z \wedge y \neq z)$.

The notion of point-line incidence plane can be extended with new first-order
conditions in different directions. Two natural extensions are the notion of
affine plane and the notion of projective plane. Consider a point-line incidence
plane F = (Po, Li, I). Let us define on Li the binary relation $\parallel$ in the following
way:


■ $x \parallel y$ iff for all points Z, if $Z\,I\,x$ and $Z\,I\,y$ then x = y.


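A point-line incidence plane F = (Po, Li, I) is called an affine plane if it
satisfies the following additional first-order conditions:


■ $\forall X\forall y\exists z\,(X\,I\,z \wedge y \parallel z)$, $\quad \forall x\forall y\forall z\,(x \parallel y \wedge y \parallel z \to x \parallel z)$.


Obviously, point-line affine planes are Euclidean in the sense that they satisfy
the following condition:


■ $\forall X\forall y\forall z\forall t\,(X\,I\,z \wedge y \parallel z \wedge X\,I\,t \wedge y \parallel t \to z = t)$.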


A point-line incidence plane F = (Po, Li, I) is called a projective plane if it
satisfies the following additional first-order conditions:


■ $\forall x\forall y\exists Z\,(Z\,I\,x \wedge Z\,I\,y)$,

■ $\forall x\forall Y\forall Z\exists T\,(Y\,I\,x \wedge Z\,I\,x \to T\,I\,x \wedge T \neq Y \wedge T \neq Z)$.


It is clear from our definition that if F = (Po, Li, I) is a projective plane
then two different points are always incident with exactly one line whereas two
different lines always have exactly one point in common.

Traditionally, the Kripke semantics of modal logics is based on one-sorted
relational structures. That is why we introduce a new kind of relational
structures, called incidence frames, which are one-sorted and which will be used
for defining the Kripke semantics of our next spatial logics. Now consider a
point-line incidence plane F = (Po, Li, I). We shall say that the pair (X, x)
in Po × Li is a tip over F iff X I x. Intuitively, the tip (X, x) can be considered
both as the point X and as the line x. Using tips, we can define the following
binary relations:


■ $(X, x) \equiv_1^F (Y, y)$ iff X = Y,
■ $(X, x) \equiv_2^F (Y, y)$ iff x = y.


In the expression $(X, x) \equiv_1^F (Y, y)$, (X, x) and (Y, y) are considered as the
points X and Y and the relation $\equiv_1^F$ can be seen as the equality of points.
Similarly, in the expression $(X, x) \equiv_2^F (Y, y)$, (X, x) and (Y, y) are considered
as the lines x and y and the relation $\equiv_2^F$ can be seen as the equality of lines.
Using the binary relations $\equiv_1^F$ and $\equiv_2^F$, we can simulate the binary relation
of incidence between points and lines and the binary relation of parallelism
between lines in F. Let $O^F$ and $\parallel^F$ be the binary relations between tips defined
in the following way:


■ $(X, x)\,O^F\,(Y, y)$ iff X I y,
■ $(X, x) \parallel^F (Y, y)$ iff $x \parallel y$.


Obviously, the relation of incidence $O^F$ between tips is definable by means of
$\equiv_1^F$ and $\equiv_2^F$ as follows: $w_1\,O^F\,w_2$ iff there exists a tip w such that $w_1 \equiv_1^F w$
and $w \equiv_2^F w_2$, hence $O^F = \equiv_1^F \circ \equiv_2^F$ where $\circ$ is the composition of binary
relations. Likewise, $w_1 \parallel^F w_2$ iff for all tips w, if $w\,O^F\,w_1$ and $w\,O^F\,w_2$ then
$w_1 \equiv_2^F w_2$.

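Incidence frames. Tips motivate the following definition. Consider a
point-line incidence plane F = (Po, Li, I). The incidence frame over F is
the structure $W(F) = (W^F, \equiv_1^F, \equiv_2^F)$ where $W^F$ is the set of all tips over F.
It is not too difficult to see that $\equiv_1^F$ and $\equiv_2^F$ are equivalence relations on $W^F$
satisfying the following additional conditions:


(I1) $\forall w\forall w'\,(w \equiv_1^F w' \wedge w \equiv_2^F w' \to w = w')$,
(I2) $\forall w\forall w'\exists w''\,(w\,O^F\,w'' \wedge w'\,O^F\,w'')$,
(I3) $\forall w\forall w'\forall w''\forall w'''\,(w\,O^F\,w'' \wedge w'\,O^F\,w'' \wedge w\,O^F\,w''' \wedge w'\,O^F\,w''' \to w \equiv_1^F w' \vee w'' \equiv_2^F w''')$,
(I4) $\forall w\exists w'\exists w''\,(w'\,O^F\,w \wedge w''\,O^F\,w \wedge w' \not\equiv_1^F w'')$,
(I5) $\forall w\exists w'\exists w''\,(w\,O^F\,w' \wedge w\,O^F\,w'' \wedge w' \not\equiv_2^F w'')$.


Let us remark that $\equiv_1^F$ and $\equiv_2^F$ define = in the following way: w = w' iff
$w \equiv_1^F w'$ and $w \equiv_2^F w'$. Moreover, if F is affine then:


(A1) $\forall w\forall w'\exists w''\,(w\,O^F\,w'' \wedge w' \parallel^F w'')$,
(A2) $\forall w\forall w'\forall w''\,(w \parallel^F w' \wedge w' \parallel^F w'' \to w \parallel^F w'')$.


If F is projective then:

(P1) $\forall w\forall w'\exists w''\,(w''\,O^F\,w \wedge w''\,O^F\,w')$,
(P2) $\forall w\forall w'\forall w''\exists w'''\,(w'\,O^F\,w \wedge w''\,O^F\,w \to w'''\,O^F\,w \wedge w''' \not\equiv_1^F w' \wedge w''' \not\equiv_1^F w'')$.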








These conditions are characteristic in the following sense: if in a set W we
have two equivalence relations $\equiv_1$ and $\equiv_2$ satisfying the conditions (I1)-(I5)
then there exists a point-line incidence plane F such that the relational
structures $(W, \equiv_1, \equiv_2)$ and $W(F) = (W^F, \equiv_1^F, \equiv_2^F)$ are isomorphic. Moreover,
if $(W, \equiv_1, \equiv_2)$ satisfies the conditions (A1) and (A2) then the corresponding
point-line incidence plane F = (Po, Li, I) is affine, and, if it satisfies the
conditions (P1) and (P2) then the corresponding point-line incidence plane
is projective. So, in order to define the Kripke semantics of a modal logic of
incidence, we can use one-sorted structures of the form $(W, \equiv_1, \equiv_2)$ instead of
point-line incidence planes.

Consider a relational structure of the form $(W, \equiv_1, \equiv_2)$ where $\equiv_1$ and $\equiv_2$
are equivalence relations on W. We shall say that $(W, \equiv_1, \equiv_2)$ is an incidence
frame if it satisfies the conditions (I1)-(I5). Moreover, $(W, \equiv_1, \equiv_2)$ is said to
be affine if it satisfies (A1) and (A2), and it is said to be projective if it satisfies
(P1) and (P2).
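The passage from a point-line incidence plane to its incidence frame can be made concrete on a small example. The sketch below is an illustration under stated assumptions: it takes the seven-point Fano plane as the plane F, represents a tip as a pair (point, line index), and checks the conditions (I1)-(I5), (P1) and (P2) by brute force. All identifiers (`TIPS`, `eq1`, `eq2`, `inc`, and so on) are hypothetical names chosen for this example.

```python
from itertools import product

# Lines of the Fano plane; a tip is a pair (X, x) with point X on the line with index x.
LINES = [frozenset(l) for l in
         ({0, 1, 2}, {0, 3, 4}, {0, 5, 6}, {1, 3, 5}, {1, 4, 6}, {2, 3, 6}, {2, 4, 5})]
TIPS = [(X, x) for x in range(len(LINES)) for X in sorted(LINES[x])]


def eq1(w, v):
    return w[0] == v[0]            # same point


def eq2(w, v):
    return w[1] == v[1]            # same line


def inc(w, v):
    return w[0] in LINES[v[1]]     # w O v: the point of w lies on the line of v


def incidence_is_composition(T):
    """O coincides with the composition of eq1 and eq2."""
    return all(inc(w, v) == any(eq1(w, u) and eq2(u, v) for u in T)
               for w, v in product(T, T))


def is_incidence_frame(T):
    i1 = all(w == v for w, v in product(T, T) if eq1(w, v) and eq2(w, v))
    i2 = all(any(inc(w, u) and inc(v, u) for u in T) for w, v in product(T, T))
    i3 = all(eq1(w, v) or eq2(u, s)
             for w, v, u, s in product(T, repeat=4)
             if inc(w, u) and inc(v, u) and inc(w, s) and inc(v, s))
    i4 = all(any(inc(v, w) and inc(u, w) and not eq1(v, u)
                 for v, u in product(T, T)) for w in T)
    i5 = all(any(inc(w, v) and inc(w, u) and not eq2(v, u)
                 for v, u in product(T, T)) for w in T)
    return i1 and i2 and i3 and i4 and i5


def is_projective(T):
    p1 = all(any(inc(u, w) and inc(u, v) for u in T) for w, v in product(T, T))
    p2 = all(any(inc(s, w) and not eq1(s, v) and not eq1(s, u) for s in T)
             for w, v, u in product(T, repeat=3) if inc(v, w) and inc(u, w))
    return p1 and p2
```

On this plane there are 21 tips, and `is_incidence_frame(TIPS)` and `is_projective(TIPS)` are both expected to hold, since every line of the Fano plane carries three points and any two lines meet.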

Let us note that the properties of Desargues and Pappus are also expressible 
in the present language, so it is quite rich. 


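A modal logic for incidence. Our modal language for incidence frames
uses the modal operators $[\equiv_1]$, $[\equiv_2]$, $[\not\equiv_1]$, and $[\not\equiv_2]$. Well-formed formulas are
given by the rule:


■ $A ::= p \mid \bot \mid \neg A \mid (A \vee B) \mid [\equiv_1]A \mid [\equiv_2]A \mid [\not\equiv_1]A \mid [\not\equiv_2]A$.


Abbreviations: difference: $[\neq]A = [\not\equiv_1]A \wedge [\not\equiv_2]A$; universal modality:
$[U]A = A \wedge [\neq]A$; incidence: $[O]A = [\equiv_1][\equiv_2]A$, $[O^{-1}]A = [\equiv_2][\equiv_1]A$.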


The semantics is based on incidence frames in the expected way. In particular,
we have:


■ $M, w \models [\equiv_i]A$ iff for all $w' \in W$ such that $w \equiv_i w'$, $M, w' \models A$,
■ $M, w \models [\not\equiv_i]A$ iff for all $w' \in W$ such that $w \not\equiv_i w'$, $M, w' \models A$,


for $i \in \{1, 2\}$.
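The following formulas are valid in $C_{inc}$:


(Ax1) $A \to [\not\equiv_i]\langle\not\equiv_i\rangle A$, $i \in \{1, 2\}$,
(Ax2) $A \to [\neq]\langle\neq\rangle A$,
(Ax3) $A \wedge [\neq]A \to [\neq][\neq]A$,
(Ax4) $[U]A \to [\equiv_i]A$, $i \in \{1, 2\}$,
(Ax5) $[\equiv_i]A \wedge [\not\equiv_i]A \to [U]A$, $i \in \{1, 2\}$,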
(Ax6) $\langle\not\equiv_i\rangle A \to [\equiv_i]\langle\not\equiv_i\rangle A$, $i \in \{1, 2\}$,
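(Ax7) $[\equiv_i]A \to A$, $A \to [\equiv_i]\langle\equiv_i\rangle A$, $[\equiv_i]A \to [\equiv_i][\equiv_i]A$, $i \in \{1, 2\}$,
(Ax8) $[O][O^{-1}]A \to [U]A$,
(Ax9) $\langle O\rangle(A \wedge \langle O^{-1}\rangle([\neq]B \wedge C)) \to ([O](\langle\equiv_2\rangle A \vee [O^{-1}]B) \vee \langle\equiv_1\rangle C)$,
(Ax10) $A \to \langle O^{-1}\rangle\langle\not\equiv_1\rangle\langle O\rangle A$,
(Ax11) $A \to \langle O\rangle\langle\not\equiv_2\rangle\langle O^{-1}\rangle A$.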


Let MIG (Modal Incidence Geometry) be the axiom system obtained by adding
the formulas (Ax1)-(Ax11) to the minimal normal modal logic in our language.
Note that all proper axioms of MIG are Sahlqvist formulas and that the
associated first-order properties correspond to conditions defining incidence frames.
For example, the formulas (Ax9), (Ax10), and (Ax11) correspond to the
first-order properties (I3), (I4), and (I5) respectively. Nevertheless, MIG is not
known to be complete with respect to validity in the class $C_{inc}$ of all incidence
frames. The point is that the interpretation in incidence frames of formulas of
the form $[\not\equiv_1]A$ and $[\not\equiv_2]A$ is based on the complements $\not\equiv_1$ and $\not\equiv_2$ of the binary
relations $\equiv_1$ and $\equiv_2$. The difficulty with the complementarity relations is that
there is no axiom corresponding exactly to the first-order properties saying that:


■ $\equiv_i \cap \not\equiv_i = \emptyset$ for $i \in \{1, 2\}$.


We have seen that $[\not\equiv_1]$ and $[\not\equiv_2]$ define $[\neq]$. Moreover, notice that on the class
$C_{inc}$ of all incidence frames the formula $A \wedge [\neq]\neg A$ is satisfied at some tip w in
some incidence model $M = (W, \equiv_1, \equiv_2, V)$ iff w is the only tip in W where A
holds. Hence, $A \wedge [\neq]\neg A$ can be considered as a sort of proper name for w. The
reader may observe that the first-order properties saying that the binary relations
$\equiv_i$ and $\not\equiv_i$ are disjoint are equivalent to the first-order condition of irreflexivity
of the binary relation $\not\equiv_1 \cup \not\equiv_2$. Although irreflexivity does not correspond to
a modal formula, it can be characterized in some sense by an inference rule.
In this connection, see Gabbay, 1981, de Rijke, 1992, and Venema, 1993. This
suggests that we enrich the axiom system MIG with a special inference rule,
the inference rule of irreflexivity:


■ “Given $p \wedge [\neq]\neg p \to A$, prove A”,


where p is a proposition letter not occurring in A, thus obtaining the axiom
system MIG*. This inference rule also has an infinitary version:


■ “Given $p \wedge [\neq]\neg p \to A$ for all proposition letters p, prove A”,


which gives rise to the same set of provable formulas. Soundness of MIG*
with respect to validity in $C_{inc}$ is straightforward: we already know that MIG
is sound, hence it is enough to verify that the inference rule of irreflexivity
preserves validity in the class $C_{inc}$. As for the completeness of MIG*, we
build a special model from maximal consistent sets of formulas which are closed
under the infinitary version of the rule. Since all proper axioms of MIG* are
Sahlqvist formulas and our modal language is versatile, the underlying relational
structure is an incidence frame. See Blackburn et al., 2001; Venema, 1993 for
more details about the importance of inference rules like the inference rule of
irreflexivity. We do not know if it is possible to eliminate the inference rule of
irreflexivity in our axiom system: the completeness of MIG with respect to
validity in the class $C_{inc}$ is still open. In addition, the decidability/complexity
issue of validity in $C_{inc}$ is still unresolved.

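Note also that we can obtain a complete axiomatization of the projective
incidence frames by adding the following axioms to the system MIG:


(MPG1) $\langle U\rangle A \to \langle O^{-1}\rangle\langle O\rangle A$,
(MPG2) $\langle O\rangle(A \wedge \langle O^{-1}\rangle B) \to \langle\not\equiv_1\rangle(\langle O\rangle A \wedge \langle\not\equiv_1\rangle B)$.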


Extending the language with the modality $[\parallel]$ we can also axiomatize the
affine incidence frames.

We note that the presented systems have rich expressiveness, containing
modalities with the following intuitive readings: $[U]A$: everywhere; $[\neq]A$:
everywhere else; $[\equiv_1]A$: in all points; $[\not\equiv_1]A$: in all other points; $[\equiv_2]A$: in
all lines; $[\not\equiv_2]A$: in all other lines; $[O]A$: in all lines through the current point;
$[O^{-1}]A$: in all points on the current line.


14. Point-line spatial logics 


Standard modal languages have semantics over one-sorted frames. Within
the context of dynamic logic, van Benthem, 1994, Marx, 1996, and de Rijke,
1995 were among the first to use relational structures made up of several sets
of possible worlds together with binary relations between them. One possible
application of such languages is to many-sorted geometrical structures like
incidence geometries based on points and lines and inter-sort relations of incidence
between them. In this section we follow Venema, 1999.


Two-sorted modal logic. Consider, for instance, a relational structure of
the form $F = (W_1, W_2, R)$ where $W_1$ and $W_2$ are nonempty sets and
$R \subseteq W_1 \times W_2$. For the sake of simplicity, we assume that the sets $W_1$ and $W_2$
are disjoint. From now on, such structures will be called two-sorted Kripke
frames. In F, the binary relation R links elements of $W_1$ with elements of $W_2$.
If modal languages must be used for talking about relational structures like F,
one possibility is to consider a language with two sorts of formulas:


■ $A ::= p \mid \bot \mid \neg A \mid (A \vee B) \mid \Box\alpha$: formulas of the first sort,

■ $\alpha ::= \pi \mid \bot \mid \neg\alpha \mid (\alpha \vee \beta) \mid \Box A$: formulas of the second sort,


where p and $\pi$ denote propositional letters of the corresponding sorts. Note that
the modality $\Box$ transforms one sort into the other.

A two-sorted Kripke model based on F is nothing but a structure of the
form $M = (W_1, W_2, R, V)$ where V, the valuation of the model, associates
a subset V(p) of $W_1$ with every propositional letter p of the first type and a subset
$V(\pi)$ of $W_2$ with every propositional letter $\pi$ of the second type. Elements of
$W_1$ being denoted by upper case letters like X, Y, Z, etc., and elements of
$W_2$ being denoted by lower case letters like x, y, z, etc., formulas like p, $\neg A$,
$A \vee B$, and $\Box\alpha$ will be interpreted at elements of $W_1$, whereas formulas like
$\pi$, $\neg\alpha$, $\alpha \vee \beta$, and $\Box A$ will be interpreted at elements of $W_2$ according to the
satisfiability relation defined as usual. In particular:


■ $M, X \models \Box\alpha$ iff $(\forall y \in W_2)(X R y$ implies $M, y \models \alpha)$,
■ $M, x \models \Box A$ iff $(\forall Y \in W_1)(Y R x$ implies $M, Y \models A)$.


We can define what it means for a formula of a given sort to be true (satisfiable)
in a given model in the obvious way.

Although the extension of the standard techniques (canonical model,
bisimulation, filtration, etc.) and results (completeness, finite frame property,
definability, etc.) of modal logic to multi-sorted languages like the one we have just
described has never been considered in detail, we believe that their extension to
multi-sorted modal logics is straightforward. As for the two-sorted modal
language considered above, it is a simple matter to check that $K_2$, the following
axiom system, is sound and complete with respect to validity in the class of
all two-sorted Kripke frames:


■ Axioms of the first type: all first-type substitution instances of classical
tautologies together with all formulas of the form
$\Box(\alpha \to \beta) \to (\Box\alpha \to \Box\beta)$ and $A \to \Box\Diamond A$,

■ Axioms of the second type: all second-type substitution instances of
classical tautologies together with all formulas of the form
$\Box(A \to B) \to (\Box A \to \Box B)$ and $\alpha \to \Box\Diamond\alpha$,

■ Inference rules of the first type: Modus ponens “given A and
$A \to B$, prove B” and generalization “given $\alpha$, prove $\Box\alpha$”,

■ Inference rules of the second type: Modus ponens “given $\alpha$ and $\alpha \to \beta$,
prove $\beta$” and generalization “given A, prove $\Box A$”.


In the next paragraph we apply these ideas to the cases of plane projective
geometry and plane affine geometry.


A two-sorted modal logic for plane projective geometry. The point-line
incidence planes defined in Sec. 13 are good candidates to stand for the Kripke
semantics of a two-sorted modal language. Let us consider the class $C_{PG}$ of all
projective planes, i.e. two-sorted structures F = (Po, Li, I) where Po is a
nonempty set of points, Li is a nonempty set of lines, and I is a binary relation
between points and lines such that:


■ $\forall X\forall Y\exists z\,(X\,I\,z \wedge Y\,I\,z)$,
■ $\forall x\forall y\exists Z\,(Z\,I\,x \wedge Z\,I\,y)$,
■ $\forall X\forall Y\forall z\forall t\,(X\,I\,z \wedge Y\,I\,z \wedge X\,I\,t \wedge Y\,I\,t \to X = Y \vee z = t)$.








Of course, we will assume that the sets Po and Li are disjoint. According to
the discussion above, we now turn to the definition of our two-sorted modal
languages for talking about projective planes. Let us consider a countable set
$\Phi_{Po}$ of point-type proposition letters, with typical members denoted p, q, r, etc.,
and a countable set $\Phi_{Li}$ of line-type proposition letters, with typical members
denoted $\pi$, $\rho$, $\sigma$, etc. The well-formed formulas are defined by the following
rules:


■ $A ::= p \mid \bot \mid \neg A \mid (A \vee B) \mid \Box\alpha$: point formulas,

■ $\alpha ::= \pi \mid \bot \mid \neg\alpha \mid (\alpha \vee \beta) \mid \Box A$: line formulas.


In some two-sorted model M = (Po, Li, I, V), the point-formula $\Box\alpha$ is
satisfied at a point X iff the line-formula $\alpha$ is satisfied at every line passing through
X. Similarly, the line-formula $\Box A$ is satisfied at line x iff the point-formula A
is satisfied at every point lying on x.

Seeing that two points are always incident with at least one line and two lines
always pass together through at least one point, the reader may easily verify that
the point-formula $\Box\Box A$ is satisfied at point X in M iff the point-formula A
is true everywhere in M. Similarly, the line-formula $\Box\Box\alpha$ is satisfied at line
x in M iff the line-formula $\alpha$ is true everywhere in M. Hence, the universal
modality $[U]$ for points and the universal modality $[u]$ for lines are definable in
the following way: $[U]A = \Box\Box A$ and $[u]\alpha = \Box\Box\alpha$.
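The definability of the two universal modalities can be verified exhaustively on a small projective plane. The sketch below is an illustration only: it assumes the Fano plane as the model, writes both sort-changing boxes as separate functions, and represents formulas by their truth sets. The function names are hypothetical.

```python
from itertools import combinations

POINTS = range(7)
LINES = [{0, 1, 2}, {0, 3, 4}, {0, 5, 6}, {1, 3, 5}, {1, 4, 6}, {2, 3, 6}, {2, 4, 5}]
LINE_IDS = range(len(LINES))


def box_to_points(line_truth):
    """Point formula []alpha: points all of whose lines satisfy alpha."""
    return {X for X in POINTS
            if all(i in line_truth for i in LINE_IDS if X in LINES[i])}


def box_to_lines(point_truth):
    """Line formula []A: lines all of whose points satisfy A."""
    return {i for i in LINE_IDS if LINES[i] <= set(point_truth)}


def univ_points(point_truth):
    """[U]A computed as [][]A."""
    return box_to_points(box_to_lines(point_truth))


def univ_lines(line_truth):
    """[u]alpha computed as [][]alpha."""
    return box_to_lines(box_to_points(line_truth))


def all_subsets(items):
    items = list(items)
    return [set(c) for r in range(len(items) + 1) for c in combinations(items, r)]


def check_universal():
    everything_p, everything_l = set(POINTS), set(LINE_IDS)
    for A in all_subsets(POINTS):
        assert univ_points(A) == (everything_p if A == everything_p else set())
    for alpha in all_subsets(LINE_IDS):
        assert univ_lines(alpha) == (everything_l if alpha == everything_l else set())
    return True
```

The check relies on exactly the two facts quoted in the text: any two points share a line, and any two lines share a point.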

The two-sorted modal logic defined by the class $C_{PG}$ of all projective planes
has been studied first by Balbiani, 1998 and Venema, 1999. They proved that
the axiom system KX, obtained by adding all instances of the following axioms
to $K_2$, is complete with respect to validity in $C_{PG}$:

Axioms of the first type: Axioms of the second type: 

a — [U]©a A —> [uJoOA 
$[U]A \to A$          $[u]\alpha \to \alpha$
$[U]A \to [U][U]A$          $[u]\alpha \to [u][u]\alpha$
$A \to [U]\langle U\rangle A$          $\alpha \to [u]\langle u\rangle\alpha$


The proof of the decidability of the set of all formulas of the first type
satisfiable in $C_{PG}$ and the proof of the decidability of the set of all formulas of
the second type satisfiable in $C_{PG}$ can be done using the standard technique
of filtration. As usual, this filtration argument implies that satisfiability of
point-formulas and line-formulas within $C_{PG}$ is in NEXPTIME. What makes
our two-sorted modal logic interesting is the following result proved by Venema,
1999: satisfiability of point-formulas and line-formulas within the class $C_{PG}$ is
NEXPTIME-complete.

The expressive power of our two-sorted modal language is weak. For
example, neither the difference modality between points nor the difference modality
between lines is definable in it. Let us extend our two-sorted language by
allowing point-formulas like $[D]A$ and line-formulas like $[d]\alpha$. In some
two-sorted model M = (Po, Li, I, V), the point-formula $[D]A$ will be satisfied at
point X iff the point-formula A is satisfied at every point different from X, whereas
the line-formula $[d]\alpha$ will be satisfied at line x iff the line-formula $\alpha$ is satisfied
at every line different from x. The axiomatisation/completeness and
decidability/complexity issues of validity and satisfiability of formulas in the extended
two-sorted language with respect to the class $C_{PG}$ are still open.


A two-sorted modal logic for plane affine geometry. A particular aspect 
of plane projective geometry is the duality between points and lines. In plane 
affine geometry, points and lines are no longer interchangeable seeing that, 
in point-line affine planes, although two different points are always incident 
with exactly one line, parallel lines have no point in common. This imbalance 
between points and lines in affine planes is translated into additional difficulties 
for those who wish to define a two-sorted modal logic for plane affine geometry. 
The language of this two-sorted modal logic must be able to talk about incidence 
between points and lines and parallelism between lines. The solution in Balbiani 
and Goranko, 2002 is to consider the following rules that mutually define the 
formulas of sort point and the formulas of sort line: 


$$A ::= p \mid \bot \mid \neg A \mid (A \lor B) \mid \Box a,$$
$$a ::= \pi \mid \bot \mid \neg a \mid (a \lor b) \mid \Box A \mid [\parallel_s] a.$$

As for the two-sorted modal logic for projective geometry, point formulas
like $\Box a$ are read “$a$ is satisfied at every line incident with the current point”
and line formulas like $\Box A$ are read “$A$ is satisfied at every point incident with
the current line”. The unary modality $[\parallel_s]$ is interpreted by the strong, i.e.
irreflexive, binary relation of parallelism between lines defined, in any affine
plane F = (Po, Li, I), by:

$$x \parallel_s y \text{ iff for all points } Z,\ \text{not } Z \mathrel{I} x \text{ or not } Z \mathrel{I} y.$$


In this setting, if V is a valuation on F then the definition of the satisfiability 
relation in the two-sorted model M = (Po, Li, I, V) defined by V over F now 
contains the following item: 
$$M, x \models [\parallel_s] a \text{ iff for all } y \in Li \text{ such that } x \parallel_s y,\ M, y \models a.$$


It is a simple matter to check that, in M, the universal modality [U] for points
and the universal modality [u] for lines are definable in the following way:
$[U]A = \Box\Box A$ and $[u]a = \Box\Box a \land [\parallel_s]a$. Seeing that $[\parallel_s]$ corresponds to the
strong relation of parallelism, we observe that for all points X in Po and for
all lines x in Li:

$$M, X \models \Box[\parallel_s]\Box A \text{ iff for all } Y \in Po \text{ such that } X \neq Y,\ M, Y \models A,$$
$$M, x \models [\parallel_s]\Box\Box a \text{ iff for all } y \in Li \text{ such that } x \neq y,\ M, y \models a.$$

Hence, the difference modality [D] for points and the difference modality [d] for
lines are definable in the following way: $[D]A = \Box[\parallel_s]\Box A$ and $[d]a = [\parallel_s]\Box\Box a$.
To illustrate the value of our two-sorted modal language, let us remark that, in
the class Cap of all affine planes, the following formulas are valid:

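Formulas of type point:

    $\Box a \to \Diamond a$
    $[U]A \to [U][U]A$
    $[U]A \to [D]A$
    $A \land [D]A \to [U]A$
    $[U]\Box a \to \Box a \land \Box[\parallel_s]a$
    $A \land \Diamond(a \land \Diamond(\neg A \land [D]A)) \to \Box(\Box A \lor a)$

Formulas of type line:

    $\Box A \to \Diamond A$
    $[u]a \to [u][u]a$
    $a \to [\parallel_s]\langle\parallel_s\rangle a$
    $a \land [\parallel_s]a \to [\parallel_s][\parallel_s]a$
    $[u]a \to [d]a$
    $a \land [d]a \to [u]a$
    $[u]\Box A \to \Box A \land [\parallel_s]\Box A$
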
These formulas are Sahlqvist formulas. Hence they correspond to first-order
conditions on two-sorted structures. For example, the point formula
$[U]A \to [U][U]A$ is related to the property of line-connectedness saying that
every two points are incident with a common line, whereas the point formula
$[U]\Box a \to \Box a \land \Box[\parallel_s]a$ and the line formula $[u]\Box A \to \Box A \land [\parallel_s]\Box A$ are related
to the existence part of Euclid’s property saying that every point not incident
with a given line is incident with at least one line parallel to the given line. As
for the point formula $A \land \Diamond(a \land \Diamond(\neg A \land [D]A)) \to \Box(\Box A \lor a)$, it corresponds
to the normality condition saying that every two distinct points have no more
than one common incident line. Whether adding to $K_2$ all instances of the
above formulas yields a complete axiom system for validity in Cap is still open.
However, thanks to the possibility of defining in our language the difference
modalities between points or lines, we may axiomatize validity by means of
irreflexivity rules. The axiom system AFF is obtained by adding all instances
of the above formulas to the basic logic together with the following special
inference rules:

Irreflexivity rule of type point: “given $p \land [D]\neg p \to A$, prove $A$” where
p is a proposition letter of sort point not occurring in A,

Irreflexivity rule of type line: “given $\pi \land [d]\neg\pi \to a$, prove $a$” where $\pi$ is
a proposition letter of sort line not occurring in a.


The completeness of AFF with respect to Cap is proved by transferring the
analogous techniques based on irreflexivity rules known from the one-sorted
case. See Balbiani and Goranko, 2002 for details.

Since it is possible to define the difference modalities between points or lines 
in our two-sorted modal language, this language is expressive enough to allow 
us to define formulas expressing Desarguesian and Pappian properties and to 
axiomatize the corresponding logics. 
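
The definability of the universal and difference modalities can be checked mechanically on a small model. The following Python sketch builds the affine plane over the integers modulo 3 (an illustrative choice that is not taken from the text) and compares the defined operators with their intended semantics under every valuation. All function names are hypothetical, and the operators act on extensions (sets of points or of lines) rather than on syntax.

```python
from itertools import product

Q = 3  # assumption: work in the affine plane over the integers modulo 3
POINTS = list(product(range(Q), repeat=2))

def build_lines():
    """Return the 12 lines of the plane, each as a frozenset of 3 points."""
    lines = set()
    for m, c in product(range(Q), repeat=2):      # lines y = m*x + c
        lines.add(frozenset((x, (m * x + c) % Q) for x in range(Q)))
    for c in range(Q):                            # vertical lines x = c
        lines.add(frozenset((c, y) for y in range(Q)))
    return sorted(lines, key=sorted)

LINES = build_lines()

def strictly_parallel(x, y):
    """x ||s y: no point is incident with both lines (so x != y)."""
    return not (x & y)

def box_point(line_ext):
    """Points all of whose incident lines lie in line_ext."""
    return {X for X in POINTS if all(x in line_ext for x in LINES if X in x)}

def box_line(point_ext):
    """Lines all of whose incident points lie in point_ext."""
    return {x for x in LINES if all(X in point_ext for X in x)}

def box_parallel(line_ext):
    """Lines all of whose strictly parallel lines lie in line_ext."""
    return {x for x in LINES
            if all(y in line_ext for y in LINES if strictly_parallel(x, y))}

def subsets(items):
    """Yield every subset of items as a set."""
    for mask in range(1 << len(items)):
        yield {items[i] for i in range(len(items)) if (mask >> i) & 1}

def check_point_modalities():
    """Compare the defined [U] and [D] for points with their intended meaning."""
    everything = set(POINTS)
    for A in subsets(POINTS):
        universal = box_point(box_line(A))
        difference = box_point(box_parallel(box_line(A)))
        if universal != (everything if A == everything else set()):
            return False
        if difference != {X for X in POINTS if everything - {X} <= A}:
            return False
    return True

def check_line_modalities():
    """Compare the defined [u] and [d] for lines with their intended meaning."""
    everything = set(LINES)
    for a in subsets(LINES):
        universal = box_line(box_point(a)) & box_parallel(a)
        difference = box_parallel(box_line(box_point(a)))
        if universal != (everything if a == everything else set()):
            return False
        if difference != {x for x in LINES if everything - {x} <= a}:
            return False
    return True
```

Calling `check_point_modalities()` and `check_line_modalities()` runs through all 512 point valuations and all 4096 line valuations of this one plane; it is a sanity check on a single finite model, not a proof for the whole class of affine planes.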

It is still open whether satisfiability of point formulas and line formulas within
Cap is decidable. Nevertheless, following the line of reasoning suggested by
Venema, 1999, Balbiani and Goranko, 2002 proved that satisfiability within
Cap is NEXPTIME-hard.

Finally, we note that the two-sorted modal perspective in geometry is discussed
further in van Benthem, 1996, where Henkin models for second-order
logic are considered as two-sorted geometric structures, and in van Benthem,
1999, where space and time sorts are put together.


Concluding remarks: elementary geometry and spatial 
reasoning 


We end this chapter with two brief remarks. 

First, there is an obvious disparity in the influence and utility of modern 
mathematical logic to algebra and geometry: while the main (and quite deep) 
applications of logic to algebra are model-theoretic, the immediate role of logic 
in geometry is still mainly confined to axiomatizations of geometric theories and 
logical independence of geometric concepts and properties. While some recent 
model-theoretic developments (see Hodges, 1993) have deep applications to 
geometry, they are still far from being accessible enough to enter the geometer’s 
toolbox. In this chapter we have just hinted that logic can say and do more for
geometry than it has so far.

Second, we admit that the topic of this chapter is not directly related to 
practical spatial reasoning. Yet, we believe that the issues and results discussed 
here are relevant to it, because quite often, spatial reasoning ignores many 
geometric attributes such as distances, angles, precise shapes, etc. Just as 
topology can sometimes be more appropriate than metric geometry for the 
reasoning tasks at hand, affine planes (with or without ordering) or even plain 
linear incidence spaces may turn out to be the right level of abstraction. For 
instance, this should be the case when a street map is used for orientation and 
routing in the city, or in designing a method for orientation in a maze. We thus 
see the practical value of the study of logical theories for geometric structures 
discussed here in offering a hierarchy of levels of abstraction, and providing 
logical tools and techniques to suit the particular needs of the agent for spatial
representation and reasoning. 
1. Introduction


Mac Lane and Moerdijk, 1992, in their thorough introduction to topos theory,
start their Prologue by saying:

A startling aspect of topos theory is that it unifies two seemingly wholly distinct
mathematical subjects: on the one hand, topology and algebraic geometry, and
on the other hand, logic and set theory. Indeed, a topos can be considered both
as a “generalized space” and as a “generalized universe of sets”.


This dual nature of topos theory is of great importance, and one can quite 
reasonably understand Grothendieck’s name “topos” as meaning “that of which 
topology is the study”. Mac Lane and Moerdijk are unquestionably masters 
of the spatial nature of toposes, yet one could easily read through their book 
without grasping it. The mathematical technology is so firmly expressed in the 
set theory and the logic that the spatiality is obscured. 

The aim in this chapter is to provide a reader’s guide to the spatial content 
of the major texts. Those texts can also provide a more detailed account of 
original sources and other applications than has been possible here. 

We have on the one hand, the logic and set theory, and, on the other, the 
topology. In a nutshell, the topos connection between them is that the topos 
acts like a “Lindenbaum algebra” (of formulae modulo equivalence) for a logical 
theory whose models are the points of a space. 

The prototype is Stone’s Representation Theorem for Boolean algebras, 
which relates propositional logic to Hausdorff, totally disconnected topology. 
However, it takes some work to develop the idea to its full generality. First, the
logic is not at all ordinary classical logic. It is an infinitary positive logic known
as geometric logic. Second, we are in general talking about predicate theories,
and for these the appropriate notion of Lindenbaum algebra is not straightforward.
It is really the “category of sets generated by the theory”. Lastly, the “space”
of points is not an ordinary topological space; it is a real generalization.

However, the propositional fragment of the predicate logic does correspond 
more or less to ordinary topological spaces. As a rough picture of the corre- 
spondence, in the propositional case we find: 


- space ~ logical theory
- point ~ model of the theory
- open set ~ propositional formula
- sheaf ~ predicate formula
- continuous map ~ transformation of models that is definable within geometric logic


These “propositional toposes” are called localic, or (with slight abuse of 
language) locales. They are equivalent to the locales introduced in—say— 
Johnstone, 1982 or Vickers, 1989. 

Now the topos theorists discovered some deep facts about the interaction
between continuous maps and the logic and set theory of toposes. A map
f : X → Y gives a geometric morphism between the corresponding toposes
of sheaves. A topos is sufficiently like the category of sets that a kind of set
theory can be modelled in it. Roughly speaking, in sheaves over X it is set
theory “continuously parametrized by a variable point of X”. The map f then
comes to be seen as a “generalized” point of Y, parametrized by a point of X,
and this is a point of Y in the non-standard set theory of sheaves over X. So by
allowing topological reasoning to take place in toposes instead of in the category
of ordinary sets, one gains a simple way to reason about the generalized points
of Y, in other words the maps into Y.

However, to make this trick work one has to reason constructively because 
the internal logic of toposes is not in general classical. And constructive topol- 
ogy does not work well unless one replaces topological spaces with locales. 
For instance, the Tychonoff theorem and the Heine-Borel theorem hold con- 
structively for locales but not for topological spaces. 

Now there is a well known drawback to locales. They do not in general have
enough points and for this reason are normally treated with an opaque
“point-free” style of argument. However, they do have enough generalized points.
Since constructive reasoning gives easy access to these, it also allows locales to
be discussed in a spatial way in terms of their points. We in fact get a cohesive
package of mathematical deals.


1 Constructive reasoning allows maps to be treated as generalized points. 


2 Locales give a better constructive topology (better results hold) than 
ordinary spaces. 


3 The constructive reasoning makes it possible to deal with locales as 
though they were spaces of points. 


What’s more, the more stringent geometric constructivism has an intrinsic 
continuity—one might almost say it is the logical essence of continuity. The 
effect of this is that constructions described in conformity with its disciplines 
are automatically continuous. 

The prime aim of this chapter is to explain how this deep connection between 
logic and topology works out. However, as a spinoff we find that “generalized 
spaces” corresponding to toposes become more accessible. They are spaces in 
which the opens are insufficient to define the topological structure, and sheaves 
have to be used instead. 

These ideas are not essentially new. They have been a hidden part of topos 
theory from the start. Some writers, such as Wraith, 1979, have made quite 
explicit use of the virtues of geometric logic. Our aim here is to make them 
less hidden. At the same time we shall also stress a peculiarity of geometric 
logic, namely that it embodies a geometric type theory. This provides a more 
naturally mathematical mode of working in geometric logic. 

For further reading as a standard text on topos theory, we particularly recom- 
mend Mac Lane and Moerdijk, 1992. The standard reference text (Johnstone, 
2002a, Johnstone, 2002b) is much more complete and ultimately indispensable. 
In particular, it treats in some depth the notion of “geometric type construct” 
that is very important for us. However, it can be impenetrable for beginners. 

Though the chapter is so closely linked to toposes, many of its techniques 
can also be used in other (and distinct) constructive foundations such as formal 
topology in predicative type theory (Sambin, 1987). We refer to Vickers, 2006 
and Vickers, 2005 for some of the connections. 


2. Opens as propositions 


Since Tarski, 1938 it has been known that topologies—by which we mean 
specifically the lattices of open sets for topological spaces—can provide models 
for intuitionistic propositional logic. 

For discrete topologies, i.e. powersets, this is no surprise. Classical
propositional logic can be embedded in classical predicate logic by translating each
proposition into a predicate with a single variable x, and then the standard
semantics interprets each proposition as a subset of the carrier for x. Logical
connectives translate directly into the corresponding set-theoretic operations in
the powerset, and classicality of the logic corresponds to the fact (in classical
mathematics) that powersets are Boolean algebras.

What is interesting is that when we replace the powerset by a topology (on
X, say), there is still enough lattice structure to model intuitionistic logic. The
connectives $\land$ and $\lor$ can still be translated to $\cap$ and $\cup$, which both preserve
openness. However, the direct set-theoretic correspondent of negation is
complementation, and that does not preserve openness. If proposition P is
interpreted as open set U, then $\neg P$ is interpreted as the interior of the complement
$X - U$. Similarly, if also Q is interpreted as V, then $P \to Q$ is interpreted as
the interior of $(X - U) \cup V$.

These latter operations can both be defined directly in terms of the complete
lattice structure of the topology. The interior of $X - U$ is the join of all those
opens W such that $W \cap U = \emptyset$, while the interior of $(X - U) \cup V$ is the
join of those W such that $W \cap U \subseteq V$. Every topology is a Heyting algebra;
and since intuitionistic logic freely expresses Heyting algebra structure, any
interpretation of the propositional symbols can be extended to all formulae in
a way that respects intuitionistic equivalence.


DEFINITION 8.1 A frame is a complete lattice in which the following frame
distributivity law holds ($a \in A$, $S \subseteq A$):

$$a \land \bigvee S = \bigvee \{a \land b \mid b \in S\}$$

A frame homomorphism is a function between frames that preserves joins and
finite meets.

Every topology is a frame, because the axioms for open sets tell us that $\land$
and $\bigvee$ are set-theoretic $\cap$ and $\bigcup$.


PROPOSITION 8.2 Every frame A is a Heyting algebra. 


Proof We must show that for every $a, b \in A$ there is an element $a \to b \in A$
such that for all $c \in A$,

$$(*) \quad c \le a \to b \iff c \land a \le b.$$

(If this element exists, then it is unique.) We can define

$$a \to b = \bigvee \{x \mid x \land a \le b\}.$$

Now the ($\Leftarrow$) direction in (*) is immediate, while the ($\Rightarrow$) follows from frame
distributivity, which implies that $(a \to b) \land a \le b$. QED
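
The construction in this proof can be tried out on a finite topology. The sketch below is a minimal illustration under assumed data: the three-point space and its chain of opens are chosen for the example, and the function names are hypothetical. It computes the Heyting arrow as the join of all opens W with W ∩ U ⊆ V and compares it with the interior of (X − U) ∪ V.

```python
X = frozenset({0, 1, 2})
# Assumed example topology on X: a chain of four opens.
OPENS = [frozenset(), frozenset({0}), frozenset({0, 1}), X]

def join(family):
    """Join in the frame of opens, i.e. set-theoretic union."""
    result = frozenset()
    for U in family:
        result |= U
    return result

def implies(U, V):
    """Heyting arrow: the join of all opens W whose meet with U lies below V."""
    return join(W for W in OPENS if (W & U) <= V)

def interior(S):
    """Largest open contained in the subset S."""
    return join(W for W in OPENS if W <= S)

def negation(U):
    """Intuitionistic negation: U implies the empty open."""
    return implies(U, frozenset())

def adjunction_holds():
    """Check (*): W <= (U -> V) iff W meet U <= V, for all opens U, V, W."""
    return all((W <= implies(U, V)) == ((W & U) <= V)
               for U in OPENS for V in OPENS for W in OPENS)
```

In this example `negation(frozenset({0}))` is the empty open, so the join of the open {0} with its negation is not the whole space: the law of excluded middle fails, as expected for a topology that is not a Boolean algebra.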


In fact, frames and complete Heyting algebras are the same things. However,
we distinguish between the notions because of the homomorphisms. A frame
homomorphism does not necessarily preserve the Heyting arrow, and so is not
necessarily a Heyting algebra homomorphism.

Tarski, 1938 went further, and showed a stronger property if the space X is
a dense-in-itself separable metric space (such as the real line). If $P_i$ ($i \in \mathbb{N}$)
is a countable family of propositional symbols, then an intuitionistic formula
$\phi$ in the $P_i$s is an intuitionistic theorem iff, for every interpretation of the
$P_i$s in $\Omega X$, the corresponding interpretation of $\phi$ is the whole of X. This
was explained further in McKinsey and Tarski, 1944 using an embedding of
intuitionistic propositional logic in the classical modal logic S4, which could
then be interpreted in the powerset of X with the $\Box$ modality corresponding to
the interior operator; for further details, see 5.





2.1 Lindenbaum algebras for classical logic 


A Lindenbaum algebra is a lattice of formulae modulo provable equivalence,
and these algebras and their generalizations will be key to the whole of this
chapter. Let us review how this works for classical logic, where the connection
with topology is essentially Stone’s Representation Theorem for Boolean algebras (see
Johnstone, 1982). If $\Sigma$ is a propositional signature (i.e. a set of propositional
symbols) then we write $\mathrm{Sen}_\Sigma$ for the set of sentences constructed over $\Sigma$ using
a classically adequate set of connectives. If T is a theory over $\Sigma$ (a set of
sentences), then an equivalence relation $\equiv_T$ can be defined by

$$\phi \equiv_T \psi \text{ iff } T \vdash (\phi \leftrightarrow \psi) \text{ in classical logic}$$

and the Lindenbaum algebra for T is defined as $\mathcal{LA}(\Sigma, T) = \mathrm{Sen}_\Sigma / \equiv_T$. It is
a Boolean algebra.

The central idea is to use the dual nature of homomorphisms
$g : \mathcal{LA}(\Sigma, T) \to \mathcal{LA}(\Sigma', T')$.

Logically, g is a logical translation of $(\Sigma, T)$ into $(\Sigma', T')$. It translates
propositional symbols into sentences (modulo equivalence) and preserves
theoremhood. Isomorphism gives a natural presentation-independent notion of
equivalence of theories, by mutual translatability.

Spatially, g provides a transformation of models, from $(\Sigma', T')$ to $(\Sigma, T)$
(note the reversal of direction). It is this spatial view that provides the link with
topology, for a model transformation arises this way iff it is continuous with
respect to certain topologies on the model spaces. This is shown by Stone’s
Theorem.

Because classical logic is complete, the transformation of ordinary models 
suffices to determine the Lindenbaum algebra homomorphism. However, we 
also get an alternative view by considering generalized models. Then there is a 
generic model whose transformation determines that of all the other (specific) 
models, and this idea becomes important for incomplete logics. 
Let A be a Boolean algebra. An interpretation of $\Sigma$ in A is a function
$M : \Sigma \to A$, and this extends uniquely to a function $M : \mathrm{Sen}_\Sigma \to A$ that
evaluates the connectives by the corresponding Boolean operations on A. Then
M is a model of T iff $M(\phi) = 1$ for every $\phi \in T$. Models are preserved by
Boolean algebra homomorphisms $f : A \to B$: the composite $f \circ M$ is also a
model in B. We write $\mathrm{Mod}_A(T)$ for the set of models of T in A.

Those were the generalized models referred to above. The standard models,
interpreting propositions as truth values, are found by taking $A = 2 = \{0, 1\}$.

The generic model $M_T$ of T is a particular model in the Lindenbaum algebra.
It interprets each proposition symbol $P \in \Sigma$ as the equivalence class of P as
sentence. Clearly, by definition of the Lindenbaum algebra we have $\phi \equiv_T \top$ for
every $\phi \in T$, and so $M_T(\phi) = 1$, so $M_T$ is a model. It has a universal property:
any model can be got, uniquely, by applying a Boolean algebra homomorphism
to the generic model.


PROPOSITION 8.3 Let $(\Sigma, T)$ be a classical propositional theory, and A a
Boolean algebra. Then the function $f \mapsto f \circ M_T$, taking Boolean algebra
homomorphisms $\mathcal{LA}(\Sigma, T) \to A$ into $\mathrm{Mod}_A(T)$, is a bijection.


Proof Let $M : \Sigma \to A$ be a model of T. Suppose $\phi \equiv_T \psi$. The classical proof
of $T \vdash (\phi \leftrightarrow \psi)$ will involve only finitely many elements of T, say $t_1, \ldots, t_n$.
Because of the nature of classical proofs (this requires some checking) it will
imply that $M(t_1) \land \cdots \land M(t_n) \le (M(\phi) \leftrightarrow M(\psi))$ in A. But each $M(t_i)$ is
1, because M is a model, and hence $(M(\phi) \leftrightarrow M(\psi)) = 1$ and $M(\phi) = M(\psi)$.
It follows that M factors (uniquely) through $\mathcal{LA}(\Sigma, T)$ as $f \circ M_T$ where f is
a Boolean algebra homomorphism. QED


The generic model $M_T$ corresponds to the identity homomorphism on
$\mathcal{LA}(\Sigma, T)$.
As a consequence of the proposition, we see there is a function

$$\models \; : \mathrm{Mod}_A(T) \times \mathcal{LA}(\Sigma, T) \to A$$

with $\models(M, \phi)$ the image of $\phi$ under the homomorphism corresponding to M.
In the particular case of $A = 2$, $\models(M, \phi) = 1$ iff $M \models \phi$ in the usual sense.

Now consider a homomorphism $g : \mathcal{LA}(\Sigma, T) \to \mathcal{LA}(\Sigma', T')$. By the
proposition, g corresponds to a model of T in $\mathcal{LA}(\Sigma', T')$. This gives a logical
translation of $(\Sigma, T)$ in $(\Sigma', T')$: the propositional symbols in $\Sigma$ are interpreted
as formulae over $\Sigma'$, in such a way that the axioms in T all become provable
from $T'$.

But one can also view this from the model side. A model of $T'$ in A is a
homomorphism from $\mathcal{LA}(\Sigma', T')$ to A, and precomposing this with g gives a
homomorphism from $\mathcal{LA}(\Sigma, T)$ to A, in other words a model of T. (Note the
reversal of direction!) Thus g gives a uniform way of transforming models of
$T'$ into models of T.
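
The reversal of direction can be made concrete with standard models. In the following sketch the two theories and the translation g are invented for illustration: the first theory has symbols P1, P2 and the single axiom P1 → P2, the second has symbols Q, R and no axioms, and g sends P1 to Q ∧ R and P2 to Q. Formulas are represented, as a simplifying assumption, by Python functions from valuations to truth values.

```python
from itertools import product

# Hypothetical theory T over {P1, P2}, with one axiom P1 -> P2.
T_AXIOMS = [lambda m: (not m["P1"]) or m["P2"]]
# Hypothetical theory T' over {Q, R}, with no axioms.
TPRIME_AXIOMS = []

# The translation g interprets each symbol of the first signature
# as a formula over the second signature.
G = {
    "P1": lambda m: m["Q"] and m["R"],
    "P2": lambda m: m["Q"],
}

def standard_models(symbols, axioms):
    """Yield all two-valued valuations of the symbols that satisfy the axioms."""
    for values in product([False, True], repeat=len(symbols)):
        m = dict(zip(symbols, values))
        if all(axiom(m) for axiom in axioms):
            yield m

def transform(m_prime):
    """Precompose with g: a model of T' is sent to a model of T."""
    return {symbol: formula(m_prime) for symbol, formula in G.items()}
```

The translation goes from the first theory to the second, while `transform` carries models of the second theory to models of the first. It lands in models of T because the translated axiom (Q ∧ R) → Q is a tautology.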
Trivially, $g = g \circ \mathrm{Id}_{\mathcal{LA}(\Sigma, T)}$ is completely determined by its transformation
of the generic model. The generic model is non-standard, but we also find g is
determined by its transformation of the standard models.


PROPOSITION 8.4 Let $(\Sigma, T)$ and $(\Sigma', T')$ be two propositional theories, and
let $f, g : \mathcal{LA}(\Sigma, T) \to \mathcal{LA}(\Sigma', T')$ be two homomorphisms inducing model
transformations $F, G : \mathrm{Mod}_2(T') \to \mathrm{Mod}_2(T)$. If $F = G$ then $f = g$.


Proof It suffices to show that $f(P) \equiv_{T'} g(P)$ for every $P \in \Sigma$. By
completeness, it suffices to show that for every standard model $M'$ of $T'$, $f(P)$ and
$g(P)$ have the same truth value at $M'$. But those truth values are the same as
those for P at $F(M')$ and $G(M')$ respectively, and they are equal. QED


Not every transformation of standard models is induced by a homomorphism.
Stone’s Representation Theorem represents $\mathcal{LA}(\Sigma, T)$ using a topological space
$\mathrm{Mod}_2(T)$, with sets of the form $\{M \mid M \models \phi\}$ ($\phi \in \mathcal{LA}(\Sigma, T)$) providing
a base of opens. (In fact, they are the clopens.) This is a Stone space:
compact, Hausdorff and totally disconnected (see Johnstone, 1982). The Theorem shows
that the homomorphisms correspond to the continuous maps between the model
spaces.


DEFINITION 8.5 Let $(\Sigma, T)$ and $(\Sigma', T')$ be two propositional theories. We
define a map from $(\Sigma, T)$ to $(\Sigma', T')$ to be a homomorphism from $\mathcal{LA}(\Sigma', T')$ to
$\mathcal{LA}(\Sigma, T)$ (or, equivalently, a model of $(\Sigma', T')$ in $\mathcal{LA}(\Sigma, T)$).


Note — In this chapter, the word “map” will always carry connotations of 
continuity. A map between topological spaces is understood to be continuous. 

By emphasizing the model transformations, and defining “maps” to go in 
the same direction (opposite to that of the homomorphisms), we try to foster 
a view that the theory represents its “space of models”. Propositional theories 
and maps between them form a category. Here are some simple but important 
examples. 


EXAMPLE 8.6 (1) The theory $(\emptyset, \emptyset)$ (no symbols or axioms) corresponds to the
one-element space 1. It has a unique, vacuous model in any Boolean algebra
and is final in the category of theories. $\mathcal{LA}(\emptyset, \emptyset) = 2$, and for any theory
$(\Sigma, T)$ the maps from $(\emptyset, \emptyset)$ to $(\Sigma, T)$ are equivalent to the standard models of
$(\Sigma, T)$.

(2) The inconsistent theory $(\emptyset, \{\bot\})$ corresponds to the empty space $\emptyset$. It has
no model except in the one-element Boolean algebra 1, which is $\mathcal{LA}(\emptyset, \{\bot\})$.
It is initial in the category of theories.

(3) The theory $(\{P\}, \emptyset)$ corresponds to the discrete 2-element space 2, having
two standard models $P \mapsto 0$ and $P \mapsto 1$. Models in A are elements of A. In
Stone’s Theorem, this corresponds to the fact that for any space X, clopens
are equivalent to maps $X \to 2$. Its Lindenbaum algebra $\mathcal{LA}(\{P\}, \emptyset)$ is the
Boolean algebra $4 = \{0, P, \neg P, 1\}$, which is freely generated by P.


2.2 Frames as Lindenbaum algebras 


The Stone topologies, with the Boolean algebra of clopens forming a base and 
corresponding to classical propositional logic, are very special. We generalize 
this by changing to propositional geometric logic, for which the Lindenbaum 
algebras are frames, playing the role of topologies (with all opens, not just 
clopens). 

From Tarski’s results on interpreting intuitionistic logic in topologies, one 
might expect the logic here to be intuitionistic. However, full intuitionistic 
logic is too strong. If f is a continuous map, then its inverse image function, 
restricted to open sets, will act as the corresponding homomorphism between 
Lindenbaum algebras. In general this does not preserve the Heyting arrow, 
though it is a frame homomorphism. Hence we need a logic that corresponds 
to the structure of frame rather than of Heyting algebra. 


DEFINITION 8.7 Let $\Sigma$ be a set (of propositional symbols). Geometric formulae
over $\Sigma$ are constructed from the symbols in $\Sigma$ using $\top$ (true), $\land$ and
arbitrary (possibly infinitary) disjunctions $\bigvee$. A geometric theory over $\Sigma$ is
a set of axioms of the form $\phi \to \psi$, where $\phi$ and $\psi$ are geometric formulae.

Coherent formulae and theories are defined in the same way, but without any
infinitary disjunctions. This is sometimes known as positive logic.


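Note that because of the limitations of the logic, a theory is not simply a set
of formulae. The logical rules are best described in sequent form. (A theory
is in effect a set of axiomatic sequents, and we shall often write its axioms as
sequents, using $\vdash$. We shall also write $\dashv\vdash$ for bidirectional entailment.) The
rules are identity

$$\phi \vdash \phi,$$

cut

$$\frac{\phi \vdash \psi \qquad \psi \vdash \chi}{\phi \vdash \chi},$$

the conjunction rules

$$\phi \vdash \top, \qquad \phi \land \psi \vdash \phi, \qquad \phi \land \psi \vdash \psi, \qquad \frac{\phi \vdash \psi \qquad \phi \vdash \chi}{\phi \vdash \psi \land \chi},$$

the disjunction rules

$$\phi \vdash \bigvee S \ (\phi \in S), \qquad \frac{\phi \vdash \psi \ (\text{all } \phi \in S)}{\bigvee S \vdash \psi},$$

and frame distributivity

$$\phi \land \bigvee S \vdash \bigvee \{\phi \land \psi \mid \psi \in S\}.$$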


Note that $\bigvee \emptyset$ plays the role of $\bot$ (false). Note also that frame distributivity
allows us to reduce every formula to a disjunction of finite conjunctions of
symbols from $\Sigma$. Hence although the formulae as defined syntactically form a
proper class, modulo equivalence they form a set (classically, and according to
at least some constructive foundations).

We write $\Omega[T]$ for the Lindenbaum algebra of T, i.e. the set of geometric
formulae modulo equivalence provable from T. The logical rules imply that it
is a frame. There is an obvious notion of model of T in any frame, and $\Omega[T]$ has
a particular generic model $M_T$ given by interpreting each propositional symbol
as its equivalence class of formulae.


PROPOSITION 8.8 (cf. Proposition 8.3.) Let $(\Sigma, T)$ be a geometric theory,
and let A be a frame. Then the function $f \mapsto f \circ M_T$, taking frame
homomorphisms $\Omega[T] \to A$ into $\mathrm{Mod}_A(T)$, is a bijection.


Proof If $M : \Sigma \to A$ is a model then it extends to a function M on the class
of formulae. All the logical rules will be valid in A under this interpretation,
and the axioms in T will all hold because M is a model, so it follows that M
factors (uniquely) via $\Omega[T]$. QED


Standard models are given, as usual, by interpreting the propositional symbols
as truth values. However, we write $\Omega$ for the frame of truth values, by
contrast with 2 for the Boolean algebra. (This follows the topos-theoretic
notation.) Constructively they are different, allowing for the fact that geometric
logic is a positive logic. A geometric truth value is equivalent to a subset of a
singleton, while a Boolean truth value is a decidable subset of a singleton.


DEFINITION 8.9 Let A be a frame. Then the geometric theory $\mathrm{Th}_A$ is
presented as follows. For the signature $\Sigma$, introduce a propositional symbol $P_a$
for each $a \in A$, and then take axioms

$$P_a \to P_b \quad (a \le b \text{ in } A)$$
$$P_a \land P_b \to P_{a \land b} \quad (a, b \in A)$$
$$\top \to P_1$$
$$P_{\bigvee S} \to \bigvee_{a \in S} P_a \quad (S \subseteq A)$$

All the theory is saying is that the finite meets and arbitrary joins in A should
be treated logically as finite conjunctions and arbitrary disjunctions. Hence the
connectives of propositional geometric logic correspond directly to the frame
structure. From this it follows that the models of $\mathrm{Th}_A$ in a frame B are the
frame homomorphisms from A to B, and so we see that $\Omega[\mathrm{Th}_A] \cong A$.

A standard model of $\mathrm{Th}_A$ can also be described by saying which propositional
symbols $P_a$ are assigned the truth value true, and hence by a subset $F \subseteq A$
satisfying certain conditions corresponding to the axioms. The first axiom says
that F is an upper subset of A, the next two that F is a filter, and then the
fourth that it is a completely prime filter. (Note: the standard texts contain
various other descriptions of the standard models, but they are constructively
inequivalent.)
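
For a finite frame the completely prime filters can be enumerated by brute force. The sketch below works classically and only for frames given as the opens of a finite topological space; the two example topologies and all function names are assumptions made for illustration. It tests the four conditions listed above: upper closure, containing the top element, closure under binary meets, and complete primeness (which, for the empty join, excludes the bottom element).

```python
from itertools import chain, combinations

def powerset(items):
    """All subsets of items, as tuples."""
    items = list(items)
    return chain.from_iterable(combinations(items, r) for r in range(len(items) + 1))

def completely_prime_filters(opens):
    """Enumerate the completely prime filters of a finite topology."""
    opens = [frozenset(U) for U in opens]
    top = frozenset().union(*opens)  # the whole space, assumed to be listed as an open
    found = []
    for candidate in powerset(opens):
        F = set(candidate)
        upper = all(V in F for U in F for V in opens if U <= V)
        meets = all((U & V) in F for U in F for V in F)
        # Whenever a join of opens lies in F, some member of the family does.
        prime = all(any(U in F for U in S)
                    for S in powerset(opens)
                    if frozenset().union(*S) in F)
        if upper and top in F and meets and prime:
            found.append(frozenset(F))
    return found

def neighbourhood_filter(opens, point):
    """The opens containing a given point; always a completely prime filter."""
    return frozenset(frozenset(U) for U in opens if point in U)

CHAIN = [(), (0,), (0, 1), (0, 1, 2)]   # assumed example: a T0 space with 3 points
INDISCRETE = [(), (0, 1)]               # assumed example: 2 points, only trivial opens
```

For `CHAIN` the filters found are exactly the neighbourhood filters of the three points. For `INDISCRETE` there is a single filter although the space has two points, which illustrates that the frame cannot distinguish points with the same open neighbourhoods.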


2.3 Locales 


Let us define, conceptually, a locale to be a “propositional geometric theory 
pretending to be a space”, using the ideas of Sec. 2.1, which took the logical the- 
ory as the starting point. That is to say, the locale is the theory, but repackaged 
in a spatial language of points and maps instead of models and Lindenbaum al- 
gebra homomorphisms. What makes this repackaging significant is the fact that 
geometric logic is incomplete—in general, there are not enough standard mod- 
els to account for all the frame homomorphisms (cf. Proposition 8.4). Thus the 
spatial side (in terms of standard models) and the logical side (in terms of Lin- 
denbaum algebras) become mathematically inequivalent. However, the logical 
side still contains good topological results; indeed, in constructive mathematics 
they are often better than the spatial ones. The localic repackaging makes it 
much easier to see this topological content. 

The usual definition is that a locale is a frame. We prefer to say it is the 
propositional geometric theory, and that it has a frame. This makes it easier to 
see locales as a special case of toposes, which arise from predicate geometric 
theories. In addition, in certain foundational schools such as predicative type 
theory, the frames are problematic. They are constructed using the powerset, 
and that is impredicative. The main account in this school is the formulation as 
“formal topology” (Sambin, 1987). 

A formal topology gives a base S (so every frame element is to be a join of
base elements) and the cover relation $\triangleleft$, which describes when one basic open
is to be covered by a set of others. This then corresponds to a propositional
geometric theory in which S provides the propositional symbols, and the cover 
relation provides axioms to say one symbol entails a disjunction of others. (For 
these purposes, the notion of map can be defined in a more primitive way that 
does not rely on constructing the frame.) The variant notion of inductively 
generated formal topology (Coquand et al., 2003) is even closer to the propo- 
sitional geometric theory in that it does not require the complete cover relation 
but just a part from which the rest can be deduced. 

Let us review the ideas of Sec. 2.1 in the light of Sec. 2.2. 




DEFINITION 8.10 1 A locale is (presented by) a propositional geometric
theory. If the theory is T, we write [T] for the locale. The locale [T]
should be conceptualized as “the space of models of T”.

2 If X is a locale, then $\Omega X$ denotes its Lindenbaum algebra, a frame.

3 The opens of X are the elements of $\Omega X$.

4 If X and Y are locales, then a map $f : X \to Y$ is a frame homomorphism
$f^* : \Omega Y \to \Omega X$. We write Map(X, Y) for the set of maps from X to
Y. Locales and maps form a category Loc, dual to the category Fr of
frames.

5 If X and Y are locales then Map(X, Y) is partially ordered by the
specialization order, $f \sqsubseteq f'$ if $f^*(U) \le (f')^*(U)$ for every $U \in \Omega Y$.

6 If X and W are locales then the (generalized) points of X at stage (of
definition) W are the maps $W \to X$. Think of these as points of X
“continuously parametrized by a variable point of W”. The points of
[T] at stage W are just the models of T in $\Omega W$.

7 If X is a locale then the identity map $X \to X$, a point of X at stage X,
is the generic point of X.

8 If $f : X \to Y$ is a map, then postcomposition $f \circ -$ transforms points of
X (at any stage) to points of Y (at the same stage). We shall often write
$f(x)$ for $f \circ x$.


The specialization order is already present (as a preorder) in ordinary
topology: x ⊑ x′ if every neighbourhood of x also contains x′ (i.e. x is in the
closure of {x′}). It is often neglected there, because for Hausdorff spaces
(more precisely for T1 spaces) it is discrete: x ⊑ x′ iff x = x′.

An important fact about the specialization order is that it has directed joins.
In any poset (P, ≤), a family of elements (x_i)_{i∈I} is directed if I is inhabited
and, for any i, j ∈ I, there is some k ∈ I such that x_i ≤ x_k and x_j ≤ x_k.
We also say P is a directed complete poset (or dcpo) if it has a join for every
directed family. We use an arrow, as in ⨆↑_i x_i, to indicate that a join is of a
directed family.
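
The definition of a directed family can be checked mechanically on a finite poset. The following is a minimal sketch, not part of the original text: the function name `is_directed` and the example poset (subsets of {0, 1, 2} under inclusion) are chosen here purely for illustration.

```python
from itertools import product

def is_directed(family, leq):
    """Return True if `family` is inhabited and every pair of its members
    has an upper bound that is again a member of the family."""
    family = list(family)
    if not family:                      # a directed family must be inhabited
        return False
    return all(
        any(leq(x, z) and leq(y, z) for z in family)
        for x, y in product(family, repeat=2)
    )

# Illustrative poset: subsets of {0, 1, 2} ordered by inclusion.
subset_leq = lambda a, b: a <= b

chain = [frozenset(), frozenset({0}), frozenset({0, 1})]
antichain = [frozenset({0}), frozenset({1})]

print(is_directed(chain, subset_leq))      # a chain is directed
print(is_directed(antichain, subset_leq))  # {0} and {1} have no upper bound in the family
print(is_directed([], subset_leq))         # the empty family is not inhabited
```

Note that the empty family is rejected: this is why a dcpo need not have a bottom element.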


PROPOSITION 8.11 Let X and Y be locales. Then Map(X, Y) is directed
complete with respect to ⊑. The directed joins are preserved by composition
on either side.











Proof Let (f_i)_{i∈I} be directed in Map(X, Y). The join ⨆↑_i f_i is given as a
frame homomorphism by

\[ \Bigl(\bigsqcup^{\uparrow}_{i} f_i\Bigr)^{*}(U) = \bigvee_{i} f_i^{*}(U). \]

QED


The directed joins are less familiar from ordinary topology. This is partly 
because so many familiar spaces have discrete specialization order, but also 
because in the absence of sobriety (Sec. 2.4) the directed joins may be missing. 
However, they are fundamental in computer science and provide a means for 
providing the semantics of recursive algorithms. (See, e.g., Plotkin, 1981, Gierz 
et al., 1980, Vickers, 1989.) From the proposition we see one essential feature 
of maps, namely that they preserve directed joins of points. (This is known as 
Scott continuity.) 
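
To illustrate how directed joins give meaning to recursion, here is a small sketch under stated assumptions: partial functions on the natural numbers are represented as Python dicts ordered by inclusion of graphs, and the names `kleene_chain` and `factorial_step` are hypothetical helpers introduced only for this example. The recursive definition of factorial is read as the join of an increasing chain of finite approximations.

```python
def kleene_chain(step, bottom, max_steps):
    """Yield bottom, step(bottom), step(step(bottom)), ...
    For a monotone `step` this is an increasing chain of approximations."""
    approx = bottom
    for _ in range(max_steps):
        yield approx
        approx = step(approx)

def factorial_step(partial):
    """One unfolding of  fact(n) = 1 if n == 0 else n * fact(n - 1),
    applied to a partial function represented as a dict."""
    improved = {0: 1}
    for n, value in partial.items():
        improved[n + 1] = (n + 1) * value
    return improved

chain = list(kleene_chain(factorial_step, {}, 6))

# Each approximation extends the previous one, so the family is a chain
# (hence directed) in the inclusion order on graphs.
assert all(a.items() <= b.items() for a, b in zip(chain, chain[1:]))

# The directed join is the union of the graphs.
join = {}
for approx in chain:
    join.update(approx)
print(join)
```

Only finitely many approximations are computed here; the full factorial function is the join of the whole infinite chain.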


DEFINITION 8.12 (1) The one-point locale 1 is presented by the empty theory
over the empty signature. Ω1 is Ω. (Note: we shall also write 1 for a singleton
set. In practice this ambiguity should not cause problems.) The global points
of a locale X are its points at stage 1, i.e. the maps 1 → X. Thus the global
points of [T] are the standard models of T.

(2) The empty locale ∅ is presented by the inconsistent theory {⊤ → ⊥}
over the empty signature. Ω∅ is a one-element frame. It has no points except
at stage ∅.

(3) The Sierpiński locale 𝕊 is presented by the empty theory over a one-
element signature {P}. Its points are equivalent to subsets of the set 1. We
usually write ⊤ for the subset 1 itself (an open point), and ⊥ for the empty
subset (a closed point). Note that ⊥ ⊑ ⊤.





REMARK 8.13 The opens U′ of any locale X are equivalent to the maps
U : X → 𝕊, with U′ = U*(P). If f : X → Y is a map and U′ is an open of Y,
corresponding to U : Y → 𝕊, then

\[ f^{*}(U') = f^{*}(U^{*}(P)) = (U \circ f)^{*}(P), \]

and hence f*(U′) corresponds to U ∘ f. Hence we can talk about opens and
inverse image functions purely in the language of maps.


2.4 Locales compared with spaces 


Now we have this language of points, opens and maps, all deriving from the 
single notion of geometric theory, we shall compare it with ordinary topology. 

Given a locale X, let us write pt(X) for its set of global points, maps
x : 1 → X. If U : X → 𝕊 is an open, then the composite U ∘ x is a global
point of 𝕊, and hence a subset of 1. We write x ⊨ U iff U ∘ x = ⊤ (i.e.
x*(U) = 1), and ext(U) (the extent of U) for {x ∈ pt(X) | x ⊨ U}. Because
x* : ΩX → Ω is a frame homomorphism, we find that the sets ext(U) form a
topology on pt(X).

Now let f : X → Y be a map of locales, giving a point transformer pt(f) :
pt(X) → pt(Y). If V is an open of Y, then

\[
\begin{aligned}
x \in \mathrm{pt}(f)^{-1}(\mathrm{ext}(V)) &\iff f \circ x \vDash V \\
&\iff V \circ f \circ x = \top \\
&\iff x \in \mathrm{ext}(f^{*}(V)).
\end{aligned}
\]

It follows that pt(f)⁻¹(ext(V)) = ext(f*(V)), and so pt(f) is continuous.


• For each locale X, its global points form a topological space pt(X).


• For each map of locales, the corresponding transformation pt(f) of global
points is continuous.


This looks promising, but there is not an exact match between locales and 
topological spaces, and we need to understand that. The central connection, a 
categorical adjunction, is summarized in the following result. For an element 
x of a topological space X, we write N_x = {U ∈ ΩX | x ∈ U} for the set of
open neighbourhoods of x. This is a completely prime filter in ΩX.


PROPOSITION 8.14 Let X be a topological space and Y a locale. Then there
is a bijection between


1 maps (continuous, as always) f : X → pt(Y), and


2 maps g : [Th_ΩX] → Y (homomorphisms g* : ΩY → ΩX).


Proof Consider the following condition on pairs (f, φ) where f : X → pt(Y)
and φ : ΩY → ΩX are arbitrary functions:

\[ (\forall x \in X,\ \forall V \in \Omega Y)\quad f(x) \in \mathrm{ext}(V) \iff x \in \phi(V). \]

This is equivalent to

\[ (\forall V \in \Omega Y)\quad \phi(V) = f^{-1}(\mathrm{ext}(V)) \]

and, considering the points of pt(Y) as completely prime filters of ΩY and
remembering that f(x) ∈ ext(V) iff V ∈ f(x), to

\[ (\forall x \in X)\quad f(x) = \phi^{-1}(N_x). \]

It follows that φ is determined by f, and f is determined by φ.

Under these conditions, it follows that f is continuous and φ is a frame
homomorphism. Conversely, if f is continuous then inverse image f⁻¹ gives
a corresponding φ; and given a frame homomorphism φ, we find that each
φ⁻¹(N_x) is a completely prime filter. QED

The first mismatch between spaces and locales is that not every space comes
from a locale.

Let (X, ΩX) be a topological space, with ΩX the topology, i.e. the family of
open sets. Consider the locale [Th_ΩX], whose global points are the completely
prime filters of ΩX. For every point x ∈ X, its open neighbourhood filter N_x
is a completely prime filter. However, two points x and y might have the same
open neighbourhood filter: every open containing x also contains y, and vice
versa. A space is called T0 if this never happens, i.e. if N_x = N_y then x = y.

In addition, there may be a completely prime filter that is not N_x for any
point x.


EXAMPLE 8.15 Consider the set ℕ of natural numbers with a topology in
which the opens are the upper sets. The completely prime filter of non-empty
upper sets is not the open neighbourhood filter of any point.


A space is sober if the correspondence x ↦ N_x is a bijection between points
and completely prime filters. For sober spaces, we might just as well consider 
them as locales—we lose nothing by using the geometric theories to study sober 
topological spaces. 
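
For a finite space the sobriety condition can be tested by brute force. The sketch below is an illustration only, with hypothetical helper names (`completely_prime_filters`, `is_sober`); it relies on the assumption that, in a finite frame, a filter is completely prime exactly when it omits the empty open and is prime for binary unions.

```python
from itertools import combinations

def powerset(items):
    items = list(items)
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            yield frozenset(combo)

def completely_prime_filters(opens):
    """All completely prime filters of a finite topology (a set of frozensets)."""
    opens = set(opens)
    top = frozenset().union(*opens)
    result = []
    for candidate in powerset(opens):
        F = set(candidate)
        if top not in F or frozenset() in F:
            continue                      # must contain top, must omit the empty join
        upper = all(V in F for U in F for V in opens if U <= V)
        meets = all((U & V) in F for U in F for V in F)
        prime = all(U in F or V in F
                    for U in opens for V in opens if (U | V) in F)
        if upper and meets and prime:
            result.append(frozenset(F))
    return result

def is_sober(points, opens):
    """Check that x |-> N_x is a bijection onto the completely prime filters."""
    nbhd = [frozenset(U for U in opens if x in U) for x in points]
    injective = len(set(nbhd)) == len(nbhd)          # the T0 condition
    surjective = set(nbhd) == set(completely_prime_filters(opens))
    return injective and surjective

# Sierpinski space: points 0 and 1, with {1} open.
sierpinski = {frozenset(), frozenset({1}), frozenset({0, 1})}
# Indiscrete two-point space: the two points share a neighbourhood filter.
indiscrete = {frozenset(), frozenset({0, 1})}

print(is_sober({0, 1}, sierpinski))
print(is_sober({0, 1}, indiscrete))
```

The second space fails only because it is not T0. Example 8.15 needs an infinite space and is out of reach of this finite check.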


PROPOSITION 8.16 Let X and Y be sober spaces. Then there is a bijection
between maps f : X → Y, and maps g : [Th_ΩX] → [Th_ΩY] (homomorphisms
g* : ΩY → ΩX).


Proof Apply Proposition 8.14 with [Th_ΩY] substituted for the locale Y.
Sobriety assures us that the sober space Y is homeomorphic to pt([Th_ΩY]). QED


Any “point-free” approach, constructing the points out of the logic, is in- 
evitably sober. Many well-behaved spaces, for instance all Hausdorff spaces, 
are sober, and any space can be “soberified” by replacing it by the space of 
completely prime filters. 

The second mismatch between spaces and locales is that not every locale 
comes from a space. This arises out of an important logical fact, that geometric 
logic is not complete. 

Stone’s Theorem showed how each Boolean algebra is isomorphic to a sub- 
Boolean-algebra of a powerset. In logical terms, there are always enough 
standard models to distinguish between inequivalent sentences. This is a con- 
sequence of completeness. 

By contrast, locales do not always have enough global points to discriminate 
between the opens. For each locale X, the extent homomorphism ext : ΩX →
𝒫 pt(X) defines a topology on pt(X). The locale X is spatial if ext is 1-1,
but not all locales are spatial.

For example, let ℝ be the real line with its usual topology. Let T be Th_Ωℝ
extended by extra axioms ¬¬U → U for every U ∈ Ωℝ. (¬¬ is the Heyting
double negation in Ωℝ. Concretely, ¬¬U is the interior of the closure of U.)
pt[T] is a subspace of ℝ, comprising those reals x such that for every U, if x ∈
¬¬U then x ∈ U. There are no such x, for consider U = (−∞, x) ∪ (x, ∞),
which has ¬¬U = ℝ. But ¬¬ is a nucleus (see e.g. Johnstone, 1982), and
an immediate consequence of the general theory is that the opens of [T] are
equivalent to the regular opens of ℝ, i.e. those U for which ¬¬U = U. Hence
[T] is a non-trivial locale with no global points, hence non-spatial.
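
The operation "interior of the closure" can be tried out on a finite space, where everything is computable. The following sketch is an analogy only, not the real line: the three-point space and the helper names `interior` and `double_negation` are assumptions made for this illustration. In it the open {a, b} plays the role of U = (−∞, x) ∪ (x, ∞): it is dense, so its double negation is the whole space.

```python
def interior(subset, opens):
    """Largest open contained in `subset` (union of all opens below it)."""
    result = frozenset()
    for U in opens:
        if U <= subset:
            result |= U
    return result

def double_negation(U, points, opens):
    """Interior of the closure of U, with closure taken as the complement
    of the interior of the complement."""
    closure = points - interior(points - U, opens)
    return interior(closure, opens)

# Illustrative space on {a, b, c}: the point c lies in the closure of both a and b.
points = frozenset("abc")
opens = [frozenset(), frozenset("a"), frozenset("b"), frozenset("ab"), points]

regular = [U for U in opens if double_negation(U, points, opens) == U]
print(sorted("".join(sorted(U)) for U in regular))
print(double_negation(frozenset("ab"), points, opens) == points)
```

The regular opens here are the empty set, {a}, {b} and the whole space; {a, b} is open but not regular.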

Logically, spatiality is the same as completeness, but there is a difference 
of emphasis. Completeness refers to the ability of the logical reasoning (from 
rules and axioms) to generate all the equivalences that are valid for the models: 
if not, then it is the logic that is considered incomplete. Spatiality refers to 
the existence of enough models to discriminate between logically inequivalent 
formulae: if not, then the class of models is incomplete. 

In classical mathematics, most important locales are spatial; but this can rely 
on the axiom of choice to find sufficient points. In constructive mathematics 
many important locales (such as the real line) behave better in non-spatial 
form, and if we spatialize by topologizing the global points, then important 
theorems (such as the Heine-Borel Theorem) become false. This has led to 
a common misconception that constructive topology is deficient in theorems. 
This is actually not true, and the purpose of this chapter is to show how topology 
and constructive reasoning are intimately related. However, an important step 
is to forgo any dependence on spatiality, that is, on a space being carried 
by an untopologized set of points. 

Happily, constructive reasoning itself contains the key to dealing with non- 
spatiality. Spatiality is an issue when we try to deal with a locale in terms of 
its global points, of which there might not be enough. But there are enough 
generalized points. For example, a map of locales is defined by its action on 
the generic point. The generalized points live in non-Boolean lattices (or, as 
we shall shortly see, in the non-classical mathematics of sheaves), and it is 
convenient to deal with them using constructive mathematics as a tool. 


2.5 Example: the localic reals 


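As an adaptation of the localic reals in Johnstone, 1982, we present a
propositional geometric theory T_ℝ with propositional symbols P_qr (q, r ∈ ℚ,
the rationals) and axioms:

\[
\begin{aligned}
P_{qr} \wedge P_{q'r'} &\leftrightarrow \bigvee \{ P_{st} \mid \max(q,q') < s < t < \min(r,r') \} \\
\top &\rightarrow \bigvee \{ P_{q-\varepsilon,\,q+\varepsilon} \mid q \in \mathbb{Q} \} \quad \text{if } 0 < \varepsilon \in \mathbb{Q}
\end{aligned}
\]

PROPOSITION 8.17 In the theory T_ℝ, we can derive the following.

\[
\begin{aligned}
P_{qr} &\dashv\vdash \bigvee \{ P_{st} \mid q < s < t < r \} \\
P_{qr} \wedge P_{q'r'} &\dashv\vdash P_{\max(q,q'),\,\min(r,r')} \\
P_{qr} &\vdash \bot && \text{if } r \le q \\
P_{q'r'} &\vdash P_{qr} && \text{if } q \le q' \text{ and } r' \le r \\
P_{qt} &\vdash P_{qr} \vee P_{st} && \text{if } q < s < r < t
\end{aligned}
\]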





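Proof These are all straightforward except the last. If q < s < r < t then let
ε = (r − s)/2. We have

\[
\begin{aligned}
P_{qt} &\vdash \bigvee \{ P_{u-\varepsilon,\,u+\varepsilon} \mid u \in \mathbb{Q} \} \wedge P_{qt} \\
&\vdash \bigvee \{ P_{\max(u-\varepsilon,\,q),\,\min(u+\varepsilon,\,t)} \mid u \in \mathbb{Q} \}.
\end{aligned}
\]

Now for any u ∈ ℚ, we cannot have both u + ε > r and u − ε < s, for then
r − ε < u < s + ε, so r − s < 2ε, contradiction. Hence either u + ε ≤ r,
in which case P_{max(u−ε,q),min(u+ε,t)} ⊢ P_qr, or u − ε ≥ s, in which case
P_{max(u−ε,q),min(u+ε,t)} ⊢ P_st. QED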
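
The case split in this proof can be sanity-checked with exact rational arithmetic. This is only a numerical illustration of the interval containments that the proof uses, under the interpretation of P_qr as the open interval (q, r); the function name `covering_piece` and the sample values are assumptions of this sketch, and it is not a derivation in the logic.

```python
from fractions import Fraction

def covering_piece(q, s, r, t, u):
    """For q < s < r < t and a rational centre u, report which of P_qr, P_st
    contains the interval (max(u - eps, q), min(u + eps, t)), eps = (r - s)/2."""
    eps = (r - s) / 2
    lo, hi = max(u - eps, q), min(u + eps, t)
    if u + eps <= r:
        assert q <= lo and hi <= r      # the piece lies inside (q, r)
        return "P_qr"
    # Otherwise u + eps > r, and u - eps < s would force r - s < 2 * eps.
    assert u - eps >= s
    assert s <= lo and hi <= t          # the piece lies inside (s, t)
    return "P_st"

q, s, r, t = map(Fraction, (0, 1, 2, 3))
for u in (Fraction(n, 4) for n in range(-8, 21)):
    print(u, covering_piece(q, s, r, t, u))
```

Every sample centre u lands in one of the two cases, and the internal assertions confirm the corresponding containment.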


In Sec. 4.7 we shall see how the models of this are equivalent to Dedekind 
sections of the rationals. 

We have a model of the theory in Ωℝ, interpreting P_qr as the open interval
(q, r) = {x ∈ ℝ | q < x < r}, and hence a frame homomorphism α :
Ω[T_ℝ] → Ωℝ. Clearly it is onto, since the open intervals (q, r) form a base of
opens. It is also 1-1, for suppose φ and ψ are elements of Ω[T_ℝ] (geometric
formulae) such that α(φ) ⊆ α(ψ). We show that φ ⊢ ψ. Any finite meet
of symbols P_qr is a join of such symbols, and it follows that any formula
φ is equivalent to a join of such symbols. Hence it suffices to show that if
α(P_qr) ⊆ α(ψ) then P_qr ⊢ ψ. From Proposition 8.17, it suffices to show that
if α(P_qr) ⊆ α(ψ) and q < q′ < r′ < r then P_q′r′ ⊢ ψ. Let S = {s ∈ ℚ |
q′ ≤ s ≤ r′ and P_q′s ⊢ ψ}. S is non-empty (because q′ ∈ S) and bounded
above (by r′), and so it has a supremum, a real number x. Since q′ ≤ x ≤ r′
we have x ∈ α(P_qr) and so x ∈ α(ψ). Since ψ too is a join of symbols P_tu,
we can find one such that P_tu ⊢ ψ and x ∈ α(P_tu), i.e. t < x < u. If t < q′,
then P_q′u ⊢ ψ. On the other hand, suppose q′ ≤ t. Choose a rational t′ with
t < t′ < x. Then by definition of x we have P_q′t′ ⊢ ψ. Again, but this time
using Proposition 8.17, we get P_q′u ⊢ ψ. It follows that min(r′, u) ∈ S, so
x ≤ min(r′, u) ≤ x. Since x < u, it follows that x = r′ < u, so P_q′r′ ⊢ ψ.

It follows that the locale [T_ℝ] is spatial, with Ω[T_ℝ] ≅ Ωℝ. However,
the proof just given is classical, in particular in its assumption that a non-
empty set of rationals, bounded above, has a real-valued supremum. There are
(Fourman and Hyland, 1979) non-classical examples where [T_ℝ] is not spatial.
This might be seen as a defect of the locale [T_ℝ], but in fact this non-spatial
locale has better constructive behaviour than the space ℝ. For example, the
Heine-Borel Theorem holds for the locale but not, in general, for the space
(Fourman and Grayson, 1982).


3. Predicate geometric logic 


In a sense, the propositional geometric logic is all that is needed for treating 
“topologies as Lindenbaum algebras”. However, there are good reasons for 
extending these ideas to the case of predicate logic. 

The first is Grothendieck’s discovery that there are certain situations that 
involve topology and continuity, but where topological spaces are inadequate 
for expressing them. He invented toposes to cover these situations, and said, 
“toposes are generalized topological spaces”. The generalization is essentially 
that from propositional to predicate geometric logic, and the toposes are a 
categorical version of Lindenbaum algebra appropriate to this predicate case. 
Thus it would be more accurate to say that toposes are generalized locales. 
Again we can view the topos as a space of models, but in general there are not 
enough opens to define the generalized topological structure, and sheaves must 
be used instead. Another point of generalization is that the classes Map(X, Y) 
are no longer posets but have category structure—the specialization order is 
replaced by specialization morphisms. (Specialization morphisms between 
points correspond to homomorphisms between models.) 

A second reason for studying the predicate logic is that quite often it is natural 
to replace a propositional geometric theory by an equivalent predicate theory. 
The reason this is possible in any but the most trivial cases is a remarkable 
consequence of having infinitary disjunctions. These allow sorts to be charac- 
terized uniquely up to isomorphism as, for example, the natural numbers. This 
means that there is an intrinsic type theory in predicate geometric logic, and 
one can work not so much in geometric logic as in a geometric mathematics, 
which turns out to have an intrinsic continuity. 

To fix our logical terminology, we say that a many-sorted, first-order signa- 
ture has a set of sorts, a set of predicate symbols, and a set of function symbols. 
Each predicate or function symbol has an arity stipulating the number and sorts 
of its arguments, and (for a function) the sort of its result. A predicate symbol 
with no arguments is propositional, while a function with no arguments is a 
constant. We shall express the arities of predicates and functions thus: 


    P ⊆ A_1, ..., A_n          (for a predicate)
    P ⊆ 1                      (for a proposition)
    f : A_1, ..., A_n → B      (for a function)
    c : B                      (for a constant)

We shall also freely use vector notation, writing e.g. A⃗ instead of A_1, ..., A_n.
In many situations, as here, this is to be understood as representing a product.


DEFINITION 8.18 Let Σ be a many-sorted, first-order signature. If x⃗ is a
(finite) vector of distinct variables, each with a given sort, then a geometric
formula over Σ in context x⃗ is a formula built up using term formation from
the variables x⃗ and the function symbols of Σ, and formula formation from the
terms and the predicate symbols from Σ using =, ∧, ⊤, ⋁ (possibly infinitary)
and ∃. Note that, even with infinitary disjunctions, a formula is allowed only
finitely many free variables, since they all have to be taken from the finite context
x⃗. Not all the variables in the context have to be used in the formula.
A geometric theory over Σ is a set of axioms of the form

\[ (\forall \vec{x})\ (\phi \rightarrow \psi) \]

where φ and ψ are geometric formulae over Σ in context x⃗. (We shall also
commonly write such axioms in sequent form φ ⊢_x⃗ ψ.)

A geometric theory is coherent if all disjunctions used in it are finitary. (Note 
that Mac Lane and Moerdijk, 1992, X.3 uses the word “geometric” to mean 
“coherent”.) 





When we need to make explicit reference to the context of a term or formula,
we shall use notation such as (x⃗.t) or (x⃗.φ).


DEFINITION 8.19 Let T₁ and T₂ be two geometric theories. A theory mor-
phism F from T₁ to T₂ comprises the following data.


1 To each sort A of T₁, there is assigned a sort F(A) of T₂. After this, each
arity α for T₁ can be translated to an arity F(α) for T₂.


2 To each function symbol f of T₁, with arity α, there is assigned a function
symbol F(f) of T₂, with arity F(α). After this, each term in context (x⃗.t)
for T₁ can be translated to a term in context (x⃗.F(t)) for T₂.


3 Similarly, to each predicate symbol P of T₁ with arity α, there is assigned
a predicate symbol F(P) of T₂, with arity F(α). After this, each formula
in context (x⃗.φ) for T₁ can be translated to a formula in context (x⃗.F(φ))
for T₂.


4 To each axiom (∀x⃗)(φ → ψ) of T₁, there is an axiom (∀x⃗)(F(φ) →
F(ψ)) of T₂.


Note: theory morphisms are presentation-dependent, and do not provide the
general notion of map.


3.1 Logical rules


For the logical rules of predicate geometric logic we again follow the account
in Johnstone, 2002b, D1.3.1. They are expressed using sequents of the form
φ ⊢_x⃗ ψ where φ and ψ are formulae in context x⃗.

Labelling the turnstile with the context allows us to give a clean treatment
of empty carriers, following Mostowski (see Lambek and Scott, 1986). This is
exemplified by the following non-geometric deduction:


    (∀x) φ(x)
    φ(a)
    (∃x) φ(x).


This purports to prove a sequent (∀x) φ(x) ⊢ (∃x) φ(x), but that is invalid
with an empty carrier. The true conclusion is that we can make the inference in
the context a, which we write (∀x) φ(x) ⊢_a (∃x) φ(x). This is valid provided
a is interpreted. But from that we cannot infer (∀x) φ(x) ⊢ (∃x) φ(x). (Some
such device is necessary in constructive logic, where excluding empty carriers
would be a serious problem. But it ought also to be better known in classical
logic. See also Example 8.27.)
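
The role of the context can be seen concretely in the standard semantics in sets. The sketch below is an informal illustration with hypothetical function names: a sequent in the empty context is a single truth-value check, while a sequent in context a is checked once for each value of a, and so holds vacuously when the carrier is empty.

```python
def forall(carrier, phi):
    return all(phi(x) for x in carrier)

def exists(carrier, phi):
    return any(phi(x) for x in carrier)

def holds_in_empty_context(carrier, phi):
    """(forall x) phi(x) |- (exists x) phi(x), read with no context variable."""
    return (not forall(carrier, phi)) or exists(carrier, phi)

def holds_in_context_a(carrier, phi):
    """(forall x) phi(x) |-_a (exists x) phi(x): one check per value of a."""
    return all((not forall(carrier, phi)) or exists(carrier, phi)
               for a in carrier)

positive = lambda x: x > 0

for carrier in ([1, 2, 3], [0, 1], []):
    print(carrier,
          holds_in_empty_context(carrier, positive),
          holds_in_context_a(carrier, positive))
```

Only the empty carrier separates the two readings: there the sequent in context a holds, but the sequent in the empty context fails.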

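The rules of predicate geometric logic are those of the propositional logic
(with the context labels added) together with the following: substitution is

\[ \frac{\phi \vdash_{\vec{x}} \psi}{\phi(\vec{s}/\vec{x}) \vdash_{\vec{y}} \psi(\vec{s}/\vec{x})} \]

where s⃗ is a vector of terms in context y⃗, with sorts matching those of x⃗; the
equality rules

\[ \top \vdash_{x} x = x, \qquad (\vec{x} = \vec{y}) \wedge \phi \vdash_{\vec{z}} \phi(\vec{y}/\vec{x}) \]

(z⃗ has to include all the variables in x⃗ and y⃗, as well as those free in φ); the
existential rules

\[ \frac{\phi \vdash_{\vec{x},y} \psi}{(\exists y)\phi \vdash_{\vec{x}} \psi}, \qquad \frac{(\exists y)\phi \vdash_{\vec{x}} \psi}{\phi \vdash_{\vec{x},y} \psi}; \]

and the Frobenius rule

\[ \phi \wedge (\exists y)\psi \vdash_{\vec{x}} (\exists y)(\phi \wedge \psi). \]

Note that the substitution rule justifies context weakening

\[ \frac{\phi \vdash_{\vec{x}} \psi}{\phi \vdash_{\vec{x},y} \psi}. \]
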
In other words, a deduction in one context will still be valid if we add extra
variables, though not if we remove unused variables (which is what was done
in the example of (∀x) φ(x) ⊢ (∃x) φ(x)).


    Syntax                             Interpretation

    sort A                             carrier set {M|A}
    sort tuple A⃗ = (A_1, ..., A_n)     {M|A⃗} = ∏_{i=1}^{n} {M|A_i}
    predicate P ⊆ A⃗                    subset {M|P} ⊆ {M|A⃗}
    proposition P ⊆ 1                  subset {M|P} ⊆ 1
    function f : A⃗ → B                 function {M|f} : {M|A⃗} → {M|B}
    constant c : B                     element {M|c} ∈ {M|B}
    formula in context (x⃗.φ)           subset {M|x⃗.φ} ⊆ {M|σ(x⃗)}
    term in context (x⃗.t)              function {M|x⃗.t} : {M|σ(x⃗)} → {M|σ(t)}

Figure 8.1. Interpretations of syntactic elements.


3.2 Models 


The notion of standard model (in sets) is as expected, except that we allow 
empty carriers. The logical rules, with the context attached to the turnstile, 
are designed to be sound for empty carriers. We shall also introduce a novel 
notation that is useful when dealing with more than one interpretation at the 
same time. 

The interpretation of different syntactic elements is defined in Figure 8.1. The
notation σ(t) denotes the sort of a term t, and similarly for a tuple of terms.
Once the signature ingredients are interpreted (arbitrarily), the interpretation of
terms and formulae in context follows structurally in an evident way, so that
{M|x⃗.φ} is the set of value tuples (in {M|σ(x⃗)}) for which φ holds, and {M|x⃗.t}
yields a result in {M|σ(t)} for any value tuple substituted for x⃗.

If T is a geometric theory over Σ, then we say that an interpretation M of Σ is
a model of T if, for every axiom φ ⊢_x⃗ ψ in T, we have {M|x⃗.φ} ⊆ {M|x⃗.ψ}.

We also define a notion of homomorphism. 


DEFINITION 8.20 Let Σ be a signature and let M, N be two interpretations
of Σ. Then a homomorphism h : M → N comprises a function {h|A} :
{M|A} → {N|A} for each sort A, subject to the following conditions. We
shall write {h|A⃗} for the product function ∏_i {h|A_i} : {M|A⃗} → {N|A⃗}.

For each predicate P ⊆ A⃗ in Σ and for each function f : A⃗ → B we require

\[
\begin{aligned}
\{M \mid P\} &\subseteq \{h \mid \vec{A}\}^{-1}(\{N \mid P\}) \\
\{h \mid B\} \circ \{M \mid f\} &= \{N \mid f\} \circ \{h \mid \vec{A}\}.
\end{aligned}
\]

Informally, we may say for any suitable value tuples a⃗ in M, that if P(a⃗) holds
in M, then P(h(a⃗)) holds in N; and that f(h(a⃗)) = h(f(a⃗)).


The two conditions, for predicates and functions, are not independent. If the 
function f is instead described by its graph, then the predicate condition for the 
graph is equivalent to the function condition for f. 

Obviously homomorphisms can be composed, and there are identity homo- 
morphisms, and so for any theory T we have a category Mod(T) of models of T. 
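
For finite interpretations the two homomorphism conditions are directly checkable. The sketch below deliberately simplifies Definition 8.20 to a single sort with unary predicate and function symbols; the representation (sets for predicates, dicts for functions) and the name `is_homomorphism` are assumptions of this illustration.

```python
def is_homomorphism(h, M, N, predicates, functions):
    """Check the homomorphism conditions for a single-sorted signature whose
    symbols are all unary. M and N map each symbol name to its interpretation
    (a set for a predicate, a dict for a function); h is a dict from the
    carrier of M to the carrier of N."""
    preserves_predicates = all(
        h[a] in N[P] for P in predicates for a in M[P]
    )
    commutes_with_functions = all(
        h[M[f][a]] == N[f][h[a]] for f in functions for a in h
    )
    return preserves_predicates and commutes_with_functions

# M: integers mod 4 with successor and a predicate Even.
M = {"succ": {0: 1, 1: 2, 2: 3, 3: 0}, "Even": {0, 2}}
# N: integers mod 2 with successor and Even.
N = {"succ": {0: 1, 1: 0}, "Even": {0}}

reduce_mod_2 = {n: n % 2 for n in range(4)}
constant_zero = {n: 0 for n in range(4)}

print(is_homomorphism(reduce_mod_2, M, N, ["Even"], ["succ"]))
print(is_homomorphism(constant_zero, M, N, ["Even"], ["succ"]))
```

Reduction mod 2 satisfies both conditions; the constant map preserves Even but does not commute with successor.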


PROPOSITION 8.21 Let Σ be a signature, let M, N be two interpretations of
Σ, and let h : M → N be a homomorphism.


1 Let (x⃗.t) be a term in context. Then

\[ \{N \mid \vec{x}.t\} \circ \{h \mid \sigma(\vec{x})\} = \{h \mid \sigma(t)\} \circ \{M \mid \vec{x}.t\}. \]


2 If (x⃗.φ) is any geometric formula in context, then {h|σ(x⃗)} restricts to
a function

\[ \{h \mid \vec{x}.\phi\} : \{M \mid \vec{x}.\phi\} \to \{N \mid \vec{x}.\phi\}. \]


Proof Induction on the formation of terms and formulae. QED 


The result relies fundamentally on the positivity of the logic. For a logic 
with negation, the homomorphism condition that we gave for predicates is not 
liftable through negation. For this reason in classical logic one may see a 
different notion of homomorphism. 


REMARK 8.22 Categorically, Definition 8.20 amounts to saying that for sorts
B, sort tuples A⃗ and predicates P ⊆ A⃗, we have functors {−|B}, {−|A⃗} and
{−|P} : Mod(T) → Set with natural transformations {−|P} → {−|A⃗}; and for
functions f : A⃗ → B there are natural transformations {−|f} : {−|A⃗} → {−|B}.
The Proposition says this extends to formulae and terms, with natural
transformations {−|x⃗.φ} → {−|σ(x⃗)} and {−|x⃗.t} : {−|σ(x⃗)} → {−|σ(t)}.


If F : T₁ → T₂ is a theory morphism, then for every model M of T₂ there
is a corresponding model F*M of T₁, defined by

\[ \{F^{*}M \mid -\} = \{M \mid F(-)\}. \]

This is called the F-reduct of M. F* gives a functor from Mod(T₂) to Mod(T₁)
(note the reversed direction!).


3.3 Cartesian theories 


These provide some important examples of geometric theories. They also 
provide the setting for some key constructions of universal algebra (including 
initial and free algebras) that turn out to be “geometric” in nature (Sec. 3.4). 

The best known and simplest amongst the Cartesian theories are the finitary 
algebraic theories, where “finitary” refers to the requirement that all opera- 
tors should have finite arity. Note that, unlike Johnstone, 2002b, Definition 
D1.1.7(a), we allow them to be many-sorted.


DEFINITION 8.23 A finitary algebraic theory is a geometric theory presented
with no predicate symbols, and with axioms all of the form

\[ (\forall \vec{x})\ (\top \rightarrow s = t) \]

where s and t are two terms in context x⃗. (In other words, the axioms are
equational laws.)


Cartesian theories generalize these, essentially by allowing operators to be 
partial. In that generality they are slightly difficult to formalize and have ap- 
peared in various guises. The definition here (following Johnstone, 2002b, 
D1.3.4) is due to Coste. Equivalent are the essentially algebraic theories of 
Freyd, the left exact theories and sketches (see Barr and Wells, 1984) and the 
quasi-equational theories of Palmgren and Vickers, 2007. 

As the references make clear, Cartesian theories are intimately associated 
with Cartesian categories, i.e. categories with all finite limits. A functor be- 
tween them that preserves all finite limits is a Cartesian functor. 

Most of the examples of Cartesian theories in this chapter are in fact finitary 
algebraic. Readers may safely omit the following definition if they wish, and 
consider Theorems 8.25 and 8.26 in the finitary algebraic case. 


DEFINITION 8.24 (Johnstone, 2002b, D1.3.4) Let Σ be a many-sorted,
first-order signature, and let T be a coherent theory over it.

The formulae in context that are Cartesian relative to T are as follows: atomic
formulae, ⊤, equations and conjunctions of Cartesian formulae; and (x⃗.(∃y)φ)
provided (x⃗, y.φ) is Cartesian, and the following sequent is derivable from T:

\[ \phi \wedge \phi(y'/y) \vdash_{\vec{x},y,y'} y = y' \]

The theory T is Cartesian if there is a well-founded partial order on its
axioms, such that for every axiom (∀x⃗)(φ → ψ), the formulae in context (x⃗.φ)
and (x⃗.ψ) are Cartesian relative to the previous axioms in T.





The essential point of this definition is that existential quantification can be 
used only when it is provably unique. This allows a mechanism for dealing 
with partial operations, by replacing them by their graphs. For example, the 
theory of categories is Cartesian but not algebraic. Composition is partial, with 
f o g defined iff the domain of f is equal to the codomain of g. 

As is well-known, every algebraic theory T has an initial model—that is to 
say, the category Mod(T) has an initial object. A consequence of this is that 
reduct functors have left adjoints. This generalizes to Cartesian theories. 


THEOREM 8.25 (INITIAL MODEL THEOREM) Let T be a Cartesian the-
ory. Then T has an initial model, in other words a model M₀ such that for
every other model M there is a unique homomorphism M₀ → M.

Proof In the case where T is an algebraic theory, the initial model is got by
taking all terms, and then factoring out a congruence generated from the equa-
tional laws. (The construction of Lindenbaum algebras, as propositions modulo
provable equality, is a particular instance of this.) Palmgren and Vickers, 2007
show how a similar proof can cover Cartesian theories, by working in a logic of
partial terms. The construction first takes all partial terms, and then factors out
a partial congruence (not necessarily reflexive) of provable equality, in which
self-equality of a term is equivalent to its being defined.

More traditional proofs rely on first forming a syntactic category, a Cartesian
category C_T such that models of T are equivalent to Cartesian functors from C_T
to Set (Johnstone, 2002b, Theorem D1.4.7), and then appealing to Kennison’s
Theorem (Barr and Wells, 1984, Theorem 4.2.1). QED


THEOREM 8.26 (FREE MODEL THEOREM) Let T₁ and T₂ be Cartesian
theories, and let F : T₁ → T₂ be a theory morphism. Then the reduct functor
F* : Mod(T₂) → Mod(T₁) has a left adjoint Free_F.


Proof Just to sketch the proof, let M be a model of T₁. We can define a
new Cartesian theory T′ whose models are pairs (N, f), where N is a model
of T₂ and f : M → F*(N) is a homomorphism. T′ is got by augmenting T₂
with constant symbols for the elements of M, and equations to say that the
interpretation of those constants respects the structure of T₁. Then an initial
model of T′ can be taken for Free_F(M). QED


The best known examples are where T₁ and T₂ are both single-sorted alge-
braic, with T₁ the theory with (one sort and) no operators or laws. Its models
are sets. There is a unique theory morphism F : T₁ → T₂, and F* picks out the
carrier but forgets all the algebraic structure. Then Free_F constructs the free
T₂-model on a set.
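
A familiar instance, given here only as an illustration, takes T₂ to be the theory of monoids: the free monoid on a set X can be represented by finite tuples over X under concatenation, and the adjunction says that any function from X into a monoid extends uniquely to a monoid homomorphism. The names `free_extension` and `eta` below are hypothetical.

```python
def free_extension(g, unit, op):
    """Given g : X -> A, where A is a monoid with the given unit and binary
    operation, return the monoid homomorphism from tuples over X (under
    concatenation) to A that extends g."""
    def extended(word):
        result = unit
        for letter in word:
            result = op(result, g(letter))
        return result
    return extended

def eta(x):
    """Insertion of generators: x is sent to the one-letter word (x,)."""
    return (x,)

# Target monoid: integers under addition; every generator is sent to 1,
# so the extension computes the length of a word.
g = lambda x: 1
length = free_extension(g, 0, lambda a, b: a + b)

w1, w2 = ("a", "b"), ("c",)
print(length(w1 + w2), length(w1) + length(w2))  # homomorphism property
print(length(()), length(eta("a")))              # unit and generators
```

The extension is forced on every word once its values on one-letter words are fixed, which is the uniqueness half of the universal property.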

Note that this theorem, and the initial model theorem on which it depends, 
in general rely critically on the fact that we allow empty carriers. 


EXAMPLE 8.27 Consider the algebraic theory with two sorts A and B, two
constants s and t of sort B, a unary operator f : A → B and axioms

\[
\begin{aligned}
&(\forall x : A)\ (\top \rightarrow s = f(x)) \\
&(\forall x : A)\ (\top \rightarrow t = f(x)).
\end{aligned}
\]

Its initial model M₀ has {M₀|A} = ∅, {M₀|B} = {s, t}.

Examples like this are sometimes used in equational reasoning to suggest
that “equality is not transitive if empty carriers are allowed”. This is because
the equational laws can be presented as s = f(x) and t = f(x), but in M₀ we
cannot deduce s = t. In our treatment we see that the equalities are in context,
and in effect, “equality in context x” is transitive. We can deduce ⊤ ⊢_x s = t,
which implies s = t provided we can interpret the variable x in A.


3.4 Geometric types


We have presented geometric theories using simple sorts that are declared 
in the signature and thereafter cannot be manipulated in any way. However, it 
would be an obvious convenience if we could perform mathematical construc- 
tions on those sorts to derive new ones. Following Johnstone, 2002b, D4.1 we 
shall use the word type for these generalized sorts, and reserve the word sort 
for what was declared in the signature. 

An important feature of geometric logic is that its infinitary disjunctions 
allow us to characterize some type constructors uniquely up to isomorphism by 
using geometric structure and axioms. 

A fundamental example is the natural numbers. Consider the geometric
theory with a single sort N, constant 0, unary operator s and axioms

\[
\begin{aligned}
&(\forall x : N)\ (s(x) = 0 \rightarrow \bot) \\
&(\forall x, y : N)\ (s(x) = s(y) \rightarrow x = y) \\
&(\forall x : N)\ \bigvee_{n \in \mathbb{N}} x = s^{n}(0).
\end{aligned}
\]

Here, sⁿ(0) stands for the term s(...(s(0))...) with n occurrences of s. The
notation sⁿ is not a formal part of the logic, but a metasyntax used to describe
the set of formulae over which the disjunction is taken. In any model, N can
and must be interpreted as a set that is isomorphic to the natural numbers, by
a unique isomorphism under which the constant 0 corresponds to the natural
number 0, and the function s corresponds to the successor operation n ↦ n+1.
Hence, modulo isomorphism, this theory is just a variant of the trivial theory
with empty signature and no axioms.

In fact, just within the logic we can prove that N has a universal property that
characterizes the natural numbers: it is an initial model for the single-sorted
algebraic theory of induction algebras, which has constant ε, unary operator t
and no axioms.


THEOREM 8.28 Let a geometric theory have N,0,s as axiomatized above. 
Then, in any model, N is interpreted as an initial induction algebra. 


Proof Let $(A, \varepsilon, t)$ be an induction algebra. Of course N is itself an induction
algebra under 0 and s; we must show there is a unique induction algebra
homomorphism from N to A. For uniqueness, suppose $f : N \to A$ is a
homomorphism. We show that (in sequent form)

\[
y = f(x) \dashv\vdash_{xy} \bigvee_{n \in \mathbb{N}} (x = s^n(0) \wedge y = t^n(\varepsilon)).
\]

For $\dashv$, we have $\top \vdash f(s^n(0)) = t^n(\varepsilon)$ by induction on n. For $\vdash$, combine this
with

\[
\top \vdash_{x} \bigvee_{n \in \mathbb{N}} (x = s^n(0)).
\]


For existence of f, we show that the formula in context

\[
(xy.\ \Gamma) \equiv \bigvee_{n \in \mathbb{N}} (x = s^n(0) \wedge y = t^n(\varepsilon))
\]


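is the graph of an induction algebra homomorphism, that is to say

\[
\begin{aligned}
\Gamma \wedge \Gamma(y'/y) &\vdash_{xyy'} y = y' \\
\top &\vdash_{x} (\exists y)\, \Gamma \\
\top &\vdash \Gamma(0, \varepsilon / x, y) \\
\Gamma &\vdash_{xy} \Gamma(s(x), t(y) / x, y).
\end{aligned}
\]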


The first two of these state that the relation $\Gamma$ is single-valued and total, and
hence the graph of a function, and the remaining two state that the function 
preserves the induction algebra operations. 

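These are all easy except perhaps for the first. For that, we want

\[
x = s^n(0) \wedge y = t^n(\varepsilon) \wedge x = s^{n'}(0) \wedge y' = t^{n'}(\varepsilon) \vdash_{xyy'} y = y'.
\]

If $n = n'$ that is obvious, while for $n \neq n'$ we can prove
$x = s^n(0) \wedge x = s^{n'}(0) \vdash_{x} \bot$. QED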
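
In the standard semantics in sets, the homomorphism whose graph is built in the proof is the familiar iteration map sending n to $t^n(\varepsilon)$. The following Python sketch is illustrative only: the names `fold_nat`, `eps` and `t` are ours, and Python integers stand in for the interpretation of N.

```python
from typing import Callable, TypeVar

A = TypeVar("A")

def fold_nat(eps: A, t: Callable[[A], A]) -> Callable[[int], A]:
    """Return the map f with f(0) = eps and f(n + 1) = t(f(n))."""
    def f(n: int) -> A:
        if n < 0:
            raise ValueError("n must be a natural number")
        y = eps
        for _ in range(n):  # apply t exactly n times, giving t^n(eps)
            y = t(y)
        return y
    return f

# Example induction algebra (an assumption for illustration):
# strings, with eps = "" and t(w) = w + "a".
f = fold_nat("", lambda w: w + "a")
```

Uniqueness corresponds to the fact that the two equations f(0) = eps and f(n + 1) = t(f(n)) leave no freedom in the choice of f.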


Let us stress the fact that the proof was within the formality of geometric 
logic. Thus it will be valid not only for the standard semantics in sets, but also 
for other semantics such as (as we shall see) in sheaves. 

As is well known, there can be no such characterization of the natural numbers
in finitary logic. Our ability to do it in geometric logic derives from the power of
the infinitary disjunctions, and this extends to other “geometric” type constructs. 

One might respond to this power by regarding explicit geometric type con- 
structions as unnecessary, since they can be reduced to first order logic. By 
contrast we shall instead feel justified in using an explicit geometric type the- 
ory, since it does not transcend the scope of geometric logic. 

In the present state of our knowledge we do not have a formal type theory 
along these lines. Instead, we shall use types informally, using the geometric 
type constructs wherever sorts can occur, and also introducing any functions 
and predicates that are associated with those types. In Remark 8.34 we shall 
see a semantic characterization of which type constructs are geometric. 

By geometric type theory we shall understand a geometric theory in which
geometric type constructs are used in this way. They have the same expressive
power as geometric theories. An interpretation must interpret those types and
the associated predicates and functions in the intended way.

A coherent type theory is a geometric type theory in which all disjunctions are
finite. These are stronger in expressive power than coherent theories, since the 
type constructs may implicitly use infinitary disjunctions for their justification. 

The first example of geometric type constructor, and one of the simplest, is 
the Cartesian product. (In fact all finite categorical limits are geometric.) Other 
examples arise out of the following general principle. Suppose $F : T_1 \to T_2$ is a
theory morphism between Cartesian theories. Then the free model construction
$\mathrm{Free}_F$ is geometric. Note that this will usually construct not only types, but
also functions. In the following example, the principle can be applied with 
both theories algebraic (possibly many sorted) — exercise! But in each case it 
is also possible to specify the type geometrically and give a geometric proof of 
its universal property. 


EXAMPLE 8.29 1 The natural numbers N. 


2 The list type A* over a type A. In Set its elements are finite lists of 
elements of A. A* is the free monoid (having associative binary operation 
with a 2-sided identity element) over A; see Johnstone, 2002a, A2.5.15 
for another treatment. 


3 Coproducts (in Set, disjoint unions). 
4 Coequalizers (or, more particularly, quotients of equivalence relations). 


One very important type constructor is the finite powerset F. This is discussed
extensively in Johnstone, 2002b, D5.4 under the notation K. (Constructively,
the particular notion of finiteness being used is Kuratowski finiteness. The
Kuratowski finite subsets of a set S are the elements of the $\cup$-subsemilattice
of PS generated by the singletons. S is Kuratowski finite if it is a Kuratowski
finite subset of itself.) It is a geometric construction because FA is the free
semilattice over A. (A semilattice is a monoid $(A, 0, \vee)$ in which $\vee$ is
commutative ($x \vee y = y \vee x$) and idempotent ($x \vee x = x$).) Johnstone, 2002b,
D5.4 gives an alternative description analogous to that of list objects; see also
Vickers, 1999.

A particularly important feature of the finite power type is that it enables us
to internalize universal quantification, provided it is finitely bounded. Suppose
$(x : A, y : B.\ \phi)$ is a formula in context. Then so is
$(S : FA, y : B.\ (\forall x \in S)\, \phi)$. It is interpreted as follows. Consider $\{M|xy.\phi\}$ as a function from
$\{M|A\}$ to $P\{M|B\}$. The codomain of this is a semilattice under $\cap$, and so we
get a semilattice homomorphism from $F\{M|A\}$ to $P\{M|B\}$. This transposes
to a subset of $\{M|FA \times B\}$, the interpretation of $(\forall x \in S)\, \phi$.
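
For finite sets the interpretation just described can be computed directly. The sketch below is only an illustration in Set: the helper names `finite_subsets` and `bounded_forall`, and the sample relation `divides`, are our own choices rather than anything fixed by the text.

```python
from itertools import combinations

def finite_subsets(xs):
    """All finite subsets of a finite set xs, as frozensets."""
    xs = list(xs)
    return [frozenset(c) for k in range(len(xs) + 1) for c in combinations(xs, k)]

def bounded_forall(phi, A, B):
    """Interpret (S : FA, y : B. (forall x in S) phi) as a set of pairs (S, y)."""
    B = frozenset(B)
    # phi viewed as a function A -> P(B): x |-> {y | phi(x, y)}
    fibre = {x: frozenset(y for y in B if phi(x, y)) for x in A}
    result = set()
    for S in finite_subsets(A):
        # Semilattice homomorphism F(A) -> (P(B), intersection, B):
        # the empty set goes to the unit B, unions go to intersections.
        ys = B
        for x in S:
            ys = ys & fibre[x]
        result.update((S, y) for y in ys)
    return result

A = {1, 2, 3}
B = {1, 2, 3, 4, 6}
divides = lambda x, y: y % x == 0
pairs = bounded_forall(divides, A, B)
```

Here `(frozenset({2, 3}), 6)` belongs to `pairs` because both 2 and 3 divide 6, while every y is paired with the empty set S.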
3.5 Dedekind sections


Each real number x is characterized by its Dedekind section, two sets of
rationals:

\[
\begin{aligned}
L &= \{ q \in Q \mid q < x \}, \\
R &= \{ r \in Q \mid x < r \}.
\end{aligned}
\]

(Variants of this are possible, with $\leq$ instead of <. But they do not yield a
geometric theory.) The idea then is to define the real number x to be the pair
(L, R) of subsets of Q, and (for $q, r \in Q$) define q < x if $q \in L$ and x < r if
$r \in R$.

The pairs x = (L, R) that arise in this way are characterized by the following 
properties. 


1 There is some rational q with q < x.

2 If q < q' < x then q < x.

3 If q < x then there is some rational q' with q < q' < x.

4 There is some rational r with x < r.

5 If x < r' < r then x < r.

6 If x < r then there is some rational r' with x < r' < r.

7 It is impossible to have q < x < q.

8 If q < r then either q < x or x < r.


((8) looks more obvious contrapositively: if $r \leq x \leq q$ then $r \leq q$. But we
are axiomatizing <, not $\leq$.)

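We can rewrite this, though at some cost in clarity, to a coherent type theory
Ded with two predicates $L, R \subseteq Q$ and axioms

\[
\begin{aligned}
&(\exists q : Q)\ L(q) \\
&(\forall q, q' : Q)\ (q < q' \wedge L(q') \rightarrow L(q)) \\
&(\forall q : Q)\ (L(q) \rightarrow (\exists q' : Q)\ (q < q' \wedge L(q'))) \\
&(\exists r : Q)\ R(r) \\
&(\forall r, r' : Q)\ (r' < r \wedge R(r') \rightarrow R(r)) \\
&(\forall r : Q)\ (R(r) \rightarrow (\exists r' : Q)\ (r' < r \wedge R(r'))) \\
&(\forall q : Q)\ (L(q) \wedge R(q) \rightarrow \bot) \\
&(\forall q, r : Q)\ (q < r \rightarrow L(q) \vee R(r))
\end{aligned}
\]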
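
As a sanity check on the axioms, one can test a candidate section on a finite sample of rationals. The sketch below uses the section of $\sqrt{2}$ as an example; the function names and the sample are our own assumptions, and a finite test can of course only refute a candidate, not prove that it is a model. Only the purely universal axioms and the two inhabitedness axioms are checked, since the roundedness axioms need witnesses that may lie outside a finite sample.

```python
from fractions import Fraction

def L(q):  # intended meaning: q < sqrt(2)
    return q < 0 or q * q < 2

def R(r):  # intended meaning: sqrt(2) < r
    return r > 0 and r * r > 2

# A finite sample of rationals; the axioms themselves quantify over all of Q.
sample = sorted({Fraction(p, d) for p in range(-12, 13) for d in range(1, 6)})

def check_universal_axioms(L, R, sample):
    for q in sample:
        if L(q) and R(q):                      # L(q) and R(q) -> false
            return False
        for r in sample:
            if q < r and L(r) and not L(q):    # L is closed downwards
                return False
            if q < r and R(q) and not R(r):    # R is closed upwards
                return False
            if q < r and not (L(q) or R(r)):   # q < r -> L(q) or R(r)
                return False
    return True

ok = check_universal_axioms(L, R, sample)
inhabited = any(L(q) for q in sample) and any(R(r) for r in sample)
```

A pair such as L(q) = (q < 0), R(r) = (r > 1) fails the last axiom, since it leaves a gap between 0 and 1.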
To see that this is indeed a coherent type theory, we must show that Q is a
geometric type construction, and in addition that < (on rationals) is geometric. 
The standard construction of Q is in stages. First, the natural numbers N have 
already been mentioned as an example of a geometric type. The arithmetic 
operations of addition and multiplication can then be defined in the following 
way. Addition is the unique operation $+ : N \times N \to N$ that satisfies

\[
\begin{aligned}
&(\forall n : N)\ 0 + n = n \\
&(\forall m, n : N)\ s(m) + n = s(m + n).
\end{aligned}
\]


Hence if the symbol + is declared and those axioms are added to the theory, 
there is no change to the models — the operation + is forced to be interpreted 
in the intended way. So its use as a standard mathematical symbol is shorthand 
for that declaration with axioms. Similarly, we can define all the relations
$=, \neq, <, >, \leq$ and $\geq$ as subsets of $N \times N$. (In this case, we can even define
them as operations $N \times N \to 2 = 1 + 1$. This is because the relations are
decidable.) For instance, < is the unique relation satisfying

\[
\begin{aligned}
&(\forall n : N)\ 0 < s(n) \\
&(\forall m : N)\ (m < 0 \rightarrow \bot) \\
&(\forall m, n : N)\ (s(m) < s(n) \rightarrow m < n) \\
&(\forall m, n : N)\ (m < n \rightarrow s(m) < s(n))
\end{aligned}
\]
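
Read as recursion equations, the axioms for + and < determine the operations completely. A minimal sketch in Python, with built-in integers standing in for N and the function names `add` and `lt` chosen by us:

```python
def add(m: int, n: int) -> int:
    # 0 + n = n   and   s(m) + n = s(m + n)
    return n if m == 0 else 1 + add(m - 1, n)

def lt(m: int, n: int) -> bool:
    if n == 0:
        return False          # m < 0 -> false
    if m == 0:
        return True           # 0 < s(n)
    return lt(m - 1, n - 1)   # s(m) < s(n) exactly when m < n
```

That `lt` returns a Boolean in every case reflects the remark that the relation is decidable, so it can be treated as an operation into 2 = 1 + 1.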


Next, the integers Z are got as a quotient of $N \times N$ by an equivalence relation
$\sim_1$, defined by $(m, n) \sim_1 (m', n')$ iff $m + n' = m' + n$. The pair (m, n)
represents the integer $m - n$. Again, it is possible to define arithmetic and
inequalities.

Finally, the rationals Q are got as a quotient of $Z \times \{ n \in N \mid n \neq 0 \}$
by an equivalence relation $\sim_2$, defined by $(p, q) \sim_2 (p', q')$ iff $pq' = p'q$.
(The pair (p, q) represents the rational p/q.) The inequality < is defined by
$(p, q) < (p', q')$ if $pq' < p'q$.
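
The two quotient stages can be mimicked by working with representatives and comparing them through the defining relations. In this sketch an integer is a pair of naturals and a rational is a pair (integer, nonzero natural); the helpers `mul_z` and `lt_z` are our own choice of definitions for the arithmetic and order on Z that the text leaves implicit.

```python
def eq_z(a, b):
    """(m, n) ~1 (m', n') iff m + n' = m' + n; the pair (m, n) stands for m - n."""
    (m, n), (m2, n2) = a, b
    return m + n2 == m2 + n

def mul_z(a, b):
    """(m - n)(m' - n') = (mm' + nn') - (mn' + nm'), staying inside N x N."""
    (m, n), (m2, n2) = a, b
    return (m * m2 + n * n2, m * n2 + n * m2)

def lt_z(a, b):
    """m - n < m' - n' iff m + n' < m' + n."""
    (m, n), (m2, n2) = a, b
    return m + n2 < m2 + n

def eq_q(x, y):
    """(p, q) ~2 (p', q') iff p q' = p' q, comparing in Z."""
    (p, q), (p2, q2) = x, y
    return eq_z(mul_z(p, (q2, 0)), mul_z(p2, (q, 0)))

def lt_q(x, y):
    """(p, q) < (p', q') iff p q' < p' q, comparing in Z."""
    (p, q), (p2, q2) = x, y
    return lt_z(mul_z(p, (q2, 0)), mul_z(p2, (q, 0)))

half = ((1, 0), 2)         # 1/2
also_half = ((5, 3), 4)    # (5 - 3)/4
minus_third = ((0, 1), 3)  # -1/3
```

Each function respects the equivalence relations, which is what allows it to descend to the quotient.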

So we see that Q and much of its accompanying structure are all geometric 
and can be used as needed in coherent type theories. This is emphatically not 
the case with the reals R. As a set, R is described as a subset of PQ x PQ, and 
the powerset constructor P is not amongst the geometric type constructors. 
(See Remark 8.34.) That is why we have to access the reals in a different way, 
by defining a theory whose models they are. 

In Sec. 2.5 we saw the localic reals, given by a propositional geometric theory. 
In fact (Sec. 4.7), that geometric theory is equivalent to this one, despite the 
fact that one is propositional and the other is predicate with type constructs. In 
Sec. 4.6 we shall see how Ded retains a propositional character from the fact
that it has no sorts declared. Its types are all constructed out of nothing.
4. Categorical logic


For propositional logic, the standard semantics interprets propositions as 
truth values. For a more general semantics, we interpreted propositions as el- 
ements of more general lattices—Boolean algebras for classical logic, frames 
for geometric logic. Then the Lindenbaum algebra was the lattice (of the ap- 
propriate kind) freely generated by a generic model. 

For predicate logic, the standard semantics is in sets. Categorical logic 
generalizes this by interpreting the symbols in a category, of a kind appropriate 
to the logic, and then the analogue of the Lindenbaum algebra is a category. For 
geometric logic, the appropriate categories are Grothendieck toposes and the 
Lindenbaum algebra for a predicate geometric theory is the classifying topos. 
Our aim now is to explore how the technology of Lindenbaum algebras (as used 
for locales) extends to this setting. 

We start with an introduction to categorical logic, following Johnstone, 
2002b. A more elementary introduction can be found in Goldblatt, 1979. 


4.1 Interpreting logic in a category 


We assume that the reader has an elementary knowledge of category theory, 
including the basic definition, limits and colimits, and adjunctions. 
We shall normally write composition of morphisms in applicative order,
using “$\circ$”. However, on occasion it will be convenient to use diagrammatic
order instead, with “;”. Thus the composition of morphisms

\[
A \xrightarrow{\ f\ } B \xrightarrow{\ g\ } C
\]

will be written usually as $g \circ f$, but occasionally as $f ; g$.

Suppose $\Sigma$ is a many-sorted, first-order signature. The usual notion of
interpretation of $\Sigma$ in the category Set of sets, as given in Table 8.1, can be extended
to any category C with finite products. Sets, functions, elements and subobjects 
become objects, morphisms, morphisms with domain 1 (a terminal object, i.e. 
nullary product) and subobjects in C. 

If A is an object in C, we shall write $\mathrm{Sub}_C(A)$ (or often just Sub(A)) for
the class of subobjects of A. We shall generally assume also that C is
well-powered, i.e. that each $\mathrm{Sub}_C(A)$ is a set. In our situations $\mathrm{Sub}_C(A)$ is a meet
semilattice, with greatest lower bounds given by pullbacks of subobjects. If
$f : A \to B$ is a morphism, then pullback gives a meet semilattice homomorphism
$f^* : \mathrm{Sub}_C(B) \to \mathrm{Sub}_C(A)$, called inverse image. (Exercise: in Set that is
exactly what it is.)

As before, any interpretation of the ingredients of $\Sigma$ in C can (given suitable
categorical structure in C) be extended recursively to terms and formulae in
context. We shall assume initially that C has at least all finite limits, in other
words that it is Cartesian. First, we deal with terms.

Variables: (x.x) is a term in context, interpreted by the identity morphism
on $\{M|\sigma(x)\}$.

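Substitution: Suppose $(\vec{x}.t)$ is a term in context, and $(\vec{w}.\vec{s})$ is a vector of
terms in context that is type compatible with $\vec{x}$ (i.e. $\sigma(\vec{s}) = \sigma(\vec{x})$). (Note: we
have the same context $\vec{w}$ for every component of $\vec{s}$.) Then $(\vec{w}.t(\vec{s}/\vec{x}))$ is a term
in context. Its interpretation is given by

\[
\langle \{M|\vec{w}.\vec{s}\} \rangle ; \{M|\vec{x}.t\} :
\{M|\sigma(\vec{w})\} \to \{M|\sigma(\vec{x})\} \to \{M|\sigma(t)\}
\]

where $\langle \{M|\vec{w}.\vec{s}\} \rangle$ denotes the product tupling
$\langle \{M|\vec{w}.s_1\}, \ldots, \{M|\vec{w}.s_n\} \rangle$ of morphisms.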

Substitution also covers context weakening. Suppose $(\vec{x}.t)$ is a term in
context, and w is a variable not in $\vec{x}$. Then $(\vec{x}, w.\ \vec{x})$ is a vector of terms in
context, and the substitution $(\vec{x}, w.\ t(\vec{x}/\vec{x}))$ gives a term in context $(\vec{x}, w.\ t)$. For
its semantics, note that $\langle \{M|\vec{x}, w.\ \vec{x}\} \rangle$ is the product projection
$\{M|\sigma(\vec{x})\} \times \{M|\sigma(w)\} \to \{M|\sigma(\vec{x})\}$.
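
In Set these clauses say that a term in context is a function on tuples, that substitution is composition with a tupling, and that weakening is composition with a projection. A small Python sketch, in which the helper names `tupling` and `then` and the sample terms are our own:

```python
def tupling(*fs):
    """Product tupling <f1, ..., fn> of functions with a common domain."""
    return lambda env: tuple(f(env) for f in fs)

def then(f, g):
    """Diagrammatic composition f ; g."""
    return lambda env: g(f(env))

# Term in context (x1, x2 . x1 + x2), interpreted as a function on pairs.
t = lambda env: env[0] + env[1]

# Vector of terms in context (w . (w * w, 3)), both components in context w.
s1 = lambda env: env[0] * env[0]
s2 = lambda env: 3

# Substitution (w . t(s/x)) is interpreted by <s1, s2> ; t.
t_subst = then(tupling(s1, s2), t)

# Context weakening: substitute (x1, x2, w . (x1, x2)), a product projection.
projection = tupling(lambda env: env[0], lambda env: env[1])
t_weak = then(projection, t)
```

For instance `t_subst((5,))` evaluates w * w + 3 at w = 5, and `t_weak` ignores its third argument.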

Now we deal with formulae. Exercise! Check that these all make sense 
in Set. 

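Substitution: Suppose $(\vec{x}.\phi)$ is a formula in context, and $(\vec{w}.\vec{t})$ is a vector
of terms in context such that $\sigma(\vec{t}) = \sigma(\vec{x})$. Then $(\vec{w}.\phi(\vec{t}/\vec{x}))$ is a formula in
context given by an inverse image (a pullback),

\[
\{M|\vec{w}.\phi(\vec{t}/\vec{x})\} = \langle \{M|\vec{w}.\vec{t}\} \rangle^* (\{M|\vec{x}.\phi\}).
\]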


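Again, this also covers context weakening. If $(\vec{x}.\phi)$ is a formula in context, and
w is a variable not in $\vec{x}$, then $(\vec{x}, w.\ \phi)$ is also a formula in context given by
substituting $\vec{x}$ for $\vec{x}$.

Equality: Let $(\vec{x}.t)$ and $(\vec{x}.t')$ be two terms in context, with $\sigma(t) = \sigma(t')$.
Then $(\vec{x}.\ t = t')$ is a formula in context interpreted by an equalizer

\[
\{M|\vec{x}.\ t = t'\} \hookrightarrow \{M|\sigma(\vec{x})\}
\underset{\{M|\vec{x}.t'\}}{\overset{\{M|\vec{x}.t\}}{\rightrightarrows}} \{M|\sigma(t)\}.
\]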


Conjunction: Let $(\vec{x}.\phi)$ and $(\vec{x}.\psi)$ be two formulae in context. Then the
conjunction rules imply that $(\vec{x}.\ \phi \wedge \psi)$ must be interpreted by the greatest
lower bound of subobjects $\{M|\vec{x}.\phi\}$ and $\{M|\vec{x}.\psi\}$ in $\{M|\sigma(\vec{x})\}$. For $\top$, the
nullary conjunction, we have $\{M|\vec{x}.\top\} = \{M|\sigma(\vec{x})\}$.
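
The exercise of checking these clauses in Set can be made concrete with finite sets: formulae in context become subsets, substitution becomes inverse image, equality an equalizer and conjunction an intersection. The carrier, the helper names and the sample formulae below are illustrative assumptions.

```python
from itertools import product

X = range(4)                      # carrier of a single sort
ctx2 = set(product(X, X))         # interpretation of the context (x, y)

def inverse_image(f, domain, subset):
    """f^*(subset) = {d in domain | f(d) in subset}; a pullback in Set."""
    return {d for d in domain if f(d) in subset}

def equalizer(f, g, domain):
    """Interpretation of (x . t = t') for terms interpreted by f and g."""
    return {d for d in domain if f(d) == g(d)}

# (x, y . x < y) as a subset of the context.
less = {(x, y) for (x, y) in ctx2 if x < y}

# Substituting the terms (w . (w, 2)) for (x, y) gives (w . w < 2).
ctx1 = {(w,) for w in X}
subst = inverse_image(lambda env: (env[0], 2), ctx1, less)

# (x, y . x + y = 3) as an equalizer, and a conjunction as an intersection.
sum_is_3 = equalizer(lambda e: e[0] + e[1], lambda e: 3, ctx2)
both = less & sum_is_3
```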

For the remaining geometric connectives, $\bigvee$, $\bot$, $\exists$, we need extra structure
on the category C.

For existential quantification, C must have images: the image of $f : A \to B$,
if it exists, is the smallest subobject of B through which f factors. A
consequence of C having images (for all morphisms) is that there are “direct
image” functions $\exists_f : \mathrm{Sub}(A) \to \mathrm{Sub}(B)$, left adjoint to $f^*$. (In Set the
adjunction appears as $f(S) \subseteq T$ iff $S \subseteq f^{-1}(T)$.) As was first understood by
Lawvere, this is exactly the content of the existential rules. Hence if $(\vec{x}, y.\ \phi)$
is a formula in context, so is $(\vec{x}.\ (\exists y)\, \phi)$ and it is interpreted by

\[
\{M|\vec{x}.\ (\exists y)\, \phi\} = \exists_\pi (\{M|\vec{x}, y.\ \phi\})
\]

where $\pi : \{M|\sigma(\vec{x})\} \times \{M|\sigma(y)\} \to \{M|\sigma(\vec{x})\}$ is the product projection.
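
The adjunction between direct and inverse image can be verified exhaustively on small finite sets, and the existential quantifier then appears as the direct image along a projection. The sets, the function `f` and the formula `phi` in this sketch are arbitrary examples of ours.

```python
from itertools import combinations

def subsets(xs):
    xs = list(xs)
    return [frozenset(c) for k in range(len(xs) + 1) for c in combinations(xs, k)]

def direct_image(f, S):
    return frozenset(f(a) for a in S)

def inverse_image(f, A, T):
    return frozenset(a for a in A if f(a) in T)

A = {0, 1, 2, 3}
B = {"even", "odd", "unused"}
f = lambda a: "even" if a % 2 == 0 else "odd"

# f(S) <= T  iff  S <= f^{-1}(T), for every S in Sub(A) and T in Sub(B).
adjunction_holds = all(
    (direct_image(f, S) <= T) == (S <= inverse_image(f, A, T))
    for S in subsets(A) for T in subsets(B)
)

# (x . (exists y) phi) as the direct image of phi along the projection (x, y) |-> x.
phi = {(x, y) for x in A for y in A if y == x + 2}
exists_y_phi = direct_image(lambda pair: pair[0], phi)
```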

For disjunction, the disjunction rules imply that if $(\vec{x}.\phi_i)$ are formulae in
context, then $\{M|\vec{x}.\ \bigvee_i \phi_i\}$ has to be the least upper bound of the subobjects
$\{M|\vec{x}.\phi_i\}$ in $\{M|\sigma(\vec{x})\}$. Hence our categorical structure must include those
least upper bounds. False is just a nullary disjunction. $\{M|\vec{x}.\bot\}$ must be the
least subobject of $\{M|\sigma(\vec{x})\}$.

Negation, implication, universal quantification: These connectives enter
geometric logic only at the level of axioms, and we interpret them in a different
way. Suppose we have formulae in context $(\vec{x}.\phi)$ and $(\vec{x}.\psi)$. Then an
interpretation M satisfies the axiom $(\forall \vec{x})\ (\phi \rightarrow \psi)$, symbolically

\[
M \models (\forall \vec{x})\ (\phi \rightarrow \psi),
\]

if $\{M|\vec{x}.\phi\} \leq \{M|\vec{x}.\psi\}$. Negation can also be treated this way, by taking $\neg\phi$
as $\phi \rightarrow \bot$.

As usual, if T is a theory over signature $\Sigma$, then M is a model of T if it
satisfies every axiom in T.

At this point it is very useful to consider the notion of generalized element, 
analogous to generalized points. 

















DEFINITION 8.30 Let C be a category and A an object in it. A generalized
element of A is any morphism whose codomain is A. The domain of the morphism
is called the stage of definition of the generalized element.

A generalized element at stage 1 is a global element. 


Consider for instance the above assertion $M \models (\forall \vec{x})\ (\phi \rightarrow \psi)$. We might
like this to mean that every element of $\{M|\sigma(\vec{x})\}$ that satisfies $\phi$ (i.e. it factors
via $\{M|\vec{x}.\phi\}$) also satisfies $\psi$. But this is only a weak assertion if there is a
shortage of morphisms from 1 to $\{M|\sigma(\vec{x})\}$.

Our interpretation of the assertion $M \models (\forall \vec{x})\ (\phi \rightarrow \psi)$ can now be explained
naturally in terms of generalized elements, for it says that every generalized
element of $\phi$ is also in $\psi$. To see this one way round, consider the inclusion
$\{M|\vec{x}.\phi\} \hookrightarrow \{M|\sigma(\vec{x})\}$. This is a generalized element (with stage of definition
$\{M|\vec{x}.\phi\}$) that satisfies $\phi$. In fact it is the generic element of $\phi$ in M. If it is also
to satisfy $\psi$, then we get $\{M|\vec{x}.\phi\} \leq \{M|\vec{x}.\psi\}$. Conversely, every generalized
element of $\phi$ factors through the generic element, so if this is in $\psi$ so too is
every generalized element of $\phi$.

Normally, when we intend our language to be interpreted in categories like
this then by “element” we shall mean generalized element.

Homomorphisms between interpretations are defined just as in Definition
8.20, modulo obvious changes: the carrier functions $\{h|A\}$ become morphisms,
$\{h|A\}^{-1}$ becomes $\{h|A\}^*$, subset inclusion $\subseteq$ becomes subobject inclusion
$\leq$, etc. Proposition 8.21 still holds, by induction on the formation of
terms and formulae.

Again we have a category $\mathrm{Mod}_C(T)$ of models of T in C, and for any theory
morphism $F : T_1 \to T_2$, we have an F-reduct functor
$F^* : \mathrm{Mod}_C(T_2) \to \mathrm{Mod}_C(T_1)$.


4.2 Grothendieck toposes 


To interpret geometric logic categorically, we shall use Grothendieck toposes. 
These are usually defined as “categories of sheaves over Grothendieck topolo- 
gies”, but that is in effect referring to a representation theorem, Giraud’s Theo- 
rem (8.67). This says that a category is equivalent to such a category of sheaves 
iff it has certain structure and properties. Since the structure and properties can 
be related directly to the geometric logic, we shall use it as our definition. It can 
be described in various ways; our presentation here is the $\infty$-pretopos with
separating set of objects of Johnstone, 2002b, Theorem C2.2.8 (vii) and Johnstone,
2002a, A1.4.


DEFINITION 8.31 A category $\mathcal{E}$ is a Grothendieck topos if it has the following
properties.

1 $\mathcal{E}$ has all finite limits.

2 $\mathcal{E}$ has images (p. 458) and image factorization is preserved by pullback.

3 $\mathcal{E}$ is well-powered (i.e. for every object A, the class $\mathrm{Sub}_{\mathcal{E}}(A)$ is a set).

4 For each object A, the poset $\mathrm{Sub}_{\mathcal{E}}(A)$ has arbitrary joins (least upper
bounds), and they are preserved by pullbacks $f^*$.

5 Every set-indexed family of objects of $\mathcal{E}$ has a disjoint coproduct. (A
coproduct $A = \coprod_i A_i$ is disjoint if all the coproduct injections $A_i \to A$
are monic, and the meet of $A_i$ and $A_j$ in $\mathrm{Sub}_{\mathcal{E}}(A)$ is less than
$\bigvee \{ A_k \mid k = i \text{ and } k = j \}$. Hence if $i \neq j$ then $A_i$ and $A_j$ are disjoint
in $\mathrm{Sub}_{\mathcal{E}}(A)$.)

6 Every equivalence relation $R \rightrightarrows A$ in $\mathcal{E}$ is a kernel pair. (Fuller details
can be found in Mac Lane and Moerdijk, 1992, Appendix, Theorem I or
Johnstone, 2002a, A1.3.6. If a pair of morphisms $a, b : R \to A$ is such
that $\langle a, b \rangle : R \to A \times A$ is monic, then they can be thought of as a relation
on A. Then the usual notions of reflexive, symmetric and transitive can
be translated into categorical terms by the usual logical interpretation.
Our condition then says there is a quotient morphism $q : A \to B$ such
that (a, b) is the kernel pair of q, i.e. they complete the pullback square
of q pulled back against itself.)

7 $\mathcal{E}$ has a separating set S (not a proper class) of objects (i.e. if
$f, g : A \to B$ are such that for every $u : C \to A$ with $C \in S$ we have
$f \circ u = g \circ u$, then f = g).


Condition (1) says that $\mathcal{E}$ is Cartesian, and enables us to interpret the logic
of conjunction and equality, as well as substitution.

Adding condition (2) makes $\mathcal{E}$ regular (Johnstone, 2002a, A1.3.3) and
enables us to interpret the logic of $\exists$; preservation under pullback gives the
Frobenius rule. It also enables the technique (e.g. Theorem 8.28) of defining a
morphism by its graph.

Adding conditions (3) and (4) makes $\mathcal{E}$ geometric (Johnstone, 2002a, A1.4.18)
and enables us to interpret the logic of arbitrary disjunction; preservation under
pullback gives the frame distributivity rule. This structure is already sufficient
for interpreting all the first-order part of geometric logic.

Adding condition (5) makes $\mathcal{E}$ $\infty$-positive, and then adding (6) makes it
an $\infty$-pretopos (Johnstone, 2002a, A1.4.19). These enable us to interpret the
geometric type theory. In particular, the Initial and Free Model Theorems 8.25
and 8.26 still work for models of Cartesian theories in $\mathcal{E}$. The proofs can be
checked to go through; more explicitly, the proof in Palmgren and Vickers,
2007 is valid in Heyting pretoposes, and that includes Grothendieck toposes.

Condition (7) is a “smallness” condition. It allows us to deduce, in
Giraud’s Theorem, that although $\mathcal{E}$ is large, it can still be generated from a
small structure.

We wish to find the appropriate notion of Lindenbaum algebra, in the form of 
a Grothendieck topos, for a geometric theory. This will be called the classifying 
topos for the theory. The central result for an ordinary Lindenbaum algebra 
was Proposition 8.3. We shall replace “Boolean algebra” by “Grothendieck 
topos”, but we also need to know the appropriate notion of “homomorphism of 
Grothendieck topos”. 

All the structure needed for geometric logic and type theory (including the 
image factorization and the joins of subobjects) can be constructed using finite 
limits and arbitrary colimits, and conversely those are geometric type con- 
structs (characterizable uniquely up to isomorphism by geometric structure and 
axioms). We shall therefore be interested in functors between toposes that 
preserve colimits and finite limits. A functor preserves colimits if it has a
right adjoint, and for Grothendieck toposes the converse can also be shown
(Johnstone, 2002b, Remark C2.2.10). A geometric morphism is an adjoint pair
of functors between toposes for which the left adjoint (which preserves all
colimits) preserves finite limits.

At this point we are going to introduce some non-standard notation arising out 
of the fundamental split personality of toposes—spatial (generalized spaces) or 
logical (generalized universes of sets). The real interest of geometric morphisms 
is that they are the topos analogue of continuous map: they are a notion from 
the spatial side. In fact, we shall often refer to them as maps. On the other 
hand the Grothendieck toposes as we know them up till now, the categories 
with structure that was used to interpret logic, are very much the generalized 
universes of sets. We shall introduce notation that distinguishes between the 
two sides in the same way as we distinguished between locales and frames. 
Thus although technically we are dealing with those categories, we shall use 
notation that allows them to pretend to be spaces. 

If we declare a symbol (e.g. X) to denote a topos, we shall nonetheless 
reserve its use for the topos in its spatial aspect. When we want to refer to it in 
its logical aspect, in other words the actual category, we shall write SX. We 
shall call the objects of SX the sheaves over the topos rather than the objects 
of the topos. The symbol S can be read as standing for “sheaves”. 

We therefore define: 


DEFINITION 8.32 Let X and Y be two toposes. A geometric morphism (or
map) $f : X \to Y$ is a pair of functors

\[
\begin{aligned}
f^* &: SY \to SX \\
f_* &: SX \to SY
\end{aligned}
\]

such that $f^*$ is left adjoint to $f_*$, and $f^*$ preserves finite limits.
$f^*$ is called the inverse image part, and $f_*$ the direct image part.
We write Map(X, Y) for the class of geometric morphisms from X to Y.


Note the directions! The structure preserving functor is f*, and this goes 
in the reverse direction to the geometric morphism f. As we shall see later, 
f* is analogous to the inverse image function for a continuous map between 
topological spaces. (However, the “direct image part” is not analogous to the 
direct image function.) 


PROPOSITION 8.33 Let $f : X \to Y$ be a geometric morphism. Then $f^*$
preserves free model constructions for Cartesian theories.


Proof (See Johnstone, 2002b, D5.3.7 for the case of free algebras over sets for
a single-sorted theory.) Let $\alpha : T_1 \to T_2$ be a theory morphism between two
Cartesian theories. Let A be a $T_1$-algebra in SY, and let $h : A \to \alpha^*(T_2(A))$
be a free $T_2$-algebra over it. (h is a $T_1$-homomorphism.) It is required to show
that $f^*(h) : f^*(A) \to f^*(\alpha^*(T_2(A)))$ is a free $T_2$-algebra over $f^*(A)$.

For models of Cartesian theories, any functor that preserves finite limits will
transform models to models. This applies to both $f^*$ and $f_*$. They also both
preserve model reduction $\alpha^*$. Moreover, the adjunction of $f^*$ and $f_*$ extends
to models: there is a bijection between homomorphisms $f^*(A) \to B$ in SX
and homomorphisms $A \to f_*(B)$ in SY. If B is a reduct $\alpha^*(B')$, then we see
that $T_1$-homomorphisms $f^*(A) \to B$ are equivalent to $T_2$-homomorphisms
$T_2(A) \to f_*(B')$ and hence to $T_2$-homomorphisms $f^*(T_2(A)) \to B'$. Hence
$f^*(T_2(A))$ is the free $T_2$-model over $f^*(A)$, as required. QED
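
The chain of equivalences used in the last step of the proof can be displayed as follows. This is only a restatement of the argument; the notation $\mathrm{Hom}_{T_i}$ for sets of $T_i$-homomorphisms is ours, and $B'$ ranges over $T_2$-models in SX.

```latex
\[
\begin{aligned}
\mathrm{Hom}_{T_1}\bigl(f^*(A), \alpha^*(B')\bigr)
&\cong \mathrm{Hom}_{T_1}\bigl(A, f_*(\alpha^*(B'))\bigr)
  && \text{(adjunction of $f^*$ and $f_*$ on models)} \\
&\cong \mathrm{Hom}_{T_1}\bigl(A, \alpha^*(f_*(B'))\bigr)
  && \text{($f_*$ preserves model reduction)} \\
&\cong \mathrm{Hom}_{T_2}\bigl(T_2(A), f_*(B')\bigr)
  && \text{(freeness of $T_2(A)$ over $A$)} \\
&\cong \mathrm{Hom}_{T_2}\bigl(f^*(T_2(A)), B'\bigr)
  && \text{(adjunction of $f^*$ and $f_*$ on models)}
\end{aligned}
\]
```

Reading the composite from top to bottom gives exactly the universal property of a free $T_2$-model over $f^*(A)$.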


REMARK 8.34 We can now state a general semantic characterization of “geometric
type construct”. They are those constructs that can be carried out in any
Grothendieck topos, and are preserved by inverse image functors of geometric 
morphisms. (Remember that those inverse image functors are the analogues of 
homomorphisms between Lindenbaum algebras.) Those we have seen include 
finite limits (in set-theoretic terms: products, singletons (as terminal object), 
fibred products (pullbacks) and equalizers); arbitrary colimits (disjoint unions, 
quotients); images; free model constructions for theory morphisms between 
Cartesian theories (including the natural numbers, finite powersets and list 
objects); and integers Z and rationals Q, and associated structure including 
arithmetic and inequalities. 

These are type constructs that can be permitted, informally, in a geometric 
or coherent type theory. 


Our next result is the analogue of the specialization order on maps between 
locales (Definition 8.10). 


THEOREM 8.35 Let X and Y be toposes. Then:

1 Map(X, Y) is a category. The morphisms are called specialization
morphisms, or natural transformations. If $\alpha$ is a specialization morphism
from f to g, then we write $\alpha : f \Rightarrow g$.

2 Composition with maps on either side is functorial.

3 Composition with maps satisfies the “interchange law”. Suppose
$\alpha : f \Rightarrow g$ in Map(X, Y), and $\beta : h \Rightarrow k$ in Map(Y, Z). Then the following
diagram commutes.

\[
\begin{array}{ccc}
h \circ f & \xrightarrow{\ \beta \circ f\ } & k \circ f \\
{\scriptstyle h \circ \alpha} \downarrow & & \downarrow {\scriptstyle k \circ \alpha} \\
h \circ g & \xrightarrow{\ \beta \circ g\ } & k \circ g
\end{array}
\]

This allows us to define a horizontal composition $\beta \circ \alpha : h \circ f \Rightarrow k \circ g$.

Proof 1. Mac Lane and Moerdijk, 1992, Sec. VII.1. A morphism $\alpha$ from f
to g is defined as a natural transformation from $f^*$ to $g^*$. These are equivalent
to natural transformations from $g_*$ to $f_*$. (Note the reversal of direction.)

2, 3. These are obvious and come from horizontal composition of natural
transformations. (Mac Lane, 1971) QED


An important feature of maps is that we can take filtered colimits. These 
are a categorical generalization of the directed joins of locale maps (Proposi- 
tion 8.11). 


DEFINITION 8.36 Let C be a category. Then C is filtered if it satisfies the
following conditions.

1 C has an object.

2 If A and B are objects of C, then there is an object C with morphisms
$f : A \to C$ and $g : B \to C$.

3 If A and B are objects of C, and $f, g : A \to B$, then there is an object C
and morphism $h : B \to C$ such that $h \circ f = h \circ g$.


To put this more concisely, C is filtered iff every finite diagram in C has a 
cocone. A poset is filtered iff it is directed. 
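
For a finite poset the filteredness conditions reduce to a simple check, sketched here in Python. The function name `is_directed` and the divisibility order used as an example are our own.

```python
from itertools import product

def is_directed(elements, leq):
    """A poset is directed if it is inhabited and every pair has an upper bound."""
    elements = list(elements)
    if not elements:
        return False  # condition 1: there must be an object
    # Condition 3 is automatic in a poset, because parallel morphisms are equal,
    # so only condition 2 (upper bounds for pairs) remains to be checked.
    return all(
        any(leq(a, c) and leq(b, c) for c in elements)
        for a, b in product(elements, repeat=2)
    )

divides = lambda a, b: b % a == 0
```

Under divisibility, {1, 2, 3, 6} is directed, whereas {1, 2, 3} is not, since 2 and 3 have no common upper bound in it.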

Composition with maps preserves filtered colimits, and this preservation of 
filtered colimits is an important property of maps, analogous to Scott continuity. 


THEOREM 8.37 Let X and Y be Grothendieck toposes. Then: 
1 Map(X,Y) has all filtered colimits. 


2 Composition with maps on either side preserves filtered colimits. 


Proof 1. Suppose we have a filtered diagram of maps $f_i$. Then $\operatorname{colim}_i f_i$ is
calculated by

\[
(\operatorname{colim}_i f_i)^*(B) = \operatorname{colim}_i (f_i^*(B)).
\]

Regardless of filteredness, this will preserve colimits. The filteredness ensures
that it preserves finite limits, because filtered colimits commute with finite
limits.

2. For precomposition by $g : W \to X$, this follows from the fact that $g^*$
preserves colimits. For postcomposition by $h : Y \to Z$, it is trivial. QED
4.3 Elementary toposes


Grothendieck toposes have the structure needed to interpret geometric logic. 
Surprisingly, they also turn out to have structure for interpreting full first-order 
logic and even higher-order logic, though that structure is not geometric—it is 
not preserved by inverse image functors. (In this it is like the Heyting arrow, 
which exists in frames but is not preserved by frame homomorphisms.) This 
led to a generalized notion of topos, the elementary topos, which embodies the 
finitary part of that fuller structure. 

This structure allows $\neg$, $\rightarrow$ and $\forall$ as connectives for constructing formulae,
and so allows the coherent axioms to be formulae. It is only with this step
that the differences between classical and constructive logic become visible.
Characteristically classical axioms such as

\[
\begin{aligned}
&\neg\neg\phi \rightarrow \phi && \text{(double negation rule)} \\
&\phi \vee \neg\phi && \text{(excluded middle)}
\end{aligned}
\]

cannot be stated in coherent form, since they require negation to be used as a
connective.

A minimal definition uses the notion of powerobject P(A), an object whose 
elements at stage B are the subobjects of B x A (so the global elements are the 
subobjects of A, like a powerset). 


DEFINITION 8.38 An elementary topos is a Cartesian category with a power 
object P(A) for every object A. 


The standard texts (Mac Lane and Moerdijk, 1992, IV.1, Johnstone, 2002a)
show how much more structure can be deduced from this. In particular, an
elementary topos is Cartesian closed: if A and B are objects, then there is a
further object $B^A$, the exponential, whose elements at stage C are in bijection
with the morphisms from $C \times A$ to B. It also has finite colimits, and the
subobject pullback functions $f^*$ have both left adjoints $\exists_f$ (as required in
Sec. 4.1 to interpret $\exists$) and right adjoints $\forall_f$ (which are needed for $\forall$).
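
In Set the bijection that characterizes the exponential is ordinary currying. The following sketch, with names `curry` and `uncurry` and a sample function `g` chosen by us, checks the round trip on small finite sets.

```python
from itertools import product

def curry(g):
    """Send g : C x A -> B to the corresponding function C -> B^A."""
    return lambda c: (lambda a: g((c, a)))

def uncurry(h):
    """Send h : C -> B^A back to a function C x A -> B."""
    return lambda pair: h(pair[0])(pair[1])

C = range(3)
A = range(4)
g = lambda pair: pair[0] * 10 + pair[1]     # some function C x A -> B

round_trip = uncurry(curry(g))
agree = all(round_trip((c, a)) == g((c, a)) for c, a in product(C, A))
```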

The powerobject P(1) is known as the subobject classifier, $\Omega$. Its elements
at stage A are the subobjects of A. In particular, its global elements are the
subobjects of 1 and can be thought of as truth values. In Set, $\Omega = 2$ where 2
is defined as the coproduct 1 + 1. But, logically, this implies excluded middle,
and does not hold in general. 2 is the object of decidable truth values.

Note that we obtain type constructors in elementary toposes that are
non-geometric, that is, not preserved by inverse image functors. Important examples
include $\Omega$, powersets, function sets (exponentials) and the set of reals.

The existence of the subobject classifier has a big effect on the way the
logic is interpreted. Subobjects of A are now equivalent to their characteristic
morphisms $A \to \Omega$, and so logical formulae can be interpreted as terms of type
$\Omega$. This is formalized in the Mitchell-Bénabou language (see Mac Lane and
Moerdijk, 1992, VI.5). Moreover, the logical connectives are interpreted as
operations on $\Omega$. This account of interpreting logic in toposes is rather different
in appearance from the one we have described, though for coherent logic they
are equivalent.
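
In Set, where the subobject classifier is the two-element set of truth values, the correspondence between subsets and characteristic functions is easy to exhibit. The sketch below is specific to Set and uses Python Booleans for the classifier; the helper names are ours, and the classical behaviour of implication seen here is exactly what fails in a general topos.

```python
def characteristic(subset):
    """Characteristic function A -> {False, True} of a subset of A."""
    return lambda a: a in subset

def extension(chi, A):
    """Recover the subset classified by chi."""
    return {a for a in A if chi(a)}

A = set(range(6))
evens = {a for a in A if a % 2 == 0}
small = {a for a in A if a < 3}

chi_evens = characteristic(evens)
chi_small = characteristic(small)

# Connectives act on truth values pointwise; here conjunction and implication.
conj = lambda a: chi_evens(a) and chi_small(a)
impl = lambda a: (not chi_evens(a)) or chi_small(a)
```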

An elementary topos need not have a natural numbers object (characterized 
as initial induction algebra). However, it is of vital importance when it is 
present since then analogues of Theorems 8.25 and 8.26 will hold. Johnstone, 
2002b, D5.3.5 covers the case of free algebras over sets for finitarily presented 
single-sorted algebraic theories. 


4.4 Classifying toposes 


We can now give the definition of classifying topos, as Lindenbaum algebra 
for predicate geometric theory. This is the analogue of Proposition 8.3, though 
note that we have replaced homomorphisms by “maps”, going in the opposite 
direction. 

Suppose T is a geometric type theory, X a Grothendieck topos and M a 
model of T in SX. Then for every Grothendieck topos W we have a functor 


\[
(-)^*(M) : \mathrm{Map}(W, X) \to \mathrm{Mod}_{SW}(T)
\]


that takes a map f to the model f*(M). This is indeed a model, because f* 
preserves all the geometric structure used to define modelhood. 

Note also that it is a functor: natural transformations between maps are
taken to homomorphisms between models (exercise!).


DEFINITION 8.39 Let T be a geometric type theory. A classifying topos for T
is a Grothendieck topos [T], equipped with a generic model G of T in S[T], such
that for every Grothendieck topos W the functor $(-)^*(G)$ is an equivalence of
categories.


We adapt the definition from Johnstone, 2002a, B4.2.1(b) with changes of 
notation. 

Note the effect of having only an equivalence. The correspondence between
models and maps is only up to isomorphism: if M is a model in SW, then
there is some map $f : W \to [T]$ such that $M \cong f^*(G)$. However, given maps
f and g, there is a bijection between homomorphisms $f^*(G) \to g^*(G)$ and
natural transformations $f \Rightarrow g$. A consequence of this is that the classifying
topos itself is defined only up to categorical equivalence.

We now have an alternative reading to the symbol S. If S stands for “sets”,
then S[T] can be read as “the category of sets with a model of T freely adjoined”.
This is in line with some existing notation (Johnstone, 2002a, B4.2.1) and is
analogous to notation such as R[X] for a polynomial ring.

There are some crucial results that cannot be proved without a closer
examination of the structure of classifying toposes. We defer that to Sec. 5, and
meanwhile look at the use of classifying toposes. The crucial results are:


■ Every geometric theory has a classifying topos (Theorem 8.65).

■ Every geometric type theory has a classifying topos (Theorem 8.66, with
some restrictions on the generality).

■ Every Grothendieck topos classifies some geometric theory (Theorem 8.67).

■ For propositional geometric theories, the maps between the locales are
equivalent to the maps between their classifying toposes (Theorem 8.71).
Hence for these the locale and topos treatments are equivalent, and locales
can be considered a special case of toposes.


Just as for locales, we define a point of a topos X at stage W to be a map 
from W to X. Then maps act as point transformers by postcomposition. 

The empty theory (∅, ∅) with no symbols and no axioms has a unique model
in any category, given by the vacuous interpretation, so $\mathrm{Mod}_{SW}(\emptyset, \emptyset)$ is the
category with one object and one (identity) morphism. We write 1 for its
classifying topos.


PROPOSITION 8.40 $S1 \simeq \mathrm{Set}$.


Proof Mac Lane and Moerdijk, 1992, Sec. VII.1. Every set A is a coproduct
of copies of the terminal object (singleton set), which we shall also write
1. (There should be no confusion between the different 1s.) Hence for any
Grothendieck topos X an inverse image functor $!^* : \mathrm{Set} \to SX$ has to take
each set A to a coproduct of an A-indexed family of copies of 1. Moreover, any
such functor preserves finite limits. (This is non-trivial, and relies on the
properties of Grothendieck toposes.) The category of such functors is equivalent
to the category with one object and one morphism. In other words, there does
exist such a functor, and for any two such functors (with different choices of
coproducts) there is a unique natural isomorphism between them. It is easy to
show from the adjunction that if B is an object of SX then $!_*(B)$ is isomorphic
to the set of global elements of B, morphisms $1 \to B$. QED


As before, points at stage 1 are called global. The global points of [T] are 
equivalent to the models of T in Set. 

Let O be the theory with one sort and no functions, predicates or axioms. 
Categorically, it is the theory of “objects”, since a model in SX is just an 
object of SX, and its classifying topos is called the object classifier (not to be 
confused with the subobject classifier $\Omega$ that exists in any elementary topos).
In generalization of the terminology for spaces, we call the objects of SX the
sheaves over X, and they are equivalent to maps from X to [O]. Since the
global points of [O] are sets, our intuition is that [O] is “the space of sets”, and
in Sec. 5 we shall see why it is a reasonable intuition to think of a sheaf over
X as a continuous map from X to a space of sets.


4.5 Maps between classifying toposes 


Now that our “Lindenbaum algebras” are Grothendieck toposes, we can—as 
we have seen—interpret large amounts of ordinary mathematics internally in 
them. This makes them very different from the lattices we used for propo- 
sitional logics, and this has a profound effect on the way we can use these 
logical techniques. It makes it possible to treat classifying toposes [T] in a very 
spatial way. 

Suppose $T_1$ and $T_2$ are two geometric theories. By definition of classifying
toposes, a geometric morphism $f : [T_1] \to [T_2]$ is equivalent to a model M of
$T_2$ in $S[T_1]$. Now all the objects and morphisms in $S[T_1]$ are constructed out
of the generic model G of $T_1$, and indeed can be constructed using finite limits
and arbitrary colimits. It follows that M too has to be constructed out of the
generic $T_1$-model. Let us portray this naively as a model transformation.


1 We declare “Let G be a model of $T_1$.”
2 We construct a model M of $T_2$.


Within the scope of the declaration (1), our logic and mathematics are to
be interpreted in $S[T_1]$ with G the generic $T_1$-model. This means it must be
constructively valid. We thus have a temporary change of mathematics. Back
outside the scope of the declaration, returning to our ambient mathematics, we
find our model construction gives a geometric morphism $f : [T_1] \to [T_2]$.

The same technique also works for natural transformations. If we define two
models M and M′, and a homomorphism $\theta_G : M \to M'$, then that gives us
two maps $f, f' : [T_1] \to [T_2]$ and a natural transformation $\theta : f \Rightarrow f'$.

On the face of it, in step (2) we could use any mathematics validly
interpretable in $S[T_1]$. For instance, we might use $\Omega$ or function types, since in fact
$S[T_1]$ is an elementary topos. However, there are good reasons for restricting
to geometric constructions.

If we have a point x of $[T_1]$ at stage W, that is to say a model $x^*(G)$ of
$T_1$ in SW, G being the generic model of $T_1$, then we can apply f to it by
composition and get a model $x^*(M)$ of $T_2$ in SW, corresponding to $f \circ x$. If
the construction of M from G (in $S[T_1]$) is geometric, then it is preserved by
$x^*$, and so the same construction constructs $x^*(M)$ out of $x^*(G)$. Hence the
geometric construction works uniformly, not only for the generic point but for
all points.

We therefore see that geometric morphisms between classifying toposes can
be viewed as geometric model transformations.

It is the geometric working that enables us to view a topos spatially as com- 
prehending all generalized points, because it allows us to transport our math- 
ematics from one stage of definition to another along inverse image functors. 
Since, as we shall see later, geometric morphisms generalize continuous maps 
in topology, another way to view the role of the geometric constructions is that 
they have an intrinsic continuity. 

This same view of map also provides a good way to think about generalized
points. A point of X at stage Y, in other words a map $Y \to X$, is conveniently
thought of as a point of X “parametrized by” a variable point of Y.


EXAMPLE 8.41 Reduct maps. Let $F : U \to T$ be a theory morphism between
geometric theories (Definition 8.19). Then every model of T is trivially a model
of U by model reduction. This defines a reduct map $\mathrm{Red}_F : [T] \to [U]$.


In fact any geometric morphism can be expressed as a reduct map. 


THEOREM 8.42 Let $f : [T] \to [U]$ be a geometric morphism. Then there is a
geometric theory $T'$ equivalent to T and with a theory morphism $F : U \to T'$
such that f factors as

\[ [T] \simeq [T'] \xrightarrow{\;\mathrm{Red}_F\;} [U]. \]


Proof In S[T] we have the generic model G of T and a model M of U 
given by f. The result can be proved using the conventional techniques of 
Sections 5.3 and 5.4. It appears in detail in Viglas, 2004. However, here is 
a more conceptual reason. Each ingredient of M can be constructed from the 
ingredients of G using colimits and finite limits. T can be extended with sorts 
for such colimits or finite limits, together with structure and axioms to force
them to be those colimits or limits. The extended theory is equivalent to T,
since its models are determined up to unique isomorphism by their T-reducts. 
But there is also an obvious theory morphism from U. QED 


From this one can easily deduce results such as the following. 


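PROPOSITION 8.43 Let $f_i : Y_i \to X$ be a map between Grothendieck toposes
(i = 1, 2). Then there is a pseudo-pullback square

\[
\begin{array}{ccc}
Z & \xrightarrow{\;p_1\;} & Y_1 \\
{\scriptstyle p_2}\downarrow & \cong & \downarrow{\scriptstyle f_1} \\
Y_2 & \xrightarrow{\;f_2\;} & X
\end{array}
\]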


(A pseudo-pullback is like a pullback except that the square is required to
commute only up to isomorphism.)

Proof Suppose $Y_i$ and X classify theories $T_i$ and U. We can factor each $f_i$ as
an equivalence followed by a reduct map for a theory morphism $F_i : U \to T_i$.
Hence, we may assume that each $f_i$ is already a reduct map. Now define P as
follows. It has $T_1$ and $T_2$ put together disjointly, giving a theory morphism from
each $T_i$. This now has two copies of U, the images of the two theory morphisms.
Add function symbols and axioms to make mutually inverse homomorphisms
between those two copies of U. Then a model of P comprises a model $M_i$ of
each $T_i$, and an isomorphism between their U-reducts, and this is exactly what
is needed for the pseudo-pullback property for Z = [P]. QED


Hence, the points of Z are equivalent to triples $(y_1, y_2, \theta)$ where each $y_i$ is a
point of $Y_i$ and $\theta : f_1(y_1) \cong f_2(y_2)$ is an isomorphism. When X = 1, we get
a product $Y_1 \times Y_2$ whose points are pairs of points from $Y_1$ and $Y_2$.

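By similar means we can construct a topos $Z'$ whose points $(y_1, y_2, \theta)$ have
$\theta : f_1(y_1) \to f_2(y_2)$ a homomorphism. The resulting square

\[
\begin{array}{ccc}
Z' & \xrightarrow{\;p'_1\;} & Y_1 \\
{\scriptstyle p'_2}\downarrow & \overset{\theta}{\Rightarrow} & \downarrow{\scriptstyle f_1} \\
Y_2 & \xrightarrow{\;f_2\;} & X
\end{array}
\]

is called a comma square. It does not commute, but has a natural transformation
from $f_1 \circ p'_1$ to $f_2 \circ p'_2$.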

Our next result shows vividly how geometric morphisms between classifying 
toposes can appear like functors between model categories—indeed, by taking 
points one extracts the functors. But as a geometric morphism, it carries extra 
information that it has continuity properties—for example, that it preserves 
filtered colimits. (This fact about geometric morphisms was exploited in Viglas, 
2004 for proving that certain functors preserved filtered colimits.) 

Its notion of adjunction between toposes is technically possible because 
the natural transformations make the category of toposes and maps into a 
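2-category. An adjunction between X and Y comprises maps $F : X \to Y$
and $G : Y \to X$ (the left and right adjoints), and natural transformations
$\eta : \mathrm{Id}_X \Rightarrow G \circ F$ and $\varepsilon : F \circ G \Rightarrow \mathrm{Id}_Y$ such that the two composites

\[
\begin{aligned}
(F \circ \eta);(\varepsilon \circ F) &: F \Rightarrow F \circ G \circ F \Rightarrow F \\
(\eta \circ G);(G \circ \varepsilon) &: G \Rightarrow G \circ F \circ G \Rightarrow G
\end{aligned}
\]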


are both identities (cf. Mac Lane, 1971, IV.1 Theorem 2(v)). 

This can all be worked through in terms of geometric transformations.
However, Viglas, 2004 simplifies it greatly. Once the maps F and G have been
defined, it suffices to use a geometric argument of the following form (where
x and y are points of X and Y respectively). It is analogous to a more
familiar characterization of adjunction, but the geometricity guarantees all the
functoriality and naturality required.

■ For each specialization (homomorphism) $\phi : x \Rightarrow G(y)$, define
$\alpha(\phi) : F(x) \Rightarrow y$.

■ For each $\psi : F(x) \Rightarrow y$, define $\beta(\psi) : x \Rightarrow G(y)$.

■ Show $\beta(\alpha(\phi)) = \phi$ and $\alpha(\beta(\psi)) = \psi$.


THEOREM 8.44 Let $F : T_1 \to T_2$ be a morphism between two Cartesian
theories. Then the reduct map $\mathrm{Red}_F : [T_2] \to [T_1]$ has a left adjoint
$\mathrm{Free}_F : [T_1] \to [T_2]$.


Proof Constructing a free $T_2$-model over a $T_1$-model is geometric, and so the
map $\mathrm{Free}_F$ is defined by saying for any $T_1$-model M, $\mathrm{Free}_F(M)$ is the free
$T_2$-model over it.

The adjunction arises here because the corresponding adjunction within any
Grothendieck topos is geometric. (This can be proved from Proposition 8.33.)
QED


4.6 Localic toposes 


We now have two ways to deal with propositional geometric theories: as 
locales or as toposes (which in this case are called localic). Theorem 8.71 will 
show that locales and localic toposes are equivalent, but for the moment we 
look at some of the topos behaviour in its own right. 


DEFINITION 8.45 A topos is localic if it classifies a propositional geometric 
theory. 
A geometric type theory is essentially propositional if it has no sorts. 


The theory Ded of Dedekind sections (Sec. 3.5) is essentially propositional. 

We conjecture that the next result holds more generally, for instance when 
type constructs are applied to propositions (as subsingletons). However, our 
restricted proof is at least enough to cover our applications. 


THEOREM 8.46 Let T be an essentially propositional geometric type theory
whose types can all be constructed in the empty theory. Then [T] is localic.


Proof If $\tau$ is a type in T, it has an interpretation $[\![\tau]\!]_X$ in any Grothendieck
topos SX. We write $[\![\tau]\!]$ for $[\![\tau]\!]_1$ (in Set).

We may assume without loss of generality that T is presented without any
function symbols, but only predicates. This is because any function symbol can
be replaced by a predicate for its graph, with axioms for single-valuedness and
totality. We now show how T may be converted into an equivalent propositional
geometric theory $T'$.

The propositions of $T'$ are as follows. If $S \subseteq \tau_1 \times \cdots \times \tau_n$ is a predicate
symbol in T, then for each $\vec{a} \in \prod_{i=1}^{n} [\![\tau_i]\!]$ we introduce a proposition $S_{\vec{a}}$. Now
for each formula in context $(\vec{x}.\phi)$ in T, and for each $\vec{a} \in \prod_i [\![\sigma(x_i)]\!]$, we define
a formula $\phi_{\vec{a}}$ in $T'$ by induction as follows.


1 $(\bigvee_j \phi_j)_{\vec{a}} = \bigvee_j (\phi_j)_{\vec{a}}$, and similarly for conjunctions.
2 $(x_i = x_j)_{\vec{a}} = \bigvee \{\top \mid a_i = a_j\}$.
3 $((\exists y)\phi)_{\vec{a}} = \bigvee \{\phi_{\vec{a}b} \mid b \in [\![\sigma(y)]\!]\}$.


Finally, for each axiom $(\forall \vec{x})(\phi \to \psi)$ of T, we give $T'$ axioms $\phi_{\vec{a}} \to \psi_{\vec{a}}$
($\vec{a} \in \prod_i [\![\sigma(x_i)]\!]$).

Note that, even if T is a coherent type theory (no infinitary disjunctions) and
finitely presented (only finitely many symbols and axioms), $T'$ is likely to have
infinitely many symbols and axioms, and infinitary disjunctions.

We now show that T and $T'$ are equivalent. In Set, $[\![\tau]\!] \cong \sum_{a \in [\![\tau]\!]} 1$, and
since this is geometric it also holds in any SX. It follows that subobjects of
$[\![\tau]\!]_X$ correspond to $[\![\tau]\!]$-indexed families of subobjects of 1 in SX. Hence
structures for T are equivalent to structures for $T'$. Now suppose that M is
a structure for T, and $M'$ the corresponding structure for $T'$. By structural
induction on the formula $\phi$, one can then show that for any formula in context
$(\vec{x}.\phi)$, the subobject $\{M|\phi\}$ of $\prod_i \{M|\sigma(x_i)\}$ corresponds to the family of
subobjects $\{M'|\phi_{\vec{a}}\}$ for $\vec{a} \in \prod_i [\![\sigma(x_i)]\!]$. From this one deduces that M is a
model for T iff $M'$ is a model for $T'$. QED







PROPOSITION 8.47 Let X be a localic topos and let x and x′ be points of it.
Then there is at most one homomorphism from x to x′.


Proof We can take X = [T] where T is propositional. But then with no sorts,
a homomorphism needs to supply no carrier functions. The sole requirement is
that if P is a propositional symbol and {x|P} holds (topologically, x is in the
open P), then so does {x′|P}. QED





Hence if X is localic then Map(Y, X) is a preorder. We write $x \sqsubseteq x'$ if
there is a homomorphism from x to x′; this is called the specialization order
on points. Later we shall prove that this agrees with the specialization order we
have already defined for locales.

Sec. 4.5 showed that maps between toposes can be defined as geometric 
model transformations, and this still applies to localic toposes. But Theo- 
rem 8.71 will show that those also define maps between the locales. If the 
locales are spatial, Proposition 8.16 shows that we then get continuous maps 
between the spaces. Thus geometricity of the model transformation is enough 
to guarantee continuity. As a logical approach to continuity, geometric logic
works by starting with ordinary logic and then removing the structure (e.g.
negation) that makes it possible to define non-continuous functions. Compare
this with other approaches, such as topology itself, or the modal logic of interior,
that work by adding structure to support a bureaucracy of continuity proofs.


4.7 Example: the reals 


We have now seen two geometric theories that purport to represent the real
line. In Sec. 2.5, $T_{\mathbb{R}}$ was a propositional geometric theory described as the
localic reals, while in Sec. 3.5 Ded was a predicate geometric theory whose
models are the Dedekind sections. We now show that they are equivalent.

By the proof of Theorem 8.46 we see that $T_{\mathbb{R}}$ is equivalent to a theory $T'_{\mathbb{R}}$
with a single predicate symbol $P \subseteq \mathbb{Q}^2$, and axioms

\[
\begin{aligned}
P(q,r) \wedge P(q',r') &\vdash_{qrq'r'} (\exists s t)\,(P(s,t) \wedge \max(q,q') < s < t < \min(r,r')) \\
0 < \varepsilon &\vdash_{\varepsilon} (\exists q)\, P(q-\varepsilon, q+\varepsilon)
\end{aligned}
\]

Given a model of $T'_{\mathbb{R}}$, we define a Dedekind section (L, R) geometrically by

\[
\begin{aligned}
L &= \{ q \in \mathbb{Q} \mid (\exists r)\, P(q,r) \} \\
R &= \{ r \in \mathbb{Q} \mid (\exists q)\, P(q,r) \}.
\end{aligned}
\]





It is easy to see that this is a Dedekind section. If q < r, let $\varepsilon = (r - q)/2$
and find s such that $P(s - \varepsilon, s + \varepsilon)$. If $q > s - \varepsilon$ and $r < s + \varepsilon$ then
$q + \varepsilon > s > r - \varepsilon$ and $r - q < 2\varepsilon$, a contradiction. Hence either $q \le s - \varepsilon \in L$
or $r \ge s + \varepsilon \in R$. (Note that the order on $\mathbb{Q}$ is decidable, so we can use this
proof by contradiction.) Also, P(q, r) holds iff $q \in L$ and $r \in R$.

Conversely, suppose (L, R) is a Dedekind section, and define P(q, r) if
$q \in L$ and $r \in R$. The first axiom for P is clearly satisfied. For the second,
take $\varepsilon > 0$ and find $q_0 \in L$ and $r_0 \in R$ so that $P(q_0, r_0)$. Find $n \in \mathbb{N}$ such
that $r_0 - q_0 < 2^{n+1}\varepsilon$. By induction on n, we show that there is some u with
$P(u - \varepsilon, u + \varepsilon)$. If n = 0, we can take $u = (q_0 + r_0)/2$, for $u - \varepsilon < q_0 < r_0 <
u + \varepsilon$. Now suppose $n \ge 1$. Let $s_i = q_0 + i(r_0 - q_0)/4$ ($0 \le i \le 4$). Since
$s_1 < s_2 < s_3$, we have (1) either $s_1 \in L$ or $s_2 \in R$, and (2) either $s_2 \in L$ or
$s_3 \in R$. Examining the possibilities, we can find $q_1$ and $r_1$ from amongst the
$s_i$ with $P(q_1, r_1)$ and $r_1 - q_1 = (r_0 - q_0)/2 < 2^n\varepsilon$.

The above geometric constructions give us maps $f : [T'_{\mathbb{R}}] \to [\mathrm{Ded}]$ and
$g : [\mathrm{Ded}] \to [T'_{\mathbb{R}}]$. Composing them, we see that $g \circ f \cong \mathrm{Id}_{[T'_{\mathbb{R}}]}$ and
$f \circ g \cong \mathrm{Id}_{[\mathrm{Ded}]}$. Hence the two theories are equivalent.

We have proved this solely on the hypothesis that the classifying toposes
exist. We have not had to analyse the structure of the classifying toposes at
all, beyond the knowledge that they are Grothendieck toposes and have generic
models.

Having shown these theories are equivalent, it is possible now to define the
real line $\mathbb{R}$ to be the classifying topos [Ded], the “space of Dedekind sections”.
This may seem heavy-handed. However, it tells us what the real numbers are
(the points of $\mathbb{R}$, i.e. the models of Ded). It also defines the topology. The opens
of [Ded] (i.e. the subobjects of 1 in S[Ded]; see Sec. 4.4) are equivalent to
those of $[T_{\mathbb{R}}]$, and Theorem 8.71 will show that they are equivalent to elements
of the frame $\Omega[T_{\mathbb{R}}]$ as defined in Sec. 2.5.

Let us use this definition of R to define a map. 


EXAMPLE 8.48 Addition $+ : \mathbb{R} \times \mathbb{R} \to \mathbb{R}$ is defined as follows. If x and y
are points of $\mathbb{R}$, then $q < x + y$ if $q = q_1 + q_2$ for some $q_1 < x$ and $q_2 < y$,
and $x + y < r$ if $r = r_1 + r_2$ for some $x < r_1$ and $y < r_2$.


See how we have defined a map (a geometric morphism) just by defining 
a geometric construction on points. Remarkably, this is enough to guarantee 
continuity of the corresponding function between spaces. 


5. Sheaves as predicates 


We have already defined sheaves as the objects of classifying toposes. This 
broadens the normal usage, which defines sheaves in a more technical way. In 
this section we analyse more closely the structure of classifying toposes and 
come to the technical definition. This will enable us to prove the crucial results 
announced in Sec. 4.4. 

Sheaves were defined first over topological spaces. Grothendieck subse- 
quently generalized the definition to sheaves over a site, but as he stressed, the 
category of sheaves (the Grothendieck topos) is more important than the site 
(which is in effect a particular form of geometric theory). The reason for this is 
essentially our “Lindenbaum algebra” methodology—it is in terms of the topos 
that we get a good definition of map. 

In propositional logic each proposition $\phi$ corresponds to a map $M \mapsto \{M|\phi\}$ from
models to truth values. In the geometric context this corresponds to a map
from the locale to S, and in ordinary topology this is an open.

In predicate logic, each formula $(\vec{x}.\phi)$ corresponds to a function $M \mapsto \{M|\vec{x}.\phi\}$
from models to sets. In a geometric context this is a map from the
classifying topos to the object classifier [O], and in ordinary topology it is sheaves
that provide the corresponding “continuous set-valued map”.


5.1 Sheaves over a topological space 


Suppose X is a space and S is a “continuous set-valued function” on it. For
this to make sense, we certainly need a set S(x) for each point x; this is called
the stalk of S at x. If we let Y be the disjoint union of the stalks we shall have a
projection $\pi : Y \to X$, and the stalks S(x) are recoverable as fibres $\pi^{-1}(\{x\})$.

We shall introduce a class of continuous maps $\pi$, the local homeomorphisms,
for which it turns out that sheaves can be derived by using the fibres as the
stalks.

Each stalk is to be a set, and we take from that that the fibre $\pi^{-1}(\{x\})$, as
a subspace of Y, should have the discrete topology. Thus within fibres, the
topology of Y should be discrete. On the other hand, across the fibres we might
argue that the topology of Y should be no finer than is got from X in order that
the dependence of $\pi^{-1}(\{x\})$ on x should be “continuous”.

Those are vague, but let us suggest that the second “across fibre” condition
requires $\pi$ to be an open map: it takes open sets to open sets.

For the first “within fibre” condition, consider that a space Z is discrete
iff the diagonal inclusion $\Delta : Z \to Z \times Z$, $\Delta(z) = (z, z)$, is open. To
see this, take $z \in Z$. By definition of the product topology, there are open
neighbourhoods U and V of z such that $U \times V \subseteq \Delta(Z)$. By replacing U and
V by their intersection, we might as well assume U = V. Now if $z' \in U$ then
$(z, z') \in U \times U \subseteq \Delta(Z)$, so z′ = z. Hence U = {z} and so {z} is open, so
the topology is discrete.
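For finite spaces the criterion can be checked mechanically. The sketch below is an illustration only, assuming a topology is given as a Python set of frozensets (all of its opens); the helper names are hypothetical. It tests openness of the diagonal exactly as in the argument above, by looking for opens U and V around each point with $U \times V$ inside the diagonal.

```python
def diagonal_is_open(points, opens):
    """True iff every (z, z) has a basic open U x V contained in the diagonal."""
    for z in points:
        nbhds = [U for U in opens if z in U]
        # U x V lies in the diagonal iff all pairs (a, b) in it have a == b.
        if not any(all(a == b for a in U for b in V)
                   for U in nbhds for V in nbhds):
            return False
    return True

def is_discrete(points, opens):
    """True iff every singleton is open."""
    return all(frozenset({z}) in opens for z in points)

# Two topologies on the set {0, 1}.
sierpinski = {frozenset(), frozenset({1}), frozenset({0, 1})}
discrete = {frozenset(), frozenset({0}), frozenset({1}), frozenset({0, 1})}
```

On these two examples the two tests agree: both hold for the discrete topology and both fail for the Sierpinski topology, where the only neighbourhood of 0 is the whole space.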

Generalizing to $\pi : Y \to X$, we use a “fibrewise discrete” property that the
inclusion $\Delta : Y \to Y \times_X Y$ should be open. ($Y \times_X Y$ is the fibred product,
or pullback, $\{(y_1, y_2) \mid \pi(y_1) = \pi(y_2)\}$.) This is more than enough to imply
that every fibre is discrete in its subspace topology.


PROPOSITION 8.49 Let $\pi : Y \to X$ be a map of spaces. Then the following
conditions are equivalent.


1 $\pi$ and the diagonal inclusion $\Delta : Y \to Y \times_X Y$ are both open.


2 Each $y \in Y$ has an open neighbourhood V such that $\pi$ restricted to V
is a homeomorphism onto an open neighbourhood of $\pi(y)$.


Proof First, note that $\Delta(Y)$ is open in $Y \times_X Y$ (as subspace of $Y \times Y$)
iff every $(y, y) \in \Delta(Y)$ has a basic open neighbourhood in $Y \times_X Y$ that is
contained in $\Delta(Y)$, in other words we can find neighbourhoods $V_1$ and $V_2$ of y
such that $(V_1 \times V_2) \cap (Y \times_X Y) \subseteq \Delta(Y)$. By restricting to $V_1 \cap V_2$ we might as
well assume $V_1 = V_2$. The condition $(V \times V) \cap (Y \times_X Y) \subseteq \Delta(Y)$ says that $\pi$ is
1-1 on V. To summarize, $\Delta$ is open iff every $y \in Y$ has an open neighbourhood
V on which $\pi$ is 1-1.

(1)⇒(2): If $y \in Y$, choose V as above. $\pi$ is a continuous bijection
from V onto $\pi(V)$, and since $\pi$ is open, we deduce that this bijection is a
homeomorphism.

(2)⇒(1): Let W be open in Y. If $y \in W$, then we can find $V_y$ as in condition
(2) and $\pi(W \cap V_y)$ is open. $\pi(W)$ is the union of these open sets $\pi(W \cap V_y)$
and hence is open. Hence $\pi$ is an open map. Openness of $\Delta$ follows from what
we have already said. QED


DEFINITION 8.50 Let $\pi : Y \to X$ be a continuous map between two topological
spaces. $\pi$ is a local homeomorphism (over X) if it satisfies the equivalent
conditions of the proposition.


If $\pi_i : Y_i \to X$ are two local homeomorphisms over X, then a morphism
from $\pi_1$ to $\pi_2$ is a map $f : Y_1 \to Y_2$ such that $\pi_2 \circ f = \pi_1$. We obtain a category
$\mathrm{LocHom}_X$ of local homeomorphisms over X.
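The two conditions of Proposition 8.49 can be compared on small finite examples. This is a sketch under stated assumptions, not part of the text: spaces are finite, a topology is a set of frozensets, the map is a dict `pi` from points of Y to points of X and is assumed continuous, and openness of the diagonal is tested through the summary in the proof (each point has an open neighbourhood on which the map is 1-1). All function and variable names are hypothetical.

```python
def image(pi, V):
    return frozenset(pi[y] for y in V)

def is_open_map(pi, Y_opens, X_opens):
    return all(image(pi, W) in X_opens for W in Y_opens)

def locally_injective(pi, Y_opens):
    # Equivalent, by the proof, to the diagonal Y -> Y x_X Y being open.
    return all(any(y in V and len(image(pi, V)) == len(V) for V in Y_opens)
               for y in pi)

def condition_1(pi, Y_opens, X_opens):
    return is_open_map(pi, Y_opens, X_opens) and locally_injective(pi, Y_opens)

def condition_2(pi, Y_opens, X_opens):
    def homeo_on(V):
        # pi restricted to V is a homeomorphism onto an open set iff it is 1-1,
        # its image is open, and it maps opens inside V to opens
        # (continuity of pi is assumed).
        if len(image(pi, V)) != len(V) or image(pi, V) not in X_opens:
            return False
        return all(image(pi, W) in X_opens for W in Y_opens if W <= V)
    return all(any(y in V and homeo_on(V) for V in Y_opens) for y in pi)

# Base space: the Sierpinski space on {0, 1}.
X_opens = {frozenset(), frozenset({1}), frozenset({0, 1})}
# Fold map: two disjoint copies of X, with (x, i) lying over x.
fold = {(x, i): x for x in (0, 1) for i in (0, 1)}
fold_opens = {frozenset((x, 0) for x in U0) | frozenset((x, 1) for x in U1)
              for U0 in X_opens for U1 in X_opens}
# Identity on points from the discrete two-point space: continuous, not open.
ident = {0: 0, 1: 1}
ident_opens = {frozenset(), frozenset({0}), frozenset({1}), frozenset({0, 1})}
```

The fold map satisfies both conditions, while the identity on points from the discrete space fails both, since the open {0} has a non-open image.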

It will turn out from a long train of argument that $\mathrm{LocHom}_X$ is a
Grothendieck topos. Local homeomorphisms are equivalent to sheaves as
presheaves, and then from the more general topos theory they are
classifying toposes. However, it is an illuminating exercise to prove it directly. The
geometric constructions needed in $\mathrm{LocHom}_X$ can all be constructed stalkwise
by elementary means.

For any map $\pi : Y \to X$, a local section of $\pi$ is a map $\sigma : U \to Y$, with
U open in X, such that $\pi \circ \sigma = \mathrm{Id}_U$. An open V as in Proposition 8.49 (2)
is equivalent to a local section of $\pi$ whose image is open. The other main
definition of sheaf uses sections, through the notion of presheaf: a presheaf on
any category C is a contravariant functor from C to the category Set of sets. For
a topological space, a presheaf on X is defined to be a presheaf on $\Omega X$. As with
any poset, the objects of $\Omega X$ are its elements, and the morphisms are the pairs
(U, V) with $U \subseteq V$; in other words, there is a single morphism from U to V
provided $U \subseteq V$. A presheaf F on $\Omega X$ has a set F(U) for each $U \in \Omega X$, and
if $U \subseteq V$ there is a restriction from F(V) to F(U), which we shall normally
write $\sigma \mapsto \sigma|U$.

A morphism of presheaves is just a natural transformation. For presheaves
over a space X, this means that a morphism from F to G has a family of
functions $f_U : F(U) \to G(U)$ ($U \in \Omega X$) that commute with the restriction
maps.


DEFINITION 8.51 Let X be a topological space. A presheaf F on X is a
sheaf if it satisfies the following pasting condition.

Let $U_i \in \Omega X$ ($i \in I$), and suppose for each i we have $\sigma_i \in F(U_i)$ such
that for all i, j we have $\sigma_i|(U_i \cap U_j) = \sigma_j|(U_i \cap U_j)$. Then there is a unique
$\sigma \in F(\bigcup_i U_i)$ such that for all i we have $\sigma|U_i = \sigma_i$.

A morphism of sheaves is just a presheaf morphism. We get a category SX
of sheaves over X.


Note the uniqueness. As an immediate consequence, by taking $I = \emptyset$ we see
that if F is a sheaf then $F(\emptyset)$ is a singleton. Note also that the same definitions
of presheaf and sheaf work over a locale.


EXAMPLE 8.52 Let $\pi : Y \to X$ be a local homeomorphism, and let the
presheaf $\mathrm{Sect}(\pi)$ be defined by

\[ \mathrm{Sect}(\pi)(U) = \{\sigma : U \to Y \mid \sigma \text{ is a local section of } \pi\}. \]

The restriction maps are ordinary domain restriction of functions. Then $\mathrm{Sect}(\pi)$
is a sheaf.

The process is functorial. If $\pi_i : Y_i \to X$ (i = 1, 2) are two local
homeomorphisms over X, and $f : Y_1 \to Y_2$ is a morphism between them, then
composition with f gives a sheaf morphism $\mathrm{Sect}(\pi_1) \to \mathrm{Sect}(\pi_2)$. We get a
functor $\mathrm{Sect} : \mathrm{LocHom}_X \to SX$.
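The pasting condition and the presheaf of sections can both be tried out on a finite space. The sketch below makes several assumptions of its own: the space is finite, a topology is a set of frozensets, a presheaf is a dict from opens to sets of sections together with a restriction function, and a section is stored as a frozenset of (x, y) pairs. The function names are hypothetical, and the check is brute force over all families of opens.

```python
from itertools import chain, combinations, product

def restrict(section, U):
    """Domain restriction of a section stored as a frozenset of (x, y) pairs."""
    return frozenset((x, y) for x, y in section if x in U)

def sections_presheaf(X_opens, Y_opens, pi):
    """Sect(pi)(U): continuous sigma : U -> Y with pi(sigma(x)) = x."""
    F = {}
    for U in X_opens:
        pts = sorted(U)
        fibres = [[y for y in pi if pi[y] == x] for x in pts]
        F[U] = set()
        for ys in product(*fibres):
            sigma = dict(zip(pts, ys))
            # Continuity: preimages of opens of Y are open (U itself is open).
            if all(frozenset(x for x in pts if sigma[x] in W) in X_opens
                   for W in Y_opens):
                F[U].add(frozenset(sigma.items()))
    return F

def is_sheaf(X_opens, F, restrict):
    """Brute-force check of the pasting condition of Definition 8.51."""
    opens = list(X_opens)
    families = chain.from_iterable(
        combinations(opens, n) for n in range(len(opens) + 1))
    for family in families:
        union = frozenset().union(*family)
        for choice in product(*(F[U] for U in family)):
            indexed = list(zip(family, choice))
            if not all(restrict(s, U & V) == restrict(t, U & V)
                       for (U, s), (V, t) in combinations(indexed, 2)):
                continue  # not a compatible family
            glued = [g for g in F[union]
                     if all(restrict(g, U) == s for U, s in indexed)]
            if len(glued) != 1:
                return False
    return True

# Sierpinski base space and the fold map from two disjoint copies of it.
X_opens = {frozenset(), frozenset({1}), frozenset({0, 1})}
pi = {(x, i): x for x in (0, 1) for i in (0, 1)}
Y_opens = {frozenset((x, 0) for x in U0) | frozenset((x, 1) for x in U1)
           for U0 in X_opens for U1 in X_opens}
sect = sections_presheaf(X_opens, Y_opens, pi)
# A constant presheaf with two elements everywhere, including over the empty set.
const = {U: {0, 1} for U in X_opens}
```

The constant presheaf fails the condition already for the empty family, because its value on the empty open is not a singleton, which is the consequence noted after Definition 8.51.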


THEOREM 8.53 $\mathrm{Sect} : \mathrm{LocHom}_X \to SX$ is an equivalence of categories.


Proof (Sketch) It is necessary to show that the functor Sect is full and faithful,
and essentially surjective. From $\mathrm{Sect}(\pi)$ we can recover the stalks, since the
stalk at x is the colimit of the sets $\mathrm{Sect}(\pi)(U)$ as U ranges over the open
neighbourhoods of x. Furthermore, we can recover the topology since the
images of the local sections form a base. Starting from an arbitrary sheaf F,
the same construction yields a local homeomorphism whose sheaf of sections
is isomorphic to F; this proves essential surjectivity.

Faithfulness is easy, but for fullness one must show that if $\pi_i : Y_i \to X$
(i = 1, 2) then every sheaf morphism $\alpha : \mathrm{Sect}(\pi_1) \to \mathrm{Sect}(\pi_2)$ comes from a
morphism f from $\pi_1$ to $\pi_2$. If $y \in Y_1$, find a section $\sigma : U \to Y_1$ whose image
contains y. Then f(y) is defined as $\alpha_U(\sigma)(\pi_1(y))$. One must prove that this
definition is independent of choice of $\sigma$, that f is continuous, that $\pi_2 \circ f = \pi_1$
and that $\mathrm{Sect}(f) = \alpha$. QED


5.2 Sheaves and local homeomorphisms for toposes 


For any topos X, the sheaves over X (the objects of SX) are equivalent to 
the maps X — [O]. Hence, by the methods of Sec. 4.5, to define a sheaf S we 
declare “let x be a point of X” and then, geometrically, define a set S(x). We 
therefore think of S as a continuous set-valued map on X. (However, except 
on global points, these are not sets in the sense of set theory, with the structure 
all defined through the ∈ relation. Geometric type theory is not done that way.)
We call S(x) the stalk of S at x, and this notation also suggests we might view 
the sheaf as a set parametrized by a variable point of X. 

Both the point transformation and the parametrization involve a radically new
notion of continuity, since [O] has far too few opens to be a useful topological
space in anything like the conventional sense. An open of [O], a map $[O] \to S$,
is a geometric definition of a truth value for each set S. There are three obvious
ways to do this: constant $\top$, constant $\bot$ and by the formula $(\exists a \in S)\,\top$. In
effect, we have three open subspaces of “the space of sets”: the whole space,
the empty space, and the space of inhabited sets. We shall later (Example 8.74)
be able to prove that, at least classically, these are the only three, and from
the localic point of view [O] cannot be distinguished from S. (Technically, S is
the “localic reflection” of [O]; see Definition 8.72.)


EXAMPLE 8.54 Let T be a geometric theory. The functors and natural
transformations $\{M|B\}$, $\{M|\vec{x}.\phi\}$, $\{M|\vec{x}.t\}$, etc. (as M varies) of
Remark 8.22 define sheaves and sheaf morphisms.


Stalks can be gathered together to make a new topos, analogous to a local
homeomorphism. Let O, elt be the theory with one sort and one constant
symbol, and let $p : [O, \mathrm{elt}] \to [O]$ be the obvious reduct map (which forgets
the constant). The points of [O, elt] are pairs (A, a) where A is a set (or, in
general, a sheaf over a topos) and a a global element of A.


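PROPOSITION 8.55 Let X be a topos and $A : X \to [O]$ a sheaf. Let X/A
be the topos given by the pseudo-pullback

\[
\begin{array}{ccc}
X/A & \longrightarrow & [O, \mathrm{elt}] \\
{\scriptstyle A^*p}\downarrow & \cong & \downarrow{\scriptstyle p} \\
X & \xrightarrow{\;A\;} & [O]
\end{array}
\]

Its points are pairs (x, a) with x a point of X and $a \in A(x)$. Then S(X/A) is
equivalent to the slice category (SX)/A, whose objects are morphisms in SX
with codomain A.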


Proof S(X/A) is got from SX by freely adjoining a global element $e : 1 \to A$.
From this one can construct, for any $v : C \to A$ in SX, the pullback along e
giving an object $e^*C$, and the result says in effect that every object of S(X/A)
comes from some v in this way.

It is straightforward to check that (SX)/A is a Grothendieck topos. (The
corresponding fact for elementary toposes is the “Fundamental Theorem of
Topos Theory”, Johnstone, 2002a, A2.3.) We have a functor $q^* : SX \to SX/A$,
with $q^*(B)$ the projection $B \times A \to A$, and it preserves colimits and
finite limits. In SX/A the final object is $\mathrm{Id}_A : A \to A$, and $q^*A$ has a global
element e given by the diagonal $\Delta : A \to A \times A$. Every object $v : C \to A$ is
the pullback of $q^*(v)$ against e.

Now suppose we have a map $f : Y \to X$ with a global element
$e' : 1 \to f^*(A)$. The result amounts to showing that $f^*$ factors via $q^*$ and a functor
$r^* : SX/A \to SY$ that preserves colimits and finite limits, and takes e to
e′. Clearly $r^*(v)$ has to be (up to isomorphism) defined as the pullback of
$f^*(v)$ against e′, after which it remains only to check that it has the required
properties. QED


Note a corollary to this. A map $X/B \to X/A$ over X is equivalent to
a global element of $(A \times B \to B)$ in SX/B, and this is just a morphism
$B \to A$ in SX. Hence sheaf morphisms are equivalent to maps between the
corresponding fibred spaces.

In the pseudo-pullback square in the proof of Proposition 8.55, the map $A^*p$
on the left maps (x, a) to x. The maps that arise in this way from sheaves over X
are called local homeomorphisms or étale maps (see Johnstone, 2002b, C3.3.4).
For each point x, the pseudo-pullback of $A^*p$ against x is in effect the stalk at
x. In fact, we have three equivalent categories to represent sheaves over X:
SX, Map(X, [O]) and the category of local homeomorphisms with codomain
X. In Joyal and Tierney, 1984 the local homeomorphisms are characterized in
a way analogous to the first condition of Proposition 8.49, using a topos notion
of open map.


5.3 Sites 


A canonical form of geometric theory is that deriving from a site. We give 
the definition from Johnstone, 2002a, A2.1.9. 


DEFINITION 8.56 Let C be a small category. A coverage J on C assigns to
each object A of C a collection J(A) of families $(f_i : A_i \to A \mid i \in I)$ of
morphisms targeted at A, subject to the condition that for each such family in
J(A), and for each morphism g : B → A, there is a family $(h_{i'} : B_{i'} \to B \mid
i' \in I')$ in J(B) such that each $g \circ h_{i'}$ factors via some $f_i$.

A category equipped with a coverage is called a site. 


The definition in Mac Lane and Moerdijk, 1992, III.2 Definition 1 is slightly 
different, as a category equipped with a Grothendieck topology. In this, the 
covering families are all required to be sieves, i.e. closed under precom- 
position. The difference is explained in Johnstone, 2002b, C2.1.8 (where a 
Grothendieck topology is called a Grothendieck coverage). Any coverage gen- 
erates a Grothendieck topology that is equivalent to it for its intended purposes. 


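DEFINITION 8.57 Let (C, J) be a site. Then the geometric theory CtsFlat(C, J)
of continuous flat functors over (C, J) has sorts $X_A$ and functions $u_f :
X_A \to X_B$ for the objects A and morphisms f : A → B of C, and axioms

\[
\begin{array}{ll}
(\forall x : X_A)\; u_{\mathrm{Id}_A}(x) = x & (A \in \mathrm{Ob}(\mathcal{C})) \\
(\forall x : X_A)\; u_g(u_f(x)) = u_{g \circ f}(x) & (f : A \to B,\ g : B \to C) \\
\bigvee_{A \in \mathrm{Ob}(\mathcal{C})} (\exists x : X_A)\, \top & \\
(\forall x : X_A,\, y : X_B)\; \bigvee_{C \in \mathrm{Ob}(\mathcal{C})} \bigvee_{f : C \to A} \bigvee_{g : C \to B} (\exists z : X_C)\,(x = u_f(z) \wedge y = u_g(z)) & \\
(\forall x : X_A)\,\bigl(u_f(x) = u_g(x) \to \bigvee_{C \in \mathrm{Ob}(\mathcal{C})} \bigvee \{ (\exists z : X_C)\; x = u_h(z) \mid h : C \to A,\ f \circ h = g \circ h \}\bigr) & (f, g : A \to B) \\
(\forall x : X_A)\; \bigvee_{i \in I} (\exists y : X_{A_i})\; x = u_{f_i}(y) & ((f_i : A_i \to A \mid i \in I) \text{ in } J(A))
\end{array}
\]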

Its models in a Grothendieck topos ℰ are the continuous filtering functors
(or continuous flat functors) from C to ℰ (Mac Lane and Moerdijk, 1992, VII
Sec. 7-9). The first two axiom schemas stipulate functoriality, the next three are
the flatness (or filtering) property and the final one is continuity. Note that
if C has all finite limits, then (Mac Lane and Moerdijk, 1992, VII.9 Corollary
3) the flat functors from C to a Grothendieck topos are exactly the finite limit
preserving functors.

Without the final axiom in Definition 8.57 we have the theory Flat(C) of flat 
functors over C. 


EXAMPLE 8.58 Any Cartesian theory T is equivalent to Flat(C) where C is 
the opposite of the category of finitely presented T-models. For a discussion 
of some non-Cartesian theories of the form Flat(C), as well as the construc- 
tive notion of “finite” used in “finitely presented” (stronger than Kuratowski 
finiteness) see Vickers, 2001. 


5.4 Sheaves as presheaves 


In this section we return to the main question left over from Sec. 4.4: what 
are the sheaves over a classifying topos? It is only when this has been answered 
that we can be sure classifying toposes exist. We outline the proof in stages: 
first, theories Flat(C); then CtsFlat(C, J); then geometric theories in general; 
and then geometric type theories. 


LEMMA 8.59 If C is a small category then the presheaf topos $\mathrm{Set}^{C^{op}}$ classifies
Flat(C).


Proof (Sketch. cf. Proposition 8.40.) For any Grothendieck topos X we want
a correspondence between flat functors F : C → SX and functors $\mathrm{Set}^{C^{op}} \to
SX$ preserving colimits and finite limits.

The Yoneda embedding $\mathcal{Y} : C \to \mathrm{Set}^{C^{op}}$ acts as a free cocompletion of
C (Mac Lane and Moerdijk, 1992, I.5 Corollary 4): any functor from C to a
cocomplete category factors uniquely (up to isomorphism) via $\mathcal{Y}$ and a colimit
preserving functor. This gives an equivalence between functors C → SX and
colimit preserving functors $\mathrm{Set}^{C^{op}} \to SX$.

However, $\mathrm{Set}^{C^{op}}$ is also a Grothendieck topos (Mac Lane and Moerdijk,
1992, I). The content of Mac Lane and Moerdijk, 1992, VII.7 Theorem 2 is then
that flatness of the functor C → SX is equivalent to finite limit preservation by
the colimit preserving functor $\mathrm{Set}^{C^{op}} \to SX$. (Note also that $\mathcal{Y}$ is flat.) QED


EXAMPLE 8.60 (See Example 8.58.) Let T be a Cartesian theory, and let C
be its category of finitely presented models so that T is equivalent to $\mathrm{Flat}(C^{op})$.
Then $S[T] \simeq \mathrm{Set}^{C}$. Thus, as a set-valued map on points (models of T), a sheaf
is determined by its action on the finitely presented models. This also follows
from the fact that every model is a filtered colimit of finitely presented models,
and maps preserve filtered colimits.


We now turn to theories CtsFlat(C, J). Let F : C → ℰ be a flat functor to
a Grothendieck topos. The continuity axiom says that for each covering family
$(f_i : A_i \to A)_{i \in I}$ in J(A),

\[
(\forall x : F(A))\; \bigvee_{i \in I} (\exists y : F(A_i))\; x = F(f_i)(y).
\]

Categorically, this says that the cotupled morphism $f = [F(f_i)]_{i \in I} : \sum_{i \in I}
F(A_i) \to F(A)$ is epi, in other words that its image is the whole of F(A).

To analyse this, we calculate what the image is in general. The content of the
result is roughly that of Mac Lane and Moerdijk, 1992, VII.7 Lemma 2. However,
we sketch a “geometric” proof that does not rely on the subobject classifier.


LEMMA 8.61 Let (C, J) be a site, and let F : C → SX be a flat functor to a
Grothendieck topos. Let $(f_i : A_i \to A)_{i \in I}$ be in J(A). Then the image of

\[
f = [F(f_i)]_{i \in I} : \sum_{i \in I} F(A_i) \to F(A)
\]

is the colimit of a diagram $F \circ \Delta$ as follows. Let D be the full subcategory of
the slice category C/A whose objects are morphisms g : C → A that factor
through some $f_i$. Δ is the obvious functor from D to C, taking (g : C → A)
to C.


Proof Let $\mu_i : F(A_i) \to \sum_i F(A_i)$ be the coproduct injection, and let $f =
m \circ q$ be the image factorization of f.

We define a cocone from $F \circ \Delta$ to Im f as follows. If g : C → A factors via
some $f_i$, then F(g) factors via f and hence uniquely via m, as $m \circ \nu_g$ (say).

Now suppose we have a cocone from $F \circ \Delta$ to some K, given by morphisms
$\nu'_g : F(C) \to K$ for each g : C → A in Ob(D). If we are to have a colimit
morphism $\alpha : \mathrm{Im}\, f \to K$, then it is determined uniquely by $\alpha \circ q$ (because q
is epi) and hence by the morphisms $\alpha \circ q \circ \mu_i = \alpha \circ \nu_{f_i} = \nu'_{f_i}$. This proves
uniqueness for α.

The morphisms $\nu'_{f_i}$ give us a morphism $\alpha' = [\nu'_{f_i}]_i : \sum_i F(A_i) \to K$, and
we should like α’ to factor as $\alpha \circ q$ for some $\alpha : \mathrm{Im}\, f \to K$. This will be as
required, since if $g = f_i \circ g' : C \to A$ then

\[
\nu'_g = \nu'_{f_i} \circ F(g') = \alpha' \circ \mu_i \circ F(g') = \alpha \circ q \circ \mu_i \circ F(g') = \alpha \circ \nu_{f_i} \circ F(g') = \alpha \circ \nu_g.
\]

To prove existence of α, we interpret logic in SX. Viewing Im f as a quotient
of $\sum_i F(A_i)$, it suffices to show that if f(a) = f(b) ($a, b \in \sum_i F(A_i)$) then
α’(a) = α’(b). Suppose $a = \mu_i(a')$, with $a' \in F(A_i)$, and similarly $b = \mu_j(b')$,
so $F(f_i)(a') = F(f_j)(b')$. By flatness of F we can find an object C in C,
morphisms $g : C \to A_i$ and $h : C \to A_j$ with $f_i \circ g = f_j \circ h$, and c ∈ F(C)
with a’ = F(g)(c) and b’ = F(h)(c). Then

\[
\begin{aligned}
\alpha'(a) &= \alpha' \circ \mu_i \circ F(g)(c) = \nu'_{f_i} \circ F(g)(c) = \nu'_{f_i \circ g}(c) \\
&= \nu'_{f_j \circ h}(c) = \nu'_{f_j} \circ F(h)(c) = \alpha' \circ \mu_j \circ F(h)(c) = \alpha'(b).
\end{aligned}
\]

QED


At this point we can introduce the notion of sheaf over a site. (In Mac Lane 
and Moerdijk, 1992, III.4 this is for a slightly different definition of site, but 
the difference is not of great significance here.) 


DEFINITION 8.62 Let (C, J) be a site, and let S be a presheaf over C. S is a
sheaf if it has the following pasting property.

Suppose $(f_i : A_i \to A)_{i \in I}$ is in J(A). Suppose for each i ∈ I we have
$x_i \in S(A_i)$, with the family $(x_i)_{i \in I}$ “matching” in the sense that if C is an
object of C and $g_i : C \to A_i$, $g_j : C \to A_j$ are morphisms with $f_i \circ g_i = f_j \circ g_j$
then $S(g_i)(x_i) = S(g_j)(x_j)$. Then there is a unique x ∈ S(A) such that
$x_i = S(f_i)(x)$ for all i.


LEMMA 8.63 Suppose F : C → SX is flat, and let f : X → [Flat(C)] be the
corresponding map. Then F is continuous iff for every object U of SX, the
presheaf $f_*(U)$ is a sheaf.


Proof Suppose $(f_i : A_i \to A)_{i \in I}$ is in J(A). By Yoneda’s Lemma, if S
is a presheaf then a family $(x_i)_{i \in I}$ of elements $x_i \in S(A_i)$ corresponds to a
family $(\xi_i)_{i \in I}$ of morphisms $\xi_i : \mathcal{Y}(A_i) \to S$. Let Δ : D → C be the diagram
described in Lemma 8.61. Then we find that the family $(x_i)_i$ is matching iff
the family $(\xi_i)_i$ extends (uniquely) to a cocone from $\mathcal{Y} \circ \Delta$ to S. The existence
of x is equivalent to the factorization of this cocone through $\mathcal{Y}(A)$. If S is of
the form $f_*(U)$, then the cocone of presheaves corresponds to a cocone in SX
from $f^* \circ \mathcal{Y} \circ \Delta = F \circ \Delta$ to U. By Lemma 8.61 we know F is continuous iff
for every covering $(f_i)$ we have that the colimit of $F \circ \Delta$ is F(A), i.e. for every
U, every cocone $F \circ \Delta \to U$ factors via $F(A) = f^* \circ \mathcal{Y}(A) \to U$ and this
gives us our $\mathcal{Y}(A) \to f_*(U)$ as required for finding x. Hence F is continuous
iff for every covering and for every U we can perform the pasting with $f_*(U)$.
But this just says that every $f_*(U)$ is a sheaf. QED


If we define Sh(C, J) to be the full subcategory of $\mathrm{Set}^{C^{op}}$ comprising the
sheaves, then we see that the maps X → [Flat(C)] corresponding to continuous
flat functors are the ones whose direct image part factors via Sh(C, J).


THEOREM 8.64 If (C, J) is a site then Sh(C, J) is a classifying topos for
CtsFlat(C, J).


Proof This is the content of Mac Lane and Moerdijk, 1992, VII.9 Corollary
2, which states that, in the conventional notation, there is an equivalence of
categories between Map(ℰ, Sh(C, J)) and the category of continuous filtering
functors C → ℰ (i.e. models of the site theory). In our notation we can thus
take Sh(C, J) as S[CtsFlat(C, J)]. In outline, the rest of the proof is as follows.

First, Sh(C, J) is indeed a topos. (This includes the fact that it is an elemen- 
tary topos. This is perhaps unexpected, since the argument from classifying 
toposes worked with the geometric structure.) 

Next, the inclusion $\mathrm{Sh}(C, J) \to \mathrm{Set}^{C^{op}}$ is the direct image part of a geometric
morphism. Proving the existence of the inverse image part, the “associated
sheaf functor” or sheavification, is of fundamental importance. If S is already
a sheaf, then it is its own sheavification.

After all that, proving that Sh(C, J) classifies flat continuous functors is more 
or less Lemma 8.63. QED 


We can also calculate the stalks explicitly. Let x be a global point, a continuous
flat functor from C to Set, and S a sheaf. The stalk $S \circ x$ can be calculated
in two stages (Mac Lane and Moerdijk, 1992, Sec. VII.5). First, let $U_0$ be the
disjoint union over all objects A of C of the products x(A) × S(A). Next, if
f : A → B is a morphism in C, and a ∈ x(A) and b ∈ S(B), we identify
(a, S(f)(b)) and (x(f)(a), b) in $U_0$ and generate an equivalence relation ∼
thereby. Then the stalk is $U_0/{\sim}$. This construction is geometric, and can be
reproduced for non-global points.
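
The two-stage calculation can be tried out on a very small case. The following is only a sketch under stated assumptions: the site is taken to be the poset of opens of a discrete two-point space, the sheaf is the sheaf of {0, 1}-valued functions, and the global point is an ordinary point p of the space, so that x(a) is a singleton when p lies in a and empty otherwise. The names `OPENS`, `sections`, `restrict` and `stalk` are hypothetical and serve only this illustration.

```python
from itertools import combinations, product

def subsets(s):
    """All subsets of the finite set s, as frozensets."""
    items = sorted(s)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

SPACE = frozenset({0, 1})
OPENS = subsets(SPACE)          # discrete topology; the site is this poset

def sections(a):
    """S(a): all functions a -> {0, 1}, each stored as a frozenset of pairs."""
    pts = sorted(a)
    return [frozenset(zip(pts, vals))
            for vals in product([0, 1], repeat=len(pts))]

def restrict(s, a):
    """S(a <= b): restrict a section over b to the smaller open a."""
    return frozenset((p, v) for (p, v) in s if p in a)

def stalk(point):
    # Stage 1: U_0 is the disjoint union of the products x(a) x S(a).
    # Since x(a) is a singleton or empty, U_0 is the set of pairs (a, s)
    # with the point in a and s in S(a).
    u0 = [(a, s) for a in OPENS if point in a for s in sections(a)]
    parent = {e: e for e in u0}

    def find(e):
        while parent[e] != e:
            e = parent[e]
        return e

    # Stage 2: identify (a, S(f)(s)) with (b, s) for each morphism a <= b.
    for a in OPENS:
        for b in OPENS:
            if point in a and a <= b:
                for s in sections(b):
                    r1, r2 = find((a, restrict(s, a))), find((b, s))
                    parent[r1] = r2

    classes = {}
    for e in u0:
        classes.setdefault(find(e), []).append(e)
    return list(classes.values())
```

In this example `stalk(0)` has two equivalence classes, one for each possible value of a function at the point 0, which is what one expects of the germs at a point of a discrete space.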


THEOREM 8.65 Every geometric theory is equivalent to a site theory
CtsFlat(C, J), and hence has a classifying topos.


Proof Let T be a geometric theory over signature Σ. By Johnstone, 2002b,
Lemma D1.3.8, every geometric formula in context over Σ is logically equivalent
to one of the form $\bigvee_i (\exists \vec{y}_i)\, \phi_i$ where each $\phi_i$ is a Horn formula (a conjunction
of equations and predicate symbols applied to terms). It follows that each axiom
in T is equivalent to a set of axioms of the form $\psi \vdash_{\vec{x}} \bigvee_i (\exists \vec{y}_i)\, \phi_i$. Moreover,
by replacing $\phi_i$ by $\psi \wedge \phi_i$ and using the distributivity and Frobenius rules, we
may assume that $\phi_i \vdash_{\vec{x}, \vec{y}_i} \psi$. From Σ can be constructed (Johnstone, 2002b,
D1.4) a syntactic category C, Cartesian (i.e. with all finite limits), such that in
any Cartesian category D we have that interpretations of Σ in D are equivalent
to Cartesian (finite limit preserving) functors from C to D; and recall that because
C is Cartesian, flat functors from C to a topos are the same as Cartesian
functors. The objects of C are the Horn formulae in context, modulo renaming
of variables, and the morphisms are the formulae that are “provably the graphs
of functions”, modulo logical equivalence. Now suppose $\psi \vdash_{\vec{x}} \bigvee_i (\exists \vec{y}_i)\, \phi_i$ is
one of the axioms in T. In C we have diagrams

\[
\begin{array}{ccc}
(\vec{x}, \vec{y}_i \,.\, \phi_i) & \longrightarrow & (\vec{x}, \vec{y}_i \,.\, \top) \\
\downarrow & & \downarrow \\
(\vec{x} \,.\, \psi) & \longrightarrow & (\vec{x} \,.\, \top)
\end{array}
\]

where the right-hand arrow is the product projection, and the left-hand arrow
follows from our assumption that $\phi_i \vdash_{\vec{x}, \vec{y}_i} \psi$. We take those left-hand arrows,
as i varies, as covering $(\vec{x} \,.\, \psi)$, and use these covers to generate a coverage J of
C. Models of T are equivalent to models of CtsFlat(C, J). QED


We should now like a result of the form “every geometric type theory has 
a classifying topos”. This is difficult, since our notion of geometric type the- 
ory is only informal. The following argument from Johnstone, 2002a, B4.2 
uses a particular restricted formalization that nonetheless seems ample to cover 
examples that arise in practice. 


THEOREM 8.66 Normally, geometric type theories have classifying toposes. 
(The proof is not completely general.) 


Proof Johnstone, 2002a, Definition B4.2.7(c) gives a definition of geometric
theory that includes features of geometric type theory. According to that definition,
a geometric theory T is built up in a finite sequence $T_0, \ldots, T_n = T$. $T_0$
declares finitely many sorts, and each subsequent step is of one of two forms. A
simple functional extension $T_{i+1}$ of $T_i$ declares a function symbol $f : F_1 \to F_2$,
where $F_1$ and $F_2$ are geometric types. A simple geometric quotient $T_{i+1}$ of $T_i$
is based on a morphism $u : F_1 \to F_2$ of geometric types. $T_{i+1}$ adds axioms

\[
\begin{array}{l}
u(x) = u(x') \vdash_{x, x' : F_1} x = x' \\
\top \vdash_{y : F_2} (\exists x : F_1)\; y = u(x)
\end{array}
\]

and thus forces u to be an isomorphism.

In each case, if $T_i$ has a classifying topos, then we can identify the geometric
types ($F_1$, $F_2$) and morphisms (u) with objects and morphisms of $S[T_i]$, and
one can construct a classifying topos for $T_{i+1}$. Hence every geometric theory
by that definition has a classifying topos.

These two steps provide a completely general way of introducing function
symbols, and also axioms $\phi \vdash_{\vec{x}} \psi$, for satisfaction of the axiom is equivalent
to saying that the inclusion morphism $\psi \wedge \phi \to \phi$ is an isomorphism. As for
predicate symbols P ⊆ A, these can be introduced with a sort P’ and function
$i_P : P' \to A$ which must then be constrained to be monic (to give a subobject
corresponding to P). This is done by an axiom

\[
i_P(x) = i_P(x') \vdash_{x, x' : P'} x = x'.
\]

Hence all the ingredients of geometric type theory can be introduced by
these steps. QED


Since only finitely many steps are allowed, it would seem that the geomet- 
ric type theory according to that definition should be finitely presented—only 
finitely many symbols and axioms. However, in practice one can get round that 
by internalizing the indexing set of an infinite family of symbols or axioms. For 
example, consider modules over a ring R. The algebraic theory of these would 
normally be presented with a (possibly infinite) R-indexed family of unary 
operators $o_r$ for scalar multiplication. But the set R is a constant geometric
type (a coproduct of an R-indexed family of copies of 1) over any theory, and
modules M can equivalently be presented using an operator o : R × M → M.
(Exercise: formulate this using simple functional extensions and simple geo- 
metric quotients.) 


THEOREM 8.67 Let E be a category. Then the following are equivalent.

1 E is a Grothendieck topos (as defined in Definition 8.31).
2 E is equivalent to Sh(C, J) for some site (C, J).
3 E is a classifying topos for some geometric theory.


Proof (1) ⇔ (2) is known as Giraud’s Theorem. See Johnstone, 2002b, C2.2.8,
where condition (vii) is our condition (1). For an alternative version, see
Mac Lane and Moerdijk, 1992, Appendix, Theorem 1. (2) is usually taken
as the definition of Grothendieck topos.

(2) ⇔ (3): Theorems 8.64 and 8.65. QED


5.5 Sheaves for locales 


We now turn to the question of how continuous maps between spaces and 
locales relate to geometric morphisms between toposes. 


PROPOSITION 8.68 Let X be a Grothendieck topos. Then $\mathrm{Sub}_{SX}(1)$ is a
frame.


Proof Johnstone, 2002b, C1.4.7. In fact $\mathrm{Sub}_{SX}(S)$ is a frame for any
sheaf S. QED


Now for any propositional geometric theory T, topos models in SX are
equivalent to frame models in $\mathrm{Sub}_{SX}(1)$. It follows that T and $\mathrm{Th}_{\Omega[T]}$
(Definition 8.9) are equivalent with respect to topos models.

Let A be a frame. As a poset it can also be considered a category, and we
can define a coverage J on it as follows. Let a ∈ A, and let $\{b_i \mid i \in I\} \subseteq \{b \mid
b \le a\}$. Then $\{b_i \mid i \in I\} \in J(a)$ if $a \le \bigvee_{i \in I} b_i$. (Exercise: this is indeed a
coverage in the sense of Definition 8.56.)
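
For a finite frame the exercise can at least be checked by brute force. The sketch below assumes a particular small frame, the powerset of {0, 1, 2} ordered by inclusion, and tests the stability condition of Definition 8.56 using the family of meets b ∧ b_i as the covering family of b. It is an illustration of the condition, not a proof; the names `FRAME`, `covers`, `is_stable` and `coverage_is_stable` are hypothetical.

```python
from itertools import chain, combinations

def subsets(s):
    """All subsets of the finite set s, as frozensets."""
    items = sorted(s)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

# Assumed example frame: all subsets of {0, 1, 2}, ordered by inclusion.
FRAME = subsets({0, 1, 2})

def join(family):
    """Join in the frame: the union of a (possibly empty) family."""
    return frozenset(chain.from_iterable(family))

def covers(family, a):
    """family is in J(a): every member is below a, and a <= join(family)."""
    return all(b <= a for b in family) and a <= join(family)

def is_stable(a, family):
    """Check the condition of Definition 8.56 for one covering family of a."""
    for b in FRAME:
        if not b <= a:          # a morphism b -> a exists only when b <= a
            continue
        pulled_back = [b & bi for bi in family]   # the meets of b with each b_i
        # The pulled-back family must cover b ...
        if not covers(pulled_back, b):
            return False
        # ... and each member must factor via (lie below) some b_i.
        if not all(any(c <= bi for bi in family) for c in pulled_back):
            return False
    return True

def coverage_is_stable():
    for a in FRAME:
        below = subsets(a)                 # the elements b <= a
        for r in range(len(below) + 1):
            for family in combinations(below, r):
                if covers(family, a) and not is_stable(a, family):
                    return False
    return True
```

The check succeeds because b = b ∧ a ≤ b ∧ ⋁ b_i = ⋁ (b ∧ b_i), which is exactly where frame distributivity enters.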


PROPOSITION 8.69 The theories CtsFlat(A, J) and $\mathrm{Th}_A$ are equivalent.


Proof As a category, A is Cartesian (products are meets, and equalizers are trivial).
Hence, flatness of a functor is equivalent to preservation of finite limits.
The top element of A must map to the terminal object 1, and all the other elements
of A to subobjects of 1 (because if a functor preserves finite limits then it
preserves monics, and all the morphisms in A are monic). Hence a flat functor
over A is equivalent to a function A → Sub(1) that preserves finite meets.
Continuity then says that the function preserves arbitrary joins too. QED
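
As a concrete illustration, the models in Set can be enumerated for a small frame. This is a sketch under two assumptions: A is the powerset of {0, 1, 2}, and Set is treated classically, so that Sub(1) is the two-element frame {False, True}. In a finite frame, preserving the top element and binary meets gives all finite meets, and preserving the bottom element and binary joins gives all joins. The function names are hypothetical.

```python
from itertools import combinations, product

def subsets(s):
    """All subsets of the finite set s, as frozensets."""
    items = sorted(s)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

FRAME = subsets({0, 1, 2})
TOP, BOTTOM = frozenset({0, 1, 2}), frozenset()

def is_frame_hom(h):
    """h maps each element of FRAME to True/False, a subobject of 1 in Set."""
    if not h[TOP] or h[BOTTOM]:
        return False
    for a in FRAME:
        for b in FRAME:
            if h[a & b] != (h[a] and h[b]):   # binary meets
                return False
            if h[a | b] != (h[a] or h[b]):    # binary joins
                return False
    return True

def models():
    """All meet- and join-preserving functions FRAME -> {False, True}."""
    found = []
    for values in product([False, True], repeat=len(FRAME)):
        h = dict(zip(FRAME, values))
        if is_frame_hom(h):
            found.append(h)
    return found

def point_of(h):
    """The unique point p with h(a) true exactly when p is in a, if any."""
    candidates = [p for p in TOP if all(h[a] == (p in a) for a in FRAME)]
    return candidates[0] if len(candidates) == 1 else None
```

For this frame there are exactly three such functions, one for each point of the underlying three-point set.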


If A is a frame, then for a presheaf $S : A^{op} \to \mathrm{Set}$, if a ≤ b in A and
x ∈ S(b), then we write $x|_a$ for S(a ≤ b)(x), the restriction of x to a.


THEOREM 8.70 Let A be a frame, and let X be the topos $[\mathrm{Th}_A]$.

1 A sheaf over X is equivalent to a sheaf over the locale for A (Definition 8.51,
replacing ΩX by A, and ∩ and ⋃ by ∧ and ⋁).
2 There is an order isomorphism between $\mathrm{Sub}_{SX}(1)$, the set of subsheaves
of 1 over X, and A.


Proof (1) is calculated directly from Definition 8.62 using Theorem 8.64
and Proposition 8.69. For (2), the terminal sheaf 1 is defined by 1(a) = 1
(i.e. some singleton) for every a ∈ A. This can be calculated directly, but
it also follows from the fact that the embedding $SX \to \mathrm{Set}^{A^{op}}$ is a right
adjoint and hence preserves all limits, and finite limits in $\mathrm{Set}^{A^{op}}$ are calculated
argumentwise. Now the subsheaves of 1 are the sheaves S for which every
S(a) is a subsingleton.

For every b ∈ A we have a subsheaf $S_b$ of 1 defined by $S_b(a) = 1$ iff a ≤ b.
(In fact these make up the generic point of X in SX.) Clearly if $S_b = S_{b'}$ then
b ≤ b’ ≤ b, so b = b’. On the other hand, suppose S is a subsheaf of 1 and let
b be the join of those a ∈ A for which S(a) is inhabited. By pasting we find that
S(b) is inhabited, and it follows that $S = S_b$. QED
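
Part (2) can be checked exhaustively for a small frame. The sketch below assumes A is the powerset of {0, 1, 2} and works classically, so that a subpresheaf of 1 is described by its support, the set of a for which S(a) is inhabited. Restriction forces the support to be down-closed, and for subsingleton values the pasting property reduces to closure of the support under joins; in a finite frame it is enough to test the empty join and binary joins. All names are hypothetical.

```python
from itertools import combinations

def subsets(s):
    """All subsets of the finite set s, as frozensets."""
    items = sorted(s)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

FRAME = subsets({0, 1, 2})
BOTTOM = frozenset()

def families(elements):
    """All subfamilies of a list of frame elements."""
    return [list(c) for r in range(len(elements) + 1)
            for c in combinations(elements, r)]

def is_subpresheaf_of_one(support):
    """Restriction maps exist, so the support must be down-closed."""
    return all(b in support for a in support for b in FRAME if b <= a)

def is_sheaf(support):
    """Pasting for subsingletons: the support is closed under joins."""
    if BOTTOM not in support:       # the empty family covers the bottom element
        return False
    return all((a | b) in support for a in support for b in support)

def subsheaves_of_one():
    result = []
    for support in map(frozenset, families(FRAME)):
        if is_subpresheaf_of_one(support) and is_sheaf(support):
            result.append(support)
    return result

def down_set(b):
    """The support of the subsheaf S_b: all a with a <= b."""
    return frozenset(a for a in FRAME if a <= b)
```

The supports found in this way are exactly the eight principal down-sets, one for each element b of the frame, in agreement with the order isomorphism of the theorem.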


It follows that for any propositional geometric theory T we have
$\mathrm{Sub}_{S[T]}(1) \cong \Omega[T]$.


THEOREM 8.71 Let T and T’ be propositional theories. Then there is an
equivalence between

1 locale maps [T] → [T’], and
2 topos maps [T] → [T’].


Proof A topos map [T] → [T’] is equivalent to a model of $\mathrm{Th}_{\Omega[T']}$ in S[T],
i.e. a frame homomorphism $\Omega[T'] \to \mathrm{Sub}_{S[T]}(1) \cong \Omega[T]$. QED


Referring back to Proposition 8.16, we see that for sober spaces, continu- 
ous maps are equivalent to geometric morphisms between the corresponding 
toposes. We have now justified the key fact that underlies this chapter: toposes 
generalize topological spaces (at least in the sober case), and geometric mor- 
phisms are the continuous maps at topos generality. 

We now know that locales and localic toposes are equivalent. We write X
without any bias either way, and refer concretely to the frame as ΩX and to the
category of sheaves as SX. More generally, for any Grothendieck topos X we
can write ΩX for the frame $\mathrm{Sub}_{SX}(1)$ without creating any ambiguity in the
localic case. We call its elements opens of X, equivalent to maps X → 𝕊.


DEFINITION 8.72 Let X be a Grothendieck topos. Then the localic reflection
of X is the locale Loc(X) whose frame is ΩX.


PROPOSITION 8.73 Let X be a Grothendieck topos. Then there is a map
a : X → Loc(X) such that any map f : X → Y with Y a locale factors
uniquely (up to isomorphism) via a.


Proof This is immediate from the fact that if Y is a locale, then geometric
morphisms from X to Y are equivalent to frame homomorphisms from ΩY to
$\mathrm{Sub}_{SX}(1)$. QED


If x ⇒ y is a specialization morphism between points of X, then
a(x) ⊑ a(y). Hence x and y are identified by a if there are specialization
morphisms going in both directions between them. Thus the localic reflection
can lose a lot of structure.


EXAMPLE 8.74 Consider the object classifier [O]. Classically, if A and B
are two sets then there is a function from A to B unless A is inhabited and B
is empty. Hence we might expect Loc([O]) to have two points for two classes
of sets: inhabited, and empty. We can calculate that in fact Loc([O]) ≅ 𝕊.
The theory O is algebraic, and its category of finitely presented algebras is
the category Fin of finite sets. (Constructively, this is “finite” in a strong
sense, meaning isomorphic to {1, ..., n} for some natural number n.) Hence
$S[O] \simeq \mathrm{Set}^{\mathrm{Fin}}$. A sheaf S : Fin → Set is a subsheaf of 1 (an open) iff
every S(A) is a subsingleton, and we find it is determined up to isomorphism
by S(0) ⊆ S(1) ⊆ 1. It can be calculated that the frame of these is isomorphic
to Ω𝕊.

Thinking of [O] as a generalized space, we now see how far it is from being
an ungeneralized space. Its opens are simply too few to characterize the
generalized topological structure and we have to use sheaves instead.
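
The calculation of the opens in Example 8.74 can be sketched as follows. This is a classical sketch only, writing [n] for the finite set with n elements (a notation introduced just for this note). For n ≥ 1 there are functions [1] → [n] and [n] → [1], and there is a function [0] → [n] for every n. Since S is a covariant functor with subsingleton values,

```latex
\[
S([0]) \text{ inhabited} \;\Longrightarrow\; S([1]) \text{ inhabited},
\qquad
S([n]) \text{ inhabited} \;\Longleftrightarrow\; S([1]) \text{ inhabited} \quad (n \ge 1),
\]
\[
\bigl(S([0]),\, S([1])\bigr) \in \bigl\{ (\emptyset, \emptyset),\ (\emptyset, 1),\ (1, 1) \bigr\}.
\]
```

Classically this gives a three-element chain of opens, which matches the three opens of the Sierpiński locale.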


6. Summary of toposes


Let us summarize the key points of this story. 


1 The usual semantics of first-order logic provides meaning in sets: sorts are
sets, function symbols (and terms generally) are functions, and predicates
(and formulae) are subsets of products. This tells us, for each theory, what
are the models of that theory.

2 Categorical logic uses the same idea to provide meaning in more general
categories: sorts are objects, function symbols and terms are morphisms,
and predicates and formulae are subobjects of products. It tells us what
the models of a theory are in more general categories.

3 The logic has to be matched to the categorical structure. The ability
to interpret logical connectives, and the validity of logical axioms in an
interpretation, both depend on the structure and properties of the category.

4 It is natural to form axioms in two stages as $(\forall \vec{x})(\phi \to \psi)$. Then φ and ψ
are formulae, using connectives appropriate to the categorical structure,
and the form of the axioms compares two subobjects (for φ and ψ) and
uses minimal categorical structure.

5 The logic we are particularly interested in, geometric logic, is interpreted
in Grothendieck toposes. However, it is only a fragment of what can be
interpreted there. Its formulae use ∧, ⋁, = and ∃.

6 It is related to geometric morphisms between toposes, in that the geometric
logic is preserved by the inverse image functors of geometric
morphisms.

7 To emphasize the difference between spatial and logical aspects of toposes,
we use a non-standard notation with simple symbols to denote a topos
“as generalized topological space”, and we apply an S to denote the
same topos “as generalized universe of sets” (in other words, the category
discussed above where the logic is interpreted). Thus a geometric
morphism f : X → Y comprises two functors f* : SY → SX and
$f_*$ : SX → SY.

8 There are type constructors that can be considered to be within the scope
of geometric logic. These include free algebra constructions. Although
we have not defined the precise range of these type constructors, we have
introduced the phrase geometric type theory for theories that use those
finitary constructors we know to be of this kind. They are equivalent in
expressive power to geometric theories.

9 Coherent theories and coherent type theories are similar to the geometric
versions but do not use infinitary disjunctions. Coherent type theories are
intermediate in expressive power between coherent theories and geometric
theories. It is found in practice that once the finitary type constructors
are brought in, the infinitary disjunctions of geometric logic are often not
needed.

10 We define a (generalized) point of a topos X to be a geometric morphism
whose codomain is X. It is a global point if its domain (its stage of
definition) is the topos 1 where S1 = Set.

11 Each geometric type theory T has a classifying topos [T] whose points
at stage Y are the models of T in SY. S[T] is generated by a “generic”
model of the theory and is an analogue of the Lindenbaum algebra for a
predicate geometric theory.

12 The Grothendieck toposes are the classifying toposes for geometric type
theories. They can be constructed as toposes of sheaves over sites.

13 A geometric morphism from X to Y transforms, by composition, points
of X (at any stage of definition) to points of Y.

14 By the definition of classifying topos, we define a geometric morphism
from $[T_1]$ to $[T_2]$ by constructing a model of $T_2$ in $S[T_1]$. Since $S[T_1]$ is
generated by a generic model of $T_1$, this appears formally as declaring,
“Let M be a model of $T_1$,” and then constructing a model of $T_2$ out of
it. To be valid in $S[T_1]$, the construction must be intuitionistically valid;
and to be uniform over all stages of definition it must be geometric.

15 Thus we think of Grothendieck toposes as generalized spaces of models,
and geometric morphisms as maps between those spaces.


For some examples of the techniques in use, see Vickers, 1999, Vickers, 2001
and Vickers, 2004. In particular, Vickers, 2001 discusses toposes X for which
SX is a presheaf category, with reference to examples such as the simplicial
sets (Mac Lane and Moerdijk, 1992, Sec. VIII.8).


7. Other directions


We have focused on the relationship between geometric logic and the categor- 
ical structure of Grothendieck toposes, to give an introduction to how toposes 
can be understood as generalized topological spaces. However, the connections 
between logic and toposes go far beyond this and most of the standard texts 
describe a range of broader applications. We now briefly mention just a few 
other aspects of topos theory that are relevant to the logic of space. 


7.1 Fibred locales


We have already seen how a map f : X → Y can be understood as a
generalized point of Y, continuously parametrized by a variable point of X.
In terms of the non-classical mathematics of sheaves, this is a model in SX of
whatever theory Y classifies.

However, we can also look at the parametrization the other way round. For
each point y of Y, we get a fibre $X_y = f^{-1}(\{y\})$; indeed, this still makes
sense for toposes, by taking the pseudo-pullback of f along y. Hence this is a
space “parametrized by a variable point of Y”. We have seen one example of
this already, in sheaves and local homeomorphisms. There is a particular “localic”
kind of map f between toposes, essentially meaning that X is presented
by no new sorts relative to Y (and in particular any map between locales is
localic). This gives a notion of “fibred locale” over Y, and it turns out that this
is equivalent to doing locale theory constructively in SY.

Joyal and Tierney, 1984 give a straightforward approach to this using frames 
and we shall sketch that. (Vickers, 2004 gives a more geometric account.) The 
notion of frame (and frame homomorphism) can be defined in any elementary 
topos. However, the theory is not finitary algebraic and makes essential use 
of the elementary topos structure: to define arbitrary joins on A requires a 
morphism from the powerobject P(A) to A. 

Frame structure is preserved by direct image functors $f_*$ (though not by
f*), and the subobject classifier Ω is always a frame. Hence for any map
f : X → Y, $f_*(\Omega_X)$ is a frame in SY. On the other hand, given a frame A in
SY, we can replicate the construction of the category of sheaves to get a localic
map p : Z → Y such that $p_*(\Omega_Z) = A$. In fact we find a duality between
frames in SY and fibred locales over Y.


EXAMPLE 8.75 Let S be a sheaf over a topos X, and let f : X/S → X be
the map of Proposition 8.55. One can calculate that the subobject classifier in
SX/S is $S \times \Omega_X \to S$ and its image under $f_*$ is P(S). Relative to X, it is
therefore the discrete locale (i.e. all subsets open) corresponding to S.


7.2 Powerlocales 


Powerlocales are the localic analogue of hyperspaces, spaces whose points 
are subspaces of other spaces. If X is a locale, then there are various kinds 
of powerlocales whose points are different kinds of sublocales (the localic 
analogue of subspace) of X. 

In some ways the starting point is the Vietoris powerlocale VX, which
bears a direct relationship to the Vietoris hyperspace and was first studied in
Johnstone, 1985. In computer science an analogous “Plotkin powerdomain” has
been used to give semantics for non-deterministic programs, that is, programs
for which the result is in some sense a range of points. It was noticed (Smyth,
1978) that its topology is generated by two coarser topologies that give two
powerdomains that are interesting in their own right, and these were transferred
(Robinson, 1986) to locales to give the upper and lower powerlocales $P_U X$
and $P_L X$. Computer science applications in localic form have appeared in
Abramsky, 1991a and Abramsky, 1991b. The three principal powerlocales
(Vietoris, upper, lower) are summarized in Vickers, 1997. Their relationship
with the predicative mathematics of formal topology is discussed in Vickers,
2006 and Vickers, 2005. More recently (Johnstone and Vickers, 1991, Vickers,
2004, Vickers and Townsend, 2004) it has been noticed that both the upper and
lower powerlocales embed in a larger double powerlocale, which can be got as
either $P_U P_L X$ or $P_L P_U X$ (they are homeomorphic).

Each powerlocale has a good logical content, long understood in computer 
science. Given a locale X, each powerlocale embodies a logical theory whose 
models are certain kinds of sublocales of X. A sublocale is in effect a theory 
got by adding extra axioms to that for X, thus specifying a part of the class of 
models of X. Some topological properties of X, compactness being a good 
example, can be discussed in terms of points of the powerlocales (Vickers, 
1995, Vickers, 2006). 

The logical approach relies on the idea that, given a logic of points, we get
a “logic of parts”, reminiscent of modal logic. For each property U of points,
an open of the original locale, we get two properties of parts: □U says that the
part is wholly inside U, while ◇U says that the part has at least one point in U
(i.e. it meets U). Suitable axioms for the properties □U are that □ preserves
finite meets and also directed joins; this latter turns out to be necessary for
good results, and imposes a compactness condition on the parts. From these
we get the upper powerlocale. A suitable property for ◇ is that it preserves all
joins, and from that we get the lower powerlocale. Taking the properties □U
and ◇U together, we need extra axioms to show their interaction:

\[
\Box U \wedge \Diamond V \Rightarrow \Diamond (U \wedge V),
\qquad
\Box (U \vee V) \Rightarrow \Box U \vee \Diamond V.
\]

From these we get the Vietoris powerlocale.
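
The intended reading of these axioms can be tested on a toy model. The sketch below assumes a discrete three-point space in which every subset is both an open and a "part", and interprets the first modality as "the part lies inside U" and the second as "the part meets U". It checks the finite-meet, join and interaction axioms by brute force; preservation of directed joins is automatic in a finite example and is not tested separately. The names `box`, `diamond` and `axioms_hold` are hypothetical.

```python
from itertools import combinations

def subsets(s):
    """All subsets of the finite set s, as frozensets."""
    items = sorted(s)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

SPACE = frozenset({0, 1, 2})
OPENS = subsets(SPACE)   # assumed example: a discrete three-point space
PARTS = subsets(SPACE)   # here every subset counts as a "part"

def box(k, u):
    """The part k is wholly inside u."""
    return k <= u

def diamond(k, u):
    """The part k meets u."""
    return bool(k & u)

def axioms_hold():
    for k in PARTS:
        # box preserves the top element; diamond preserves the bottom element
        if not box(k, SPACE) or diamond(k, frozenset()):
            return False
        for u in OPENS:
            for v in OPENS:
                if box(k, u & v) != (box(k, u) and box(k, v)):
                    return False    # box preserves binary meets
                if diamond(k, u | v) != (diamond(k, u) or diamond(k, v)):
                    return False    # diamond preserves binary joins
                if box(k, u) and diamond(k, v) and not diamond(k, u & v):
                    return False    # first interaction axiom
                if box(k, u | v) and not (box(k, u) or diamond(k, v)):
                    return False    # second interaction axiom
    return True
```

Calling `axioms_hold()` confirms that this set-theoretic reading satisfies all the listed axioms in the toy model.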

At that first stage, the powerlocales are defined directly in terms of the frames. 
However, one can also investigate them as theory constructions. That is to say, 
if the original space (the “logic of points”) is given as a theory rather than 
as a frame, we show how to obtain theories of the powerlocales. The proofs
use “coverage theorems”, results that transform a presentation of the frame
by generators and relations into a presentation of the same structure but by
generators and relations with respect to different algebraic operators.

7.3 Modal logic


One direction that might particularly interest readers of this book is the 
connection with modal logic. The pointers that follow here were supplied by 
the Second Reader of this chapter. Classical as well as non-classical modal- 
ities have been studied along topos-theoretic lines by Reyes with others: see 
Lavendhomme et al., 1989; Reyes, 1991; Makkai and Reyes, 1995; Reyes and 
Zolfaghari, 1991; Reyes and Zolfaghari, 1996. Categorical semantics for super- 
intuitionistic and modal predicate logics were developed by Ghilardi (Ghilardi, 
1989; Ghilardi, 1991; Ghilardi, 1992) and Shehtman and Skvortsov (Sheht- 
man and Skvortsov, 1990; Skvortsov and Shehtman, 1993; Skvortsov, 1996; 
Skvortsov, 2003); see also Suzuki, 1990; Suzuki, 1993; Isoda, 1997; Nagaoka 
and Isoda, 1997; Shirasu, 1998. A modal intuitionistic calculus of nuclei was 
developed by Goldblatt (Goldblatt, 1981; Goldblatt, 1979). 


8. Conclusions 


It seems obvious, even trite, that a logic of finite conjunction and arbitrary 
disjunction might be related to the finite intersections and arbitrary unions of 
open sets in topology. Locale theory shows how propositional geometric theo- 
ries can be studied topologically. Nonetheless, geometric logic is very peculiar 
from the perspective of traditional logic. Its incompleteness seems a grave 
disadvantage, while its type-theoretic content in a first-order logic comes as a 
surprise. 

Our basic message is that in a constructive geometric mathematics, topology 
appears as an emergent feature: the logical theories describe classes of models 
with an intrinsic topology (in Grothendieck’s generalized sense, using sheaves 
when there are not enough opens), and mathematical constructions have an 
intrinsic continuity. 

Paradoxically, the constructivity provides the way around the incomplete- 
ness. Normally one thinks of constructivity as the enemy of completeness, 
because so many completeness proofs are classical. But by allowing for con- 
structive mathematics one gains access to a more complete range of models of 
each geometric theory. Amongst the Grothendieck toposes each theory has its 
classifying topos, equipped with the generic model. It serves as “generalized
Lindenbaum algebra”, but can also be thought of as “the space of models”.
Geometric morphisms are logic-preserving functors between the toposes, but 
can also be used (in the reverse direction) as continuous maps of models, at any 
stage. This is without reference to the concrete class of standard models, of 
which there might anyway be insufficient because of the incompleteness. The 
propositional fragment can alternatively be treated using locales (and frames 
as Lindenbaum algebras), but the two treatments are equivalent. In the spatial
case the geometric morphisms recover the known notion of continuous map
between spaces (modulo issues of sobriety). 

That broad story underlies much of topos theory, though there are also many 
deep non-geometric uses of Grothendieck toposes. 

The type-theoretic content is an unfamiliar development in first-order logic.
In a sense it is superfluous, since it does not essentially extend the scope of 
geometric logic. Nonetheless it makes the logic more convenient and in par- 
ticular it can be used to eliminate infinitary disjunctions in favour of finitary 
constructions. 

Combining coherent logic with some of the geometric type constructors, 
we get a coherent type theory. This must be less expressive than geometric 
logic, yet it is already enough to capture important topological examples such 
as the real line. An exciting thought is that this may provide an example of 
topology emerging from a finitary type theory, with finite coproducts and the 
inductive construction of free models. Such a coherent type theory would be 
better described in its own terms, with a corresponding class of categories to 
interpret it. A promising candidate class for these categories is the arithmetic 
universes of Joyal. By contrast with Grothendieck toposes, these categories do 
not automatically have function spaces or subobject classifiers. This is going to 
require a much more careful syntactic formulation of the coherent type theory, 
probably including aspects of dependent type theory. Some preliminary results 
have been found in Maietti, 2003.


1. Introduction


As follows from the title of this chapter, our primary aim is to analyse possible 
solutions to the equation


(9.1)    $\text{Spatial logic} + \text{Temporal logic} = x$

where the items on the left-hand side are some standard spatial and tempo- 
ral logics, and + is some ‘operator’ combining these two logics into a single 
one. The question we are concerned with is how the computational complexity 
and the expressive power of the component logics are related to the complex- 
ity and expressiveness of the resulting spatio-temporal logic x under various 
combination operators +.

To convey the flavour of the problems we are facing when attempting to
answer this question, let us consider two standard spatial and temporal logics 
and try to combine them. 

Recall from Ch. 5 of this Handbook that one of the basic and natural logics 
for reasoning about space is the ‘modal’ logic $S4_u$ equipped with the Boolean
operators over subsets of a topological space and the ‘modal’ operators I and 
C interpreted as the topological interior and closure, respectively. In this lan- 
guage we can say, for example, that two spatial objects X and Y are externally 
connected, EC(X, Y) in symbols, in the sense that X and Y share some points 
but none of them belongs to the interior of X or Y. This can be expressed, e.g., 
by means of the following constraints: 


$X \cap Y \neq \emptyset$ and $IX \cap IY = \emptyset$.


Reasoning in $S4_u$ is perfectly well understood; it is known to be PSPACE-complete,
and various reasonably effective reasoning systems are available.
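
The two constraints defining EC can be evaluated mechanically on a small finite space. The sketch below is illustrative only: it assumes a three-point Aleksandrov space given by a quasi-order (a kind of space defined in Sec. 3.1), and the names `SUCC`, `interior` and `externally_connected` are hypothetical.

```python
# Assumed quasi-order on {a, b, c}: b sees a, b and c; a and c see only themselves.
# SUCC[p] is the set of points accessible from p (including p itself).
SUCC = {"a": {"a"}, "b": {"a", "b", "c"}, "c": {"c"}}

def interior(x, successors):
    """Interior of x: the points of x all of whose successors lie in x."""
    return {p for p in x if successors[p] <= x}

def externally_connected(x, y, successors):
    """EC(X, Y): X and Y share a point, but their interiors are disjoint."""
    shares_point = bool(x & y)
    interiors_disjoint = not (interior(x, successors) & interior(y, successors))
    return shares_point and interiors_disjoint

X = {"a", "b"}
Y = {"b", "c"}
```

Here X and Y meet only in the 'boundary' point b, so `externally_connected(X, Y, SUCC)` holds, whereas two disjoint sets, or two sets with overlapping interiors, fail the test.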

For the temporal component we take the standard linear temporal logic LTL,
which extends propositional logic with the temporal operators $\bigcirc$ (‘tomorrow’),
$\Diamond_F$ (‘eventually’), and $\Box_F$ (‘always in the future’). LTL is interpreted over
the flow of time consisting of the natural numbers (N, <). For example, the
following formula says that a day is Saturday if, and only if, the next day is
Sunday:


$\Box_F(\mathit{Saturday} \leftrightarrow \bigcirc \mathit{Sunday})$.



Reasoning in LTL has also been thoroughly investigated; it is PSPACE-complete as
well, and a number of temporal reasoning systems have been implemented. 
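
As a small illustration of how such a formula is evaluated over (N, <), the sketch below checks the Saturday/Sunday formula on a model that repeats the week forever. It is not a general LTL reasoner: it assumes a periodic labelling, for which inspecting one full period is enough, and the names `WEEK`, `day` and `always_saturday_iff_next_sunday` are hypothetical.

```python
WEEK = ["Mon", "Tue", "Wed", "Thu", "Fri", "Sat", "Sun"]

def day(n):
    """Label of time point n in a model that repeats the week forever."""
    return WEEK[n % len(WEEK)]

def always_saturday_iff_next_sunday(label, period):
    """Check 'always (Saturday iff tomorrow Sunday)' on a periodic labelling.

    Because the labelling repeats with the given period, it suffices to
    inspect the time points 0, ..., period - 1.
    """
    return all((label(n) == "Sat") == (label(n + 1) == "Sun")
               for n in range(period))
```

The check succeeds for `day` and fails, for instance, for a labelling in which every day is Saturday.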

Now our aim is to construct a combination of $S4_u$ and LTL where we could
express, for example, that today spatial objects X and Y are not externally
connected, but tomorrow they are:


$\neg EC(X,Y) \wedge \bigcirc EC(X,Y)$,


or that the spatial object X today is externally connected with the space $\bigcirc X$ it
will be occupying tomorrow:


$EC(X, \bigcirc X)$,


or that, starting from some future moment, X will never change its position:


$\Diamond_F \Box_F (X = \bigcirc X)$.


Having efficient spatial and temporal reasoners S and T at our disposal (for $S4_u$
and LTL, respectively), the quickest way of constructing a combined spatio-temporal
reasoning system is to organise their joint work in a modular way: first,
say, S treats the input constraints regarding formulas that start with temporal
operators as atomic, then T deals with them regarding formulas with spatial
operators as atomic, etc. Clearly, the resulting system works in PSPACE. But 
unfortunately, such a reasoner does not take into account any interaction be- 
tween the spatial and temporal operators: the problem is that a spatio-temporal 
formula is recognised as valid by this reasoner only if it is valid in arbitrary 
fusions of topological models with (possibly many) isomorphic copies of the 
flow of time (N, <). In such models, spatial objects are not moving in the same 
space over the same flow of time because the topological space at moment n 
may have absolutely nothing to do with the space at moment n + 1, or, dually, 
every point of space has its own history. In particular, one could expect the 
constraint 
$\bigcirc EC(X,Y) \leftrightarrow EC(\bigcirc X, \bigcirc Y)$


to be a valid principle of spatial-temporal logic—yet, our reasoner would not 
confirm this: it would claim that the negation of this formula is satisfiable. 

Of course, from a purely semantical perspective, this problem can easily be 
overcome by restricting the class of intended models to those where the same 
topological space is kept along the whole time line. In other words, we can 
assume that the underlying topological space does not change in time; what 
changes is the position, shape, size, etc. of spatial objects. Mathematically this 
means that the intended spatio-temporal models for combinations of $S4_u$ and
LTL are the Cartesian products of topological spaces and (N, <).
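
A product model of this kind can be sketched as a fixed finite space together with one interpretation of X and Y per moment of time. The code below is an illustration under several assumptions: the space is a five-point 'digital line' given by a quasi-order, the model becomes constant after its last listed snapshot, and all names (`SUCC`, `SNAPSHOTS`, `value`, `next_value`, `ec_at`) are hypothetical.

```python
# Fixed space: points 0..4; odd points see only themselves (they are open),
# even points also see their odd neighbours.
SUCC = {i: frozenset({i} if i % 2 else {j for j in (i - 1, i, i + 1) if 0 <= j <= 4})
        for i in range(5)}

# One interpretation of the spatial variables per moment; the space never changes.
SNAPSHOTS = [
    {"X": frozenset({0, 1}), "Y": frozenset({3, 4})},
    {"X": frozenset({0, 1, 2}), "Y": frozenset({2, 3, 4})},
]

def value(name, n):
    """Extension of a variable at moment n (constant after the last snapshot)."""
    return SNAPSHOTS[min(n, len(SNAPSHOTS) - 1)][name]

def next_value(name, n):
    """Extension of 'tomorrow name' at moment n: the same points, one step later."""
    return value(name, n + 1)

def interior(x):
    return frozenset(p for p in x if SUCC[p] <= x)

def ec(x, y):
    return bool(x & y) and not (interior(x) & interior(y))

def ec_at(n):
    return ec(value("X", n), value("Y", n))
```

Because tomorrow's extension of X lives in the same space as today's, 'EC(X, Y) holds tomorrow' and 'tomorrow's X and tomorrow's Y are externally connected today' are computed from the same sets, so they agree at every moment. In this particular model X and Y are not externally connected at moment 0 but are at moment 1.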

Such models provide a natural interpretation for the formulas considered 
above, with the last one being valid in all of them. But on the other hand, in 
order to deal with them we need a new, perhaps more sophisticated reasoning 
system. Is it, at least in principle, possible to design an effective complete and 
sound system of this kind? 

A moment’s reflection about the possible computational behaviour of such a 
system brings to memory another model, which logicians and computer scien- 
tists know all too well. We mean Turing’s model of computation. The tape of a 
Turing machine can be regarded as a somewhat simplified model of space where 
a ‘spatial object’ is the collection of cells containing a certain symbol from the 
alphabet. Putting the problem in this perspective, one can immediately start 
suspecting that perhaps even a modestly expressive spatio-temporal language 
could be able to describe the change of spatial objects over time which corre- 
sponds to the computation of a Turing machine. And if this is indeed the case 
then, using the operator $\Diamond_F$ for ‘eventually’, it appears almost trivial to state
that the Turing machine eventually reaches the halting state on a given input, 
which would mean that reasoning in the hybrid language cannot be decidable 
(or, even worse, that the set of valid spatio-temporal formulas is not recursively 
enumerable). 
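
The analogy can be made concrete with a toy simulation in which every configuration of the tape is turned into a 'spatial snapshot': one spatial object per symbol, namely the set of cells holding it. This is only an illustrative sketch, not the encoding used in the undecidability proofs; the machine `DELTA`, the blank symbol `_`, the state names and the step bound `max_steps` are all assumptions.

```python
def run(delta, tape, state="q0", halt="halt", max_steps=100):
    """Yield (state, head, objects) per step, where objects maps each
    symbol to the set of cells currently holding it."""
    cells = dict(enumerate(tape))
    head = 0
    for _ in range(max_steps):
        objects = {}
        for pos, sym in cells.items():
            objects.setdefault(sym, set()).add(pos)
        yield state, head, objects
        if state == halt:
            return
        sym = cells.get(head, "_")          # unvisited cells are blank
        state, write, move = delta[(state, sym)]
        cells[head] = write
        head += move

# Hypothetical machine: overwrite 1s with 0s moving right, halt at the first blank.
DELTA = {("q0", "1"): ("q0", "0", 1), ("q0", "_"): ("halt", "_", 0)}

snapshots = list(run(DELTA, "111"))
eventually_halts = any(state == "halt" for state, _, _ in snapshots)
```

The bound `max_steps` is exactly where the sketch departs from the logic: a formula with 'eventually' quantifies over the whole infinite run, which no bounded simulation can decide in general.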

Now, obviously, the topological language $S4_u$ and many other languages to
be considered in this chapter are not designed to represent knowledge about the
tape of a Turing machine (to begin with, there is no obvious topology on such
a tape). Some much smarter ‘encoding techniques’ may be needed to prove
that combinations of $S4_u$ and LTL (and similar logics) are undecidable. Yet,
the first major result of this chapter shows that the intuition behind the simula- 
tion of Turing machines discussed above is correct: naive and straightforward 
combinations of spatial and temporal logics (interpreted in Cartesian products 
of time and space) almost invariably lead to undecidable hybrids. 

The second major result, however, is that by closely inspecting the expressive 
means required to simulate Turing machines one can still find hierarchies of 
useful and expressive hybrids of $S4_u$ and LTL, their fragments, and some
related logics which are decidable and of reasonably low complexity. 

The structure of this chapter is as follows. In the next section, we discuss in 
more detail, but still on a rather abstract level, our main paradigm of “snapshot
spatio-temporal models” and the most important reasoning problems relevant to
these models. 

Then, in Sec. 3 and 4, we discuss in detail the ingredients of the spatio-temporal
logics to be constructed and investigated in this chapter. We consider
two families of spatial logics. The first one comprises formalisms
designed for reasoning about topological relations among spatial objects and
ranging from RCC-8 to $S4_u$, possibly with component counting. A remarkable
feature of these logics is their ‘computational robustness’ in the sense that the
complexity of reasoning gradually increases from NP for RCC-8 to PSPACE
and NEXPTIME for $S4_u$ without and with component counting, respectively.
Moreover, each complexity ‘jump’ in this hierarchy is clearly connected to the
corresponding increase in the logic’s expressiveness. Our second family of
spatial logics consists of formalisms that are capable of reasoning about distances
in metric spaces. Some of these logics will contain $S4_u$ and, therefore,
combine topological reasoning with reasoning about distances. These logics
are also computationally robust, with the typical complexity being EXPTIME.

The introduction to temporal logic systems in Sec. 4 is much shorter, as 
we only consider two approaches to logic modelling of time: time as a linear 
discrete sequence of time points or snapshots, and time as a tree-like structure 
of such snapshots representing some aspects of non-determinism. Other flows 
of time, say, continuous time, are not discussed, but pointers to the literature 
are provided. 

Having introduced the logical systems for space and time, in Sec. 5 we 
discuss general combination principles—requirements and constraints for the 
operator + in (9.1)—which will guide us when designing combined spatio- 
temporal systems. Then, in Sec. 6 and 7, we use these principles to construct 
spatio-temporal logics out of the components introduced in Sec. 3 and 4. As 
before, the emphasis of this investigation is on the trade-off between the expressive
power and the complexity of reasoning. We shall discover, in particular,
that unlike the ‘robust’ component logics, their spatio-temporal hybrids turn
out to be much more sensitive to seemingly minor changes in expressiveness. 

In Sec. 8, we consider a somewhat different paradigm of spatio-temporal 
models and languages for reasoning about them: here we formalise spatio- 
temporal reasoning within the framework of dynamical systems based on topo- 
logical and metric spaces with continuous and isometric functions, respectively. 
As logics for dynamical systems are discussed in detail elsewhere in this Hand- 
book (see Ch. 10), we concentrate here on the connection between the spatio- 
temporal systems introduced before and the dynamical systems perspective. It 
will turn out that in some cases the connections between the two approaches 
are so strong that results can be mutually imported from one area to the other. 

Finally, in Sec. 9, we briefly discuss the relation between spatio-temporal 
logics and other temporalised formalisms, for example first-order temporal 
logics and temporal epistemic logics. 

The reader who considers computational complexity less important and is 
interested in logic modelling of (relativistic) space-time using classical first- 
order logic is referred to Ch. 11. 


2. Static and changing spatial models 


The intended models of standard spatial logics are usually based on ‘math- 
ematical spaces’ such as (variations of) topological or metric spaces and their 
relational or algebraic representations or abstractions. We will consider many 
examples of such models and spaces in Sec. 3.1; more can be found elsewhere 
in this Handbook. Meanwhile, in order to discuss basic principles of intro- 
ducing a temporal dimension into otherwise static spatial models, we neglect 
the concrete structure of these ‘mathematical spaces’ and concentrate on the 
generic properties of the models. 

To represent spatial entities in models we require a countably infinite supply 
of spatial variables (that is, unary predicates) $p_0, p_1, \dots$. Thus, a generic spatial
model can be thought of as a structure of the form


(9.2)    $M = (G, p_0^{M}, p_1^{M}, \dots)$


where G is the underlying ‘mathematical space’ (say, a metric or topological
space, or a structured collection of polygons on the Euclidean plane) and the
$p_i^{M}$ are interpretations of the spatial variables as subsets of the domain of G.

Depending on the underlying spatial ontology, one can distinguish between 
two types of models: 


• point-based models, where spatial objects are (explicitly or implicitly)
thought of as consisting of sets of points, and


• models with extended spatial entities as basic elements (say, regions or
intervals) together with certain relations between them.


Point-based spatial models. In a point-based model of the form (9.2),
the underlying ‘mathematical space’ G is a collection of points equipped with
‘point-wise defined’ operators (like a metric or topological space). Interpretations
$p_i^{M}$ of spatial variables $p_i$ (that is, subsets of the domain of G) represent
spatial objects. Thus, a spatial object is identified with the set of points it 
occupies. By imposing various constraints on these interpretations—say, by al- 
lowing only polygons, circles or regular closed connected sets—we can reflect 
the desired requirements on the form of spatial objects. 


Region-based spatial models. In a region-based model of the form (9.2), 
spatial objects are represented as (unstructured) elements of the underlying 
‘space’ G. We may consider as the domain of G, for instance, the collection of
polygons on the Euclidean plane and completely forget about the plane itself. 
The ‘structure’ of spatial objects is reflected then by certain relations among 
them (say, polygon x has a common edge with polygon y) which should be 
specified in G (for details and further references see Ch. 2, 3 and 7 of this 
Handbook). Spatial variables are again interpreted as sets of elements of the 
domain of G, for instance as a set of polygons approximating the map of the U.K. 
(including the Isle of Wight, the Hebrides, and other islands), or the singleton 
set containing (the polygonal approximation of) the Isle of Man. 


In this chapter we only consider point-based spatial models, although some 
results and constructions can be generalised to region-based ones. 


Snapshot spatio-temporal models. The intended models of temporal 
logics are supposed to represent the change of states—which, in our case, 
should be spatial models (9.2)—over time, under actions, etc. In most cases it 
makes sense to assume that space always remains the same. Moreover, one can 
usually simulate expanding, shrinking or varying space in some ‘sufficiently 
large’ constant space (e.g., Gabbay et al., 2003). The motion of spatial objects 
can therefore be modelled by changing the interpretations $p_i^{M}$ of spatial objects
from one state to another. (A different approach to modelling motion was taken 
by Muller (1998), who considered a moving object as a single spatio-temporal 
entity.) 

There are many different time paradigms developed in philosophy, mathe- 
matics, physics, computer science and other disciplines: linear and branching, 
discrete and dense, point-based and interval-based, etc. (e.g., Gabbay et al., 
1994; Gabbay et al., 2000; Fisher et al., 2005). In this chapter we mainly focus 
on the flow of time that can be represented by the natural numbers (N, <), 
where < is the temporal precedence relation between time points. In this case 
our generic snapshot spatio-temporal model is simply an infinite sequence 


(9.3)    $M_0 = (G, p_0^{M_0}, p_1^{M_0}, \dots),\quad M_1 = (G, p_0^{M_1}, p_1^{M_1}, \dots),\quad \dots$


Figure 9.1. Linear snapshot model with a moving spatial object.


of spatial models of the form (9.2) with the same space G; see Fig. 9.1. In 
Sec. 4.2 and 6.2 we will briefly consider temporal and spatio-temporal models 
with branching (tree-like discrete) time that can capture some aspects of non- 
determinism. In either of these time paradigms the points of time can be taken as 
primitive temporal entities, assumed to be generated by state transition systems 
(a standard computer science approach), or by dynamical systems (a usual way 
in mathematics). 
As an illustration let us consider the following example. 


Spatial transition systems. Our main example running throughout the 
chapter is a spatial transition system which describes the changing geography 
of the Earth as we see it every day in BBC’s weather forecasts, say, in Ten 
O’Clock News. Every day the state of the map is represented by a spatial 
model 

$M = (E, p_0^{M}, p_1^{M}, \dots)$,


where E is a suitable mathematical model of the Earth’s surface and each $p_i^{M}$
is the space occupied by the geographical object modelled by $p_i$ (either static,
such as a town or a county, or dynamic, such as a night frost or rainfall area, etc.) on that
day. Starting from a certain day in the past, we can trace then the day-after- 
day changes that have happened till the present moment. Depending on our 
philosophical, religious, etc. views we can regard the future as deterministic
or not. In particular, we can imagine that today’s state may evolve in many
different ways. 

In computer science, such scenarios are often described in terms of state tran- 
sition systems—spatial transition systems in our context—which, in general, 
are tuples of the form 


(9.4)    $(S, \to, \mu, G, s_0)$,


where S is a nonempty set of states, $\to$ is a binary transition relation on S
without dead-ends (states without outgoing $\to$), $\mu$ is a function associating
with each state $s \in S$ a spatial model $\mu(s)$ of the form (9.2) based on the same
space G, and $s_0$ is the initial state. Possible evolutions (or transformations) of
this initial state are sequences


(9.5)    $s_0 \to s_1 \to s_2 \to \dots$,


where $s_i \in S$ for all $i \in \mathbb{N}$. Each of these evolutions obviously generates a
linear snapshot model


$\mu(s_0), \mu(s_1), \mu(s_2), \dots$


of the form (9.3). In the deterministic case (as in the second example below)
we have a single evolution. In general, however, the transition relation $\to$
of a spatial transition system can represent non-deterministic rules. Then it 
generates a discrete tree of evolutions (9.5). 
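
A spatial transition system of the form (9.4) and the finite prefixes of its evolutions (9.5) can be sketched as follows. The two-state 'weather' system, the region name `north` and all function names are hypothetical and chosen only for illustration.

```python
# Hypothetical system: each state is mapped by MU to a (tiny) spatial model
# that records where it is raining.
STATES = {"dry", "rain"}
TRANSITIONS = {"dry": ["dry", "rain"], "rain": ["dry"]}
MU = {"dry": {"rainfall": frozenset()},
      "rain": {"rainfall": frozenset({"north"})}}
INITIAL = "dry"

def has_no_dead_ends(states, transitions):
    """Every state must have at least one outgoing transition."""
    return all(transitions.get(s) for s in states)

def evolutions(transitions, start, length):
    """All evolution prefixes s0 -> s1 -> ... with `length` states:
    one level of the discrete tree of evolutions."""
    paths = [[start]]
    for _ in range(length - 1):
        paths = [p + [t] for p in paths for t in transitions[p[-1]]]
    return paths

def snapshot_prefix(path, mu):
    """Prefix of the linear snapshot model generated by an evolution."""
    return [mu[s] for s in path]
```

For example, `evolutions(TRANSITIONS, INITIAL, 3)` lists the three prefixes of length three, reflecting the non-deterministic choice available in state `dry`.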

What precisely can be said about these models depends of course on the
concrete spatial and temporal logics we use. Here we give a few examples 
of English statements about our ‘geographical transition system’ that will be 
represented as spatio-temporal formulas in Sec. 5 and 6.1. 


(A) If two clouds are disconnected now, then at the next moment they either 
remain disconnected or become externally connected. 


(B) Kaliningrad is disconnected from the EU until the moment when Poland 
becomes a tangential proper part of the EU, after which Kaliningrad and 
the EU are externally connected forever. 


(C) The current position of a hurricane overlaps its position in an hour. 


(D) If tomorrow object X is at the place where object Y is today, then Y will 
have to move by tomorrow. 


(E) The space occupied by Europe never changes. 
(F) In two years the EU will be extended with Romania and Bulgaria. 


(G) It will be raining over every part of England ever and ever again.


(H) If the Earth consists of water and land, and the space occupied by water
expands, then the space occupied by land shrinks. 


(I) Two deserts that expand by at least a mile in all directions every year 
must eventually intersect. 


Reasoning tasks. We have not yet introduced any formal languages capable
of talking and reasoning about spatio-temporal models; they depend on the
concrete spatial and temporal logics we combine as well as the combination
principles to be discussed later on in Sec. 5. Nonetheless, it does make sense
to consider on this abstract level the main reasoning problems one might be
interested in for some fictional language $\mathcal{L}$.

The most general and important problem we are going to consider is 


• satisfiability of spatio-temporal constraints.


Suppose that we have formulated a finite set $\Gamma$ of $\mathcal{L}$-formulas representing constraints
on possible spatio-temporal scenarios. Then we are facing the following
questions. Is this set $\Gamma$ satisfiable (or consistent)? In other words, does there
exist a spatio-temporal model realising these constraints? And if so, what
might such a model look like? For example, can it be given by a finite transition
system? Can it be based on a finite space?

Of particular interest to us will be algorithmic properties of the satisfiability 
problem. Is this problem decidable? That is, does there exist an algorithm which
is capable of deciding, given an arbitrary finite set $\Gamma$ of constraints, whether $\Gamma$
is satisfiable? Are finite sets of satisfiable constraints recursively enumerable?
What is the computational (worst-case) complexity of the satisfiability problem?

Note that the deduction (or entailment) problem ‘given a finite set $\Gamma$ of
constraints and an $\mathcal{L}$-formula $\varphi$, decide whether $\varphi$ holds in all spatio-temporal
models where $\Gamma$ holds’ is usually reducible to the satisfiability problem.

The satisfiability problem can be restricted to certain classes of $\mathcal{L}$-formulas
and constraints. Here is a typical example. We describe the behaviour of spatial
transition systems by imposing some local constraints $\Gamma$ which specify possible
initial states and transitions from each given state to the next ones. This is done
in some sublanguage $\mathcal{L}_{loc}$ of $\mathcal{L}$. We can also specify (by means of $\mathcal{L}_{loc}$-formulas)
states with some desirable property $\varphi$ or some ‘bad’ property $\psi$. And then we
are interested in the algorithmic properties of


• the reachability problem relative to $\mathcal{L}_{loc}$: ‘is it the case that every model
where constraints $\Gamma$ hold contains a state satisfying $\varphi$?’ or


• the safety problem relative to $\mathcal{L}_{loc}$: ‘is it the case that no model where
constraints $\Gamma$ hold contains a state satisfying $\psi$?’


In the extreme case, when $\mathcal{L}_{loc}$ is expressive enough to describe (up to isomorphism)
any particular spatial transition system, checking for reachability,
safety or some other property is an instance of the classical model checking
problem (see, e.g., Clarke et al., 2000, and references therein).
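
For a single, explicitly given finite transition system, the safety question reduces to a search of the reachable states: since there are no dead-ends, every reachable state lies on some evolution. The sketch below assumes such an explicit system; the states, the predicate `is_bad` and the function names are hypothetical, and nothing here addresses the harder problem of reasoning from constraints alone.

```python
from collections import deque

def reachable(transitions, start):
    """All states reachable from `start`, found by breadth-first search."""
    seen, queue = {start}, deque([start])
    while queue:
        state = queue.popleft()
        for successor in transitions[state]:
            if successor not in seen:
                seen.add(successor)
                queue.append(successor)
    return seen

def is_safe(transitions, start, is_bad):
    """True iff no evolution starting in `start` ever visits a bad state."""
    return not any(is_bad(s) for s in reachable(transitions, start))

# Hypothetical system: 'flood' exists but cannot be reached from 'dry'.
TRANSITIONS = {"dry": ["dry", "rain"], "rain": ["dry"], "flood": ["flood"]}
```

With these transitions the system started in `dry` is safe with respect to `flood` but not with respect to `rain`.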


3. Spatial logics 


In this section we introduce the ‘mathematical spaces’ and spatial logics 
capable of talking and reasoning about these spaces that will serve as the spatial 
components of our spatio-temporal formalisms. 

We consider spatial logics of two types: (i) those that can represent and reason 
about topological relations among spatial objects, and (ii) those that can addi- 
tionally take into account distances between objects. The former are interpreted 
over topological spaces and the latter over metric (or more generally, distance) 
spaces. The choice of these logics is motivated by the following reasons. First, 
topological and metric spaces belong to the most important and well-understood 
structures representing space. Reasoning about topological relations between 
regions such as ‘X is externally connected to Y’ or ‘X is a tangential proper part of
Y’ has proved to be one of the most successful approaches to qualitative spatial
knowledge representation and reasoning (KR&R) in artificial intelligence; see,
e.g., (Cohn and Hazarika, 2001) and references therein. Extensions of ‘topo-logics’
with distance operators like $\exists^{\leq a}X$ giving the a-neighbourhood of X or
$X \leftleftarrows Y$ giving the set of points that are closer to X than to Y are becoming another
interesting research stream (Kutz et al., 2003; Wolter and Zakharyaschev,
2003; Wolter and Zakharyaschev, 2005a) that is especially close to the authors’
hearts. Other important aspects of space such as, e.g., orientation have been 
considered as well; however, no combinations with temporal logics have been 
constructed so far. We believe that the approach to combining spatial logics of 
topological and metric spaces with temporal ones to be presented later on in 
this chapter can be extended to other spatial formalisms as well. 





3.1 Metric and topological spaces 


Metric spaces. A metric space is a pair (A, d), where A is a nonempty set 
(of points) and d is a function from $A \times A$ into the set $\mathbb{R}^{\geq 0}$ (of non-negative
real numbers) satisfying the following axioms:


(9.6)    identity of indiscernibles:  $d(x,y) = 0$ iff $x = y$,
(9.7)    symmetry:  $d(x,y) = d(y,x)$,
(9.8)    triangle inequality:  $d(x,z) \leq d(x,y) + d(y,z)$,


for all $x, y, z \in A$. The value $d(x,y)$ is called the distance between points x
and y. Given a metric space (A, d), a point $x \in A$ and a nonempty $Y \subseteq A$,
define the distance $d(x, Y)$ between x and Y by taking

$d(x, Y) = \inf\{d(x, y) \mid y \in Y\}$.


As usual, $d(x, \emptyset) = \infty$. The distance $d(X, Y)$ between two nonempty sets X
and Y is

$d(X, Y) = \inf\{d(x, y) \mid x \in X,\ y \in Y\}$.


Distance spaces. Although acceptable in many cases, the defined concept of 
metric space is not universally applicable to all interesting measures of distance 
between points, especially those used in everyday life. Consider, for instance, 
the following two examples: 


(i) If d(x, y) is the flight-time from x to y then, as we know all too well, d is
not necessarily symmetric, even approximately (just take a plane from London 
to Tokyo and back). 


(ii) Often we do not measure distances by means of real numbers but rather 
using more fuzzy notions such as ‘short,’ ‘medium’ and ‘long.’ To represent 
these measures we can, of course, take functions d from A x A into the subset 
$\{1, 2, 3\}$ of $\mathbb{R}^{\geq 0}$ and define short := 1, medium := 2, and long := 3. So we
can still regard these distances as real numbers. However, for measures of this 
type the triangle inequality (9.8) does not make sense (short plus short can still 
be short, but it can also be medium or long). 


Spaces (A, d) satisfying only the axiom (9.6) are called distance spaces. 
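
The difference between metric and distance spaces can be checked mechanically on finite examples. The sketch below reports which of the axioms (9.6)-(9.8) fail for a given function; the flight times and the short/medium/long table are made-up numbers used only to mirror examples (i) and (ii), and all names are hypothetical.

```python
from itertools import product

def violated_axioms(points, d):
    """Names of the axioms (9.6)-(9.8) that fail for d on a finite set."""
    failed = set()
    for x, y in product(points, repeat=2):
        if (d(x, y) == 0) != (x == y):
            failed.add("identity of indiscernibles")
        if d(x, y) != d(y, x):
            failed.add("symmetry")
    for x, y, z in product(points, repeat=3):
        if d(x, z) > d(x, y) + d(y, z):
            failed.add("triangle inequality")
    return failed

# Example (i): asymmetric flight times (invented figures, in hours).
CITIES = ["London", "Tokyo"]
FLIGHT_HOURS = {("London", "Tokyo"): 11.5, ("Tokyo", "London"): 12.5}

def flight_time(x, y):
    return 0 if x == y else FLIGHT_HOURS[(x, y)]

# Example (ii): short = 1, medium = 2, long = 3 on three invented places.
PLACES = ["a", "b", "c"]
FUZZY = {frozenset("ab"): 1, frozenset("bc"): 1, frozenset("ac"): 3}

def fuzzy(x, y):
    return 0 if x == y else FUZZY[frozenset((x, y))]
```

Both examples satisfy (9.6) and are therefore distance spaces, but the first violates symmetry and the second the triangle inequality (short plus short turns out to be long).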


Topological spaces. A topological space is a pair (U, I) in which U is a
nonempty set, the universe of the space, and I is the interior operator on U
satisfying the Kuratowski axioms: for all $X, Y \subseteq U$,


$I(X \cap Y) = IX \cap IY$,  $IX \subseteq IIX$,  $IX \subseteq X$  and  $IU = U$.


The operator dual to I is called the closure operator and denoted by C: for every
$X \subseteq U$, we have $CX = U - I(U - X)$. Thus, IX is the interior of a set X,
while CX is its closure. X is called open if X = IX and closed if X = CX.
The complement of an open set is closed and vice versa. The boundary of a
set $X \subseteq U$ is defined as $CX - IX$. Note that X and $U - X$ have the same
boundary.

Topological spaces are often (equivalently) defined as pairs $(U, \mathcal{O})$, where $\mathcal{O}$
is a family of (open) subsets of U such that $\mathcal{O}$ is closed under arbitrary unions
and finite intersections.
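
The passage between the two definitions can be illustrated on a finite space: starting from a family of open sets, the interior of X is the union of the opens contained in X, and closure and boundary follow by duality. The three-point space and the names below are assumptions made for this sketch.

```python
from itertools import combinations

def powerset(universe):
    items = sorted(universe)
    return [frozenset(c) for r in range(len(items) + 1) for c in combinations(items, r)]

def make_interior(opens):
    """Interior operator of a finite space given by its family of open sets."""
    def interior(x):
        result = frozenset()
        for o in opens:
            if o <= x:
                result |= o   # union of all opens contained in x
        return result
    return interior

U = frozenset({1, 2, 3})
OPENS = [frozenset(), frozenset({1}), frozenset({1, 2}), U]  # assumed topology
I = make_interior(OPENS)

def C(x):
    """Closure, defined as the dual of the interior."""
    return U - I(U - x)

def boundary(x):
    return C(x) - I(x)

def kuratowski_axioms_hold():
    subsets = powerset(U)
    return (I(U) == U
            and all(I(x) <= x and I(x) <= I(I(x)) for x in subsets)
            and all(I(x & y) == I(x) & I(y) for x in subsets for y in subsets))
```

In this space the set {1} and its complement {2, 3} have the same boundary, namely {2, 3}.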


Metric spaces and topology. Each metric space (A, d) gives rise to the 
interior operator $I_d$ on A: for all $X \subseteq A$,


$I_d X = \{x \in X \mid \exists \varepsilon > 0\ \forall y\ (d(x,y) < \varepsilon \to y \in X)\}$.

The pair $(A, I_d)$ is called the topological space induced by the metric space
(A, d). The dual closure operator $C_d$ in this space can be defined by the
equality


$C_d X = \{x \in A \mid \forall \varepsilon > 0\ \exists y \in X\ d(x,y) < \varepsilon\}$.


We briefly remind the reader of a few standard examples of metric and 
topological spaces that will be used in what follows. 


Euclidean spaces. The one-dimensional Euclidean space is the set of real
numbers $\mathbb{R}$ equipped with the metric


$d_1(x, y) = |x - y|$.


Let $X \subseteq \mathbb{R}$. A point $x \in \mathbb{R}$ is said to be interior in $X$ if there is some $\varepsilon > 0$
such that the whole open interval $(x - \varepsilon, x + \varepsilon)$ belongs to $X$. The interior
$\mathbf{I}X$ of $X$ is then defined as the set of all interior points in $X$. It is not hard to
check that $(\mathbb{R}, \mathbf{I})$ is the topological space induced by the Euclidean metric $d_1$.
Open sets in $(\mathbb{R}, \mathbf{I})$ are (possibly infinite) unions of open intervals $(a, b)$, where
$a < b$. The closure of $(a, b)$, for $a < b$, is the closed interval $[a, b]$, with the end
points $a$ and $b$ being its boundary.

In the same manner one can define $n$-dimensional Euclidean spaces based
on the universes $\mathbb{R}^n$ with the metric


$d_n(x, y) = \sqrt{\sum_{i=1}^{n} (x_i - y_i)^2}$,


where $x$ and $y$ are $n$-dimensional vectors $(x_1, \dots, x_n)$ and $(y_1, \dots, y_n)$,
respectively (in the definition of interior points $x$ one should take $n$-dimensional
$\varepsilon$-neighbourhoods of $x$).


Metric spaces on graphs. Another well known example is metric spaces 
on graphs: the distance between two nodes of a graph is defined as the length 
of the shortest path between them. 
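The graph metric just described can be sketched in a few lines. The example below is only an illustration under stated assumptions: the graph is undirected, connected and unweighted, and the 4-cycle `cycle` and the helper names `shortest_path_metric` and `is_metric` are hypothetical, not taken from the text.

```python
from collections import deque
from itertools import product

def shortest_path_metric(adj):
    """Return d with d[u][v] = length of a shortest path from u to v (BFS)."""
    d = {}
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        d[source] = dist
    return d

def is_metric(d):
    """Check d(x,y)=0 iff x=y, symmetry, and the triangle inequality."""
    pts = list(d)
    return (all((d[x][y] == 0) == (x == y) for x, y in product(pts, pts))
            and all(d[x][y] == d[y][x] for x, y in product(pts, pts))
            and all(d[x][z] <= d[x][y] + d[y][z]
                    for x, y, z in product(pts, pts, pts)))

# Hypothetical undirected, connected graph: the 4-cycle a-b-c-d-a.
cycle = {"a": ["b", "d"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c", "a"]}
dist = shortest_path_metric(cycle)
```

On a directed or disconnected graph the same construction would in general fail symmetry or leave some distances undefined, which is why the sketch restricts itself to the undirected, connected case.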


Aleksandrov spaces. A topological space is called an Aleksandrov space
(Alexandroff, 1937) if arbitrary (not only finite) intersections of open sets are
open. Aleksandrov spaces are closely related to quasi-ordered sets, that is,
pairs $\mathfrak{G} = (V, R)$, where $V$ is a nonempty set and $R$ a transitive and reflexive
relation on $V$. Every such quasi-order $\mathfrak{G}$ induces the interior operator $\mathbf{I}_{\mathfrak{G}}$ on $V$:
for $X \subseteq V$,


$\mathbf{I}_{\mathfrak{G}} X = \{x \in X \mid \forall y \in V\ (xRy \to y \in X)\}$.


In other words, the open sets of the topological space $\mathfrak{T}_{\mathfrak{G}} = (V, \mathbf{I}_{\mathfrak{G}})$ are the
upward closed (or $R$-closed) subsets of $V$. The minimal neighbourhood of a
point $x$ in $\mathfrak{T}_{\mathfrak{G}}$ (that is, the minimal open set to contain $x$) consists of all those
points that are $R$-accessible from $x$. It is well-known (e.g., Bourbaki, 1966)
that $\mathfrak{T}_{\mathfrak{G}}$ is an Aleksandrov space and, conversely, every Aleksandrov space is
induced by a quasi-order.
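For a finite quasi-order the induced interior operator can be computed directly and the Kuratowski axioms checked by brute force. The following is a minimal sketch; the three-point quasi-order `R` (given as a successor map) and all function names are illustrative assumptions.

```python
from itertools import combinations

def interior(R, X):
    """I_G X = {x in X | every y with x R y lies in X}."""
    return frozenset(x for x in X if all(y in X for y in R[x]))

def closure(R, V, X):
    """C X = V - I(V - X), the dual of the interior operator."""
    return V - interior(R, V - X)

def subsets(V):
    items = sorted(V)
    return [frozenset(c) for k in range(len(items) + 1)
            for c in combinations(items, k)]

def kuratowski_axioms_hold(R, V):
    S = subsets(V)
    return (all(interior(R, X & Y) == interior(R, X) & interior(R, Y)
                for X in S for Y in S)
            and all(interior(R, X) <= interior(R, interior(R, X)) for X in S)
            and all(interior(R, X) <= X for X in S)
            and interior(R, V) == V)

# Hypothetical quasi-order on V = {0, 1, 2}: 0 R 1 and 0 R 2, plus reflexivity.
V = frozenset({0, 1, 2})
R = {0: {0, 1, 2}, 1: {1}, 2: {2}}

# The open sets are exactly the upward closed subsets of V.
open_sets = {X for X in subsets(V) if interior(R, X) == X}
```

Here `open_sets` consists of the empty set, `{1}`, `{2}`, `{1, 2}` and `V`, and the closure of `{1}` picks up the point 0 from which 1 is accessible.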


For various generalisations of metric and topological spaces (like semi- 
metrics, closure spaces and digital topology) see Ch. 12. 


3.2 Topo-logics 


In this section, we introduce and discuss a number of logical formalisms 
which can represent and reason about topological relations among spatial ob- 
jects interpreted over topological spaces. Our choice of logics was guided by 
two criteria: (i) they should be sufficiently expressive to represent interesting 
and useful topological knowledge as identified in the qualitative spatial reason- 
ing community; (ii) on the other hand, reasoning with such logics should be 
decidable and, if possible, of low computational complexity. Another impor- 
tant constraint on logics in the framework described in Sec. 2 is that change in 
time is modelled by changing the extensions of unary predicates representing 
spatial objects. 

The most developed and systematically studied spatial logics satisfying our
criteria are fragments of a ‘propositional’ logic in which ‘propositional
variables’ (= unary predicates) denote spatial objects, and topological relations
among them are represented by means of the interior and closure operators, the
universal and existential quantifiers over space, as well as the Booleans. This
logic, originally introduced as a modal logic, is known as $S4_u$. As we shall see
below, it can be regarded as the logic of topological spaces providing a common
roof to some other formalisms developed by the spatial community such as the
RCC-8 or 9-intersection region connection calculi (where topological relations
between regions are regarded as primitive).

Our exposition basically follows Gabelaia et al., 2005a, where the reader 
can find more details, references and proofs. For historical references and 
motivation see Ch. 5 of this Handbook. 


Modal logic of topological spaces. $S4_u$ is the well known propositional
modal logic S4 extended with the universal modalities. The ‘pedigree’ of S4
is quite unusual. It was introduced independently by Orlov (1928), Lewis
(Lewis and Langford, 1932), and Gödel (1933), without any intention to reason
about space. Orlov and Gödel understood it as a logic of ‘provability’ (in order
to provide a classical interpretation for the intuitionistic logic of Brouwer and
Heyting) and Lewis as a logic of necessity and possibility, that is, as a modal
logic. That it can be regarded as the logic of topological spaces was discovered
by Stone (1937), Tarski (1938), Tsao-Chen (1938) and McKinsey (1941).

In the spatial context it is useful to distinguish between spatial terms and
spatial formulas of $S4_u$, as explained below. Spatial terms are expressions of
the form:


(9.9)   $\tau ::= p_i \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \tau_1 \sqcup \tau_2 \mid \mathbf{I}\tau \mid \mathbf{C}\tau$,
where
■ the $p_i$ are spatial variables,
■ $\overline{\ \cdot\ }$, $\sqcap$ and $\sqcup$ are the standard Boolean operators (to be interpreted by the
set-theoretic complement, intersection and union),
■ $\mathbf{I}$ and $\mathbf{C}$ are the interior and closure operators, respectively (they correspond
to the box and diamond of the modal logic S4 but are denoted
differently to emphasise their topological nature).


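A topological model is a structure of the form
(9.10)   $\mathfrak{M} = (\mathfrak{T}, p_1^{\mathfrak{M}}, p_2^{\mathfrak{M}}, \dots)$,


where $\mathfrak{T} = (U, \mathbf{I})$ is a topological space and $p_i^{\mathfrak{M}} \subseteq U$ for all $i$. The extension
(or interpretation) $\tau^{\mathfrak{M}}$ of an arbitrary spatial term $\tau$ in $\mathfrak{M}$ is defined inductively
by taking:


$\overline{\tau}^{\mathfrak{M}} = U - \tau^{\mathfrak{M}}$, $(\tau_1 \sqcap \tau_2)^{\mathfrak{M}} = \tau_1^{\mathfrak{M}} \cap \tau_2^{\mathfrak{M}}$, $(\mathbf{I}\tau)^{\mathfrak{M}} = \mathbf{I}\tau^{\mathfrak{M}}$,


$(\tau_1 \sqcup \tau_2)^{\mathfrak{M}} = \tau_1^{\mathfrak{M}} \cup \tau_2^{\mathfrak{M}}$ and $(\mathbf{C}\tau)^{\mathfrak{M}} = \mathbf{C}\tau^{\mathfrak{M}}$.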


To be able to express how spatial terms $\tau_1$ and $\tau_2$ are related to each other
we require (at least) the atomic formula $\tau_1 \sqsubseteq \tau_2$ with the obvious intended
meaning: (the extension of) $\tau_1$ is a subset of (the extension of) $\tau_2$. By taking
Boolean combinations of such atoms, we arrive at what will be called spatial
formulas (or $S4_u$-formulas):


$\varphi ::= \tau_1 \sqsubseteq \tau_2 \mid \neg\varphi \mid \varphi_1 \wedge \varphi_2 \mid \varphi_1 \vee \varphi_2$,


where the $\tau_i$ are spatial terms. Formally, the language of $S4_u$ as defined above
is weaker than the standard one, say, from Goranko and Passy, 1992. However,
one can easily show that they have precisely the same expressive power: see,
e.g., Hughes and Cresswell, 1996 or Aiello and van Benthem, 2002.

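Spatial formulas can be either true or false in topological models. The truth-relation
$\mathfrak{M} \models \varphi$ (a spatial formula $\varphi$ is true in a topological model $\mathfrak{M}$) is
defined in the following way:


■ $\mathfrak{M} \models \tau_1 \sqsubseteq \tau_2$ iff $\tau_1^{\mathfrak{M}} \subseteq \tau_2^{\mathfrak{M}}$,
■ $\mathfrak{M} \models \neg\varphi$ iff $\mathfrak{M} \not\models \varphi$,
■ $\mathfrak{M} \models \varphi_1 \wedge \varphi_2$ iff $\mathfrak{M} \models \varphi_1$ and $\mathfrak{M} \models \varphi_2$,
■ $\mathfrak{M} \models \varphi_1 \vee \varphi_2$ iff $\mathfrak{M} \models \varphi_1$ or $\mathfrak{M} \models \varphi_2$.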





Clearly, the traditional universal modalities $\forall$ and $\exists$ of $S4_u$ are expressible in
the above language: $\forall\tau$ can be regarded as an abbreviation for $(\top \sqsubseteq \tau)$ and $\exists\tau$
for $\neg(\tau \sqsubseteq \bot)$, where $\top$ and $\bot$ are constant terms denoting the whole space and
the empty set, respectively. In what follows we will also freely use two other
‘atomic’ formulas $\tau_1 = \tau_2$ and $\tau_1 \neq \tau_2$ standing for $(\tau_1 \sqsubseteq \tau_2) \wedge (\tau_2 \sqsubseteq \tau_1)$ and
$\neg(\tau_1 = \tau_2)$, respectively.

Say that a spatial formula $\varphi$ is satisfiable (in a class $\mathcal{K}$ of topological models)
if there is a topological model $\mathfrak{M}$ (from $\mathcal{K}$) such that $\mathfrak{M} \models \varphi$. A spatial formula
$\varphi$ is satisfiable in a class of topological spaces if there is a topological model
$\mathfrak{M}$ based on a space from this class such that $\mathfrak{M} \models \varphi$.

This seemingly simple spatial language $S4_u$ can express rather complex
relations between sets in topological spaces. For example, the formula


$(q \sqsubseteq p) \wedge (p \sqsubseteq \mathbf{C}q) \wedge (p \neq \bot) \wedge (\mathbf{I}q = \bot)$


says that a set $q$ is dense in a nonempty set $p$, but has no interior. As an example
one can take $q$ to be the rationals $\mathbb{Q}$ and $p$ to be $\mathbb{R}$ in the Euclidean space $(\mathbb{R}, \mathbf{I})$.






















In the following theorem we collect the most important facts about $S4_u$; for
proofs and discussions see, e.g., Nutt, 1999, Areces et al., 2000 and references
therein.


THEOREM 9.1 (i) A spatial formula is satisfiable iff it is satisfiable in an
Aleksandrov space.

(ii) $S4_u$ enjoys the exponential finite model property in the sense that every
satisfiable spatial formula $\varphi$ is satisfiable in a topological space whose size is
at most exponential in the size of $\varphi$.

(iii) Satisfiability of spatial formulas in topological models is PSPACE-complete.


The language of the modal logic S4 mentioned above coincides with the
language of $S4_u$-terms. Say that a spatial term (= S4-formula) is satisfiable
if there is a topological model where the term is interpreted as a nonempty set.
Although it is of the same computational complexity as S4 (which is also
PSPACE-complete), the logic $S4_u$ is more expressive. For example, spatial
formulas can distinguish between arbitrary and connected topological spaces
(we remind the reader that a topological space is connected if its universe cannot
be represented as the union of two disjoint nonempty open sets). Consider the
formula


(9.11)   $(\mathbf{C}p \sqsubseteq p) \wedge (p \sqsubseteq \mathbf{I}p) \wedge (p \neq \bot) \wedge (p \neq \top)$


saying that (the extension of) $p$ is both closed and open, nonempty and does
not coincide with the whole space. It can only be satisfied in a model based
on a disconnected topological space, while all satisfiable S4-terms are satisfied
in connected (e.g., Euclidean) spaces. Indeed, we have the following result
(McKinsey and Tarski, 1944):


THEOREM 9.2 An S4-formula is satisfiable iff it is satisfiable in any of (and
so in all) $\mathbb{R}^n$, $n > 0$.


Figure 9.2. Regular closure.


Another example illustrating the expressive power of $S4_u$ is the formula


(9.12)   $(p \neq \bot) \wedge (p \sqsubseteq \mathbf{C}\overline{p}) \wedge (\overline{p} \sqsubseteq \mathbf{C}p)$


defining a nonempty set $p$ such that both $p$ and its complement $\overline{p}$ have empty
interiors. In fact, the second and the third conjuncts say that both $p$ and $\overline{p}$ consist
of boundary points only.








Regions = regular closed sets. In qualitative spatial KR&R, it is quite
often assumed that spatial terms can only be interpreted by regular closed (or
open) sets of topological spaces (e.g., Davis, 1990; Asher and Vieu, 1995;
Gotts, 1996). One of the reasons for imposing this restriction is to exclude
from consideration such ‘pathological’ sets as in (9.12). Recall that a set $X$ is
regular closed if $X = \mathbf{CI}X$, which clearly does not hold for any set satisfying
(9.12). Another reason is to ensure that the space occupied by a physical
body is homogeneous in the sense that it does not contain parts of ‘different
dimensionality.’ For example, the one-dimensional curve in Fig. 9.2 disappears
from the subset $X$ of the Euclidean plane $(\mathbb{R}^2, \mathbf{I})$ if we form the set $\mathbf{CI}X$.
The latter is regular closed because $\mathbf{CICI}X = \mathbf{CI}X$, for every $X$ and every
topological space.

In this section, we will consider several fragments of $S4_u$ dealing with regular
closed sets. From now on we will call such sets regions.
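The identity $\mathbf{CICI}X = \mathbf{CI}X$ and the way regular closure discards ‘thin’ parts can be checked exhaustively on a small finite space. The sketch below assumes a hypothetical three-point Aleksandrov space (a root that sees two leaves); the names `regular_closure`, `U` and `R` are illustrative.

```python
from itertools import combinations

def interior(R, X):
    return frozenset(x for x in X if all(y in X for y in R[x]))

def closure(R, U, X):
    return U - interior(R, U - X)

def regular_closure(R, U, X):
    """CI X: the closure of the interior of X."""
    return closure(R, U, interior(R, X))

# Hypothetical space: 'root' sees the leaves 'a' and 'b' (plus reflexivity).
U = frozenset({"root", "a", "b"})
R = {"root": {"root", "a", "b"}, "a": {"a"}, "b": {"b"}}

subsets = [frozenset(c) for k in range(len(U) + 1)
           for c in combinations(sorted(U), k)]

# CICI X = CI X for every subset X of this space.
idempotent = all(regular_closure(R, U, regular_closure(R, U, X))
                 == regular_closure(R, U, X) for X in subsets)

# The regions (regular closed sets) of the space.
regions = {regular_closure(R, U, X) for X in subsets}
```

In this space the set `{"root"}` has empty interior, so its regular closure is empty: it plays the role of the lower-dimensional part that disappears.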


RCC-8. Perhaps the best known language devised for speaking about regions
is RCC-8, which was introduced in the area of Geographical Information
Systems (Egenhofer and Franzosa, 1991; Smith and Park, 1992) and as a
decidable subset of Region Connection Calculus RCC (Randell et al., 1992).
The syntax of RCC-8 contains region variables $r, s, \dots$ and eight binary
predicates:


■ DC($r$, $s$): regions $r$ and $s$ are disconnected,
■ EC($r$, $s$): $r$ and $s$ are externally connected,
■ EQ($r$, $s$): $r$ and $s$ are equal,
■ PO($r$, $s$): $r$ and $s$ partially overlap,
■ TPP($r$, $s$): $r$ is a tangential proper part of $s$,
■ NTPP($r$, $s$): $r$ is a nontangential proper part of $s$,
■ the inverses of the last two, TPPi($r$, $s$) and NTPPi($r$, $s$),


which can be combined using the Boolean connectives.

The arguments of the RCC-8 predicates, that is, region variables, are in- 
terpreted by regular closed sets—i.e., regions—of topological spaces. The 
following was shown in Renz, 1998 and Renz and Nebel, 1999: 


THEOREM 9.3 (i) Every satisfiable RCC-8 formula is satisfiable in any of
$\mathbb{R}^n$, for $n \geq 1$ (with region variables interpreted by connected regions only,
if $n \geq 3$).

(ii) The satisfiability problem for RCC-8 formulas in topological models is 
NP-complete. 


The expressive power of RCC-8 is rather limited. It only operates with ‘sim- 
ple’ regions and does not distinguish between connected and disconnected ones, 
regions with and without holes, etc. (Egenhofer and Herring, 1991). Nor can 
RCC-8 represent complex relations between more than two regions. Consider, 
for example, three countries (say, Russia, Lithuania and Poland) such that not 
only each one of them is adjacent to the others, but there is a point where all the 
three meet (see Fig. 9.3). It can easily be shown that a ternary predicate like 


(9.13) EC3(Russia, Lithuania, Poland) 


cannot be expressed in RCC-8. 

Figure 9.3. Russia, Lithuania and Poland.


To analyse possible ways of extending RCC-8, it will be convenient to view it
as a fragment of $S4_u$ (that RCC-8 can be embedded into $S4_u$ was first shown by
Bennett (1994); we present here a slightly different embedding and the purpose
of changes will become clear in the context of BRCC-8 and RC). Observe first
that, for every spatial variable $p$, the spatial term


(9.14)   $\mathbf{CI}p$


is interpreted as a region (i.e., a regular closed set) in every topological model.
So with every region variable $r$ of RCC-8 we can associate the spatial term
$\varrho_r = \mathbf{CI}p_r$, where $p_r$ is a spatial variable representing $r$, and then translate the
RCC-8 predicates into spatial formulas by taking


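$\mathsf{DC}(r,s) = (\varrho_r \sqcap \varrho_s = \bot)$,

$\mathsf{EC}(r,s) = \neg(\varrho_r \sqcap \varrho_s = \bot) \wedge (\mathbf{I}\varrho_r \sqcap \mathbf{I}\varrho_s = \bot)$,

$\mathsf{EQ}(r,s) = (\varrho_r \sqsubseteq \varrho_s) \wedge (\varrho_s \sqsubseteq \varrho_r)$,

$\mathsf{PO}(r,s) = \neg(\mathbf{I}\varrho_r \sqcap \mathbf{I}\varrho_s = \bot) \wedge \neg(\varrho_r \sqsubseteq \varrho_s) \wedge \neg(\varrho_s \sqsubseteq \varrho_r)$,

$\mathsf{TPP}(r,s) = (\varrho_r \sqsubseteq \varrho_s) \wedge \neg(\varrho_s \sqsubseteq \varrho_r) \wedge \neg(\varrho_r \sqsubseteq \mathbf{I}\varrho_s)$,

$\mathsf{NTPP}(r,s) = (\varrho_r \sqsubseteq \mathbf{I}\varrho_s) \wedge \neg(\varrho_s \sqsubseteq \varrho_r)$


(TPPi and NTPPi are the mirror images of TPP and NTPP, respectively). It
should be clear that as a result we obtain the following: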


THEOREM 9.4 An RCC-8 formula is satisfiable in a topological space iff its
translation into $S4_u$ defined above is satisfiable in the same topological space.
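The translation can be read as a recipe for deciding which RCC-8 relation holds between two given regions of a finite space. The sketch below does this for nonempty regular closed sets of a hypothetical five-point Aleksandrov space; the space, the helper `region` and the function `rcc8` are assumptions made for illustration.

```python
def interior(R, X):
    return frozenset(x for x in X if all(y in X for y in R[x]))

def closure(R, U, X):
    return U - interior(R, U - X)

def rcc8(R, r, s):
    """Name the RCC-8 relation between nonempty regular closed sets r and s,
    following the translation of the eight predicates into S4_u."""
    ir, is_ = interior(R, r), interior(R, s)
    if not (r & s):
        return "DC"
    if not (ir & is_):
        return "EC"
    if r == s:
        return "EQ"
    if r <= s:
        return "NTPP" if r <= is_ else "TPP"
    if s <= r:
        return "NTPPi" if s <= ir else "TPPi"
    return "PO"

# Hypothetical space: roots x, y; x sees leaves a, b; y sees leaves b, c.
U = frozenset({"x", "y", "a", "b", "c"})
R = {"x": {"x", "a", "b"}, "y": {"y", "b", "c"},
     "a": {"a"}, "b": {"b"}, "c": {"c"}}

def region(points):
    """The region CI X generated by a set of points."""
    return closure(R, U, interior(R, frozenset(points)))

A, B, C = region({"a"}), region({"b"}), region({"c"})
AB, BC = region({"a", "b"}), region({"b", "c"})
```

For instance, `A` and `B` share only the root `x`, which lies in neither interior, so they are externally connected, while `B` is a tangential proper part of `AB` because the root `y` is not interior to `AB`.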


This translation shows that in RCC-8 any two regions can be related only in
terms of truth/falsity of atomic spatial formulas of the form


$(\varrho_1 \sqcap \varrho_2 = \bot)$, $(\mathbf{I}\varrho_1 \sqcap \mathbf{I}\varrho_2 = \bot)$, $(\varrho_1 \sqsubseteq \varrho_2)$ and $(\varrho_1 \sqsubseteq \mathbf{I}\varrho_2)$,


where $\varrho_1$ and $\varrho_2$ are atomic region terms, that is, spatial terms of the form (9.14).
This observation suggests two ways of increasing the expressive power of
RCC-8:








(i) by allowing the formation of complex region terms from atomic region 
terms, and 


(ii) by allowing more ways of relating them (i.e., richer languages of atomic 
spatial formulas). 


From now on we will not distinguish between a region variable $r$ and the
atomic region term $\varrho_r$ representing it, and use expressions like $\mathsf{DC}(r,s)$ and
$(\varrho_r \sqcap \varrho_s = \bot)$ as synonymous.

BRCC-8. The language BRCC-8 of Wolter and Zakharyaschev, 2000 (see
also Balbiani et al., 2004) extends RCC-8 in direction (i). It uses the same
eight binary predicates as RCC-8 and allows not only atomic regions but also
their intersections, unions and complements. For instance, in BRCC-8 we can
express the fact that a region (say, the Swiss Alps) is the intersection of two
other regions (Switzerland and the Alps in this case):


(9.15)   $\mathsf{EQ}(\mathit{SwissAlps}, \mathit{Switzerland} \sqcap \mathit{Alps})$.


We can embed BRCC-8 into $S4_u$ by using almost the same translation as in
the case of RCC-8. The only difference is that now, since Boolean combinations
of regular closed sets are not necessarily regular closed, we should prefix
compound spatial terms with $\mathbf{CI}$. In this way we can obtain, for example, the
spatial term

$\mathbf{CI}(\mathit{Switzerland} \sqcap \mathit{Alps})$


representing the Swiss Alps. In the same manner we can treat other set-theoretic
operations, which leads us to the following definition of Boolean region terms:


$\varrho ::= \mathbf{CI}p \mid \mathbf{CI}\overline{\varrho} \mid \mathbf{CI}(\varrho_1 \sqcap \varrho_2) \mid \mathbf{CI}(\varrho_1 \sqcup \varrho_2)$.


Thus BRCC-8 can be regarded as a syntactically restricted subset of $S4_u$-formulas.
It follows from the above definition that Boolean region terms denote
precisely the members of the well-known Boolean algebra of regular closed sets.

It is of interest to note that Boolean region terms do not increase the complexity
of reasoning in arbitrary topological models: the satisfiability problem
for BRCC-8 formulas is still NP-complete. However, it becomes PSPACE-complete
if all intended models are based on connected spaces (BRCC-8 can
distinguish between connected and disconnected spaces because we can express
that regions $r_1$ and $r_2$ are nonempty non-tangential proper parts of a region
$s \neq \top$, and the union of $r_1$ and $r_2$ is precisely $s$:


$\bigwedge_{i=1,2} \bigl(\neg\mathsf{DC}(r_i, r_i) \wedge \mathsf{NTPP}(r_i, s)\bigr) \wedge \mathsf{NTPP}(s, s') \wedge \mathsf{EQ}(r_1 \sqcup r_2, s)$.


To satisfy this formula, it suffices to take a discrete topological space with three
points. But if these constraints are satisfied then both $s$ and its complement are
open and nonempty, which means that the space cannot be connected.)

On the other hand, BRCC-8 allows some restricted comparisons of more 
than two regions as, e.g., in (9.15). Nevertheless, as we shall see below, ternary 
relations like (9.13) are still unavailable in BRCC-8: they require different 
ways of comparing regions; see (ii). 


RC. Egenhofer and Herring (1991) proposed to relate any two regions in
terms of the 9-intersections: a $3 \times 3$ matrix specifying emptiness/nonemptiness
of all (nine) possible intersections of the interiors, boundaries and exteriors of
the regions. Recall that, for a region $X$, these three disjoint parts of the space
$(U, \mathbf{I})$ can be represented as


$\mathbf{I}X$, $X \cap (U - \mathbf{I}X)$ and $U - X$,


respectively. By generalising this approach to any finite number of regions, we
obtain the fragment RC of $S4_u$: its terms are defined as follows


$\varrho ::= \mathbf{CI}p \mid \mathbf{CI}\overline{\varrho} \mid \mathbf{CI}(\varrho_1 \sqcap \varrho_2) \mid \mathbf{CI}(\varrho_1 \sqcup \varrho_2)$,
$\tau ::= \varrho \mid \mathbf{I}\varrho \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \tau_1 \sqcup \tau_2$,








and spatial formulas are constructed from atoms of the form $\tau_1 \sqsubseteq \tau_2$ using the
Booleans (as in the full $S4_u$). In other words, in RC we can define relations over
regions in terms of inclusions of sets formed by using arbitrary set-theoretic
operations on regions and their interiors. However, nested applications of the
topological operators are not allowed (an example where such applications are
required can be found below).

Clearly, both RCC-8 and BRCC-8 are fragments of RC. Moreover, unlike 
BRCC-8, the language of RC allows us to consider more complex relations 
between regions. For instance, the ternary relation required in (9.13) can now 
be defined as follows: 


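$\mathsf{EC3}(r_1, r_2, r_3) = \neg(\varrho_{r_1} \sqcap \varrho_{r_2} \sqcap \varrho_{r_3} = \bot) \wedge (\mathbf{I}\varrho_{r_1} \sqcap \mathbf{I}\varrho_{r_2} = \bot) \wedge$
$(\mathbf{I}\varrho_{r_2} \sqcap \mathbf{I}\varrho_{r_3} = \bot) \wedge (\mathbf{I}\varrho_{r_3} \sqcap \mathbf{I}\varrho_{r_1} = \bot)$.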


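Another, more abstract, example is the formula


$\varrho_1 \sqcap \dots \sqcap \varrho_i \sqcap \mathbf{I}\varrho'_1 \sqcap \dots \sqcap \mathbf{I}\varrho'_j \sqcap \overline{\varrho''_1} \sqcap \dots \sqcap \overline{\varrho''_k} \sqcap \overline{\mathbf{I}\varrho'''_1} \sqcap \dots \sqcap \overline{\mathbf{I}\varrho'''_m} \neq \bot$


which says that


regions $\varrho_1, \dots, \varrho_i$ meet somewhere inside the region occupied
jointly by all $\varrho'_1, \dots, \varrho'_j$, but outside the regions $\varrho''_1, \dots, \varrho''_k$ and
not inside $\varrho'''_1, \dots, \varrho'''_m$.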


Although RC is more expressive than both RCC-8 and BRCC-8, reasoning 
in this language is still of the same computational complexity (Gabelaia et al., 
2005a): 


THEOREM 9.5 The satisfiability problem for RC-formulas in arbitrary topo- 
logical models is NP-complete. 


The proof follows from the fact that every satisfiable RC-formula can be satisfied
in an Aleksandrov space that is induced by a disjoint union of $n$-brooms,
i.e., quasi-orders of the form depicted in Fig. 9.4. Topological spaces of this
kind have a rather primitive structure satisfying the following property:


(rc) only the roots of $n$-brooms can be boundary points, and the minimal
neighbourhood of every boundary point (i.e., the $n$-broom containing
this point) must contain at least one internal point and at least one
external point.


Figure 9.4. Satisfying EC(Russia, Poland) and EC3(Russia, Lithuania, Poland) in 2- and
3-brooms.


For example, spatial formula (9.12) cannot be satisfied in a model with this 
property, and so it is not in RC. 

Given a satisfiable RC-formula $\varphi$, we can always satisfy it in a model of this
kind the size of which is polynomial (in fact, quadratic) in the length of $\varphi$,
and so we have a nondeterministic polynomial time algorithm. Actually, the
proof is a straightforward generalisation of the complexity proof for BRCC-8
(Wolter and Zakharyaschev, 2000): the only difference is that in the case of
BRCC-8 it was sufficient to consider 2-brooms (which were called forks). This
means, in particular, that ternary relation (9.13), which is satisfiable only in a
model with an $n$-broom, for $n \geq 3$, is indeed not expressible in BRCC-8 (see
Fig. 9.4).
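The role of the broom width can be illustrated by brute force. The sketch below builds a single $n$-broom as a quasi-order (a root that sees $n$ leaves), enumerates its nonempty regions, and searches for three regions with a common point and pairwise disjoint interiors. It only inspects a single broom rather than disjoint unions of brooms, and the names `broom`, `regions`, `ec3` and `ec3_satisfiable` are hypothetical.

```python
from itertools import combinations, product

def broom(n):
    """Quasi-order of an n-broom: root 0 sees leaves 1..n (plus reflexivity)."""
    R = {0: set(range(n + 1))}
    R.update({i: {i} for i in range(1, n + 1)})
    return frozenset(range(n + 1)), R

def interior(R, X):
    return frozenset(x for x in X if all(y in X for y in R[x]))

def closure(R, U, X):
    return U - interior(R, U - X)

def regions(U, R):
    """All nonempty regular closed sets CI X of the space."""
    items = sorted(U)
    subs = (frozenset(c) for k in range(len(items) + 1)
            for c in combinations(items, k))
    return {closure(R, U, interior(R, X)) for X in subs} - {frozenset()}

def ec3(R, r1, r2, r3):
    """The three regions share a point, but their interiors are pairwise disjoint."""
    i1, i2, i3 = (interior(R, r) for r in (r1, r2, r3))
    return (bool(r1 & r2 & r3)
            and not (i1 & i2) and not (i2 & i3) and not (i3 & i1))

def ec3_satisfiable(n):
    U, R = broom(n)
    regs = regions(U, R)
    return any(ec3(R, *triple) for triple in product(regs, repeat=3))
```

In a 3-broom the three regions consisting of the root together with one leaf each meet at the root and have disjoint interiors; a 2-broom has only two leaves, so no such triple exists.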


REMARK 9.6 In topological terms, n-brooms are examples of so-called door 
spaces where every subset is either open or closed. However, the modal theory 
of n-brooms defines a wider and more interesting topological class known as 
submaximal spaces in which every dense subset is open. Submaximal spaces 
have been around since the early 1960s and have generated interesting and 
challenging problems in topology. For a survey and a systematic study of these 
spaces see (Arhangel’skii and Collins, 1995) and references therein. 


$RC^{max}$. One could go even further in direction (ii) and impose no restrictions
whatsoever on the ways of relating Boolean atomic region terms. This leads us
to the maximal fragment $RC^{max}$ of $S4_u$ in which spatial terms are interpreted
by regular closed sets. The syntax of its spatial terms is defined as follows:


$\tau ::= \mathbf{CI}p \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \tau_1 \sqcup \tau_2 \mid \mathbf{I}\tau \mid \mathbf{C}\tau$,


and spatial formulas are constructed as in $S4_u$. To understand the difference
between $RC^{max}$ and RC, consider the following $RC^{max}$-formula


(9.16)   $(\mathbf{CI}q_1 \sqcap \overline{\mathbf{ICI}q_1} \neq \bot) \wedge \bigl((\mathbf{CI}q_1 \sqcap \overline{\mathbf{ICI}q_1}) \sqsubseteq \mathbf{C}(\mathbf{ICI}q_1 \sqcap \mathbf{CI}q_2 \sqcap \overline{\mathbf{ICI}q_2})\bigr)$.


It says that the boundary of $\mathbf{CI}q_1$ is not empty and that every neighbourhood
of every point in this boundary contains an internal point of $\mathbf{CI}q_1$ that belongs
to the boundary of $\mathbf{CI}q_2$ (compare with property (rc) above). The simplest
Aleksandrov model satisfying this formula is of depth 2 (whereas $n$-brooms are
of depth 1); it is shown in Fig. 9.5.


Figure 9.5. Model satisfying formula (9.16).


The price we have to pay for this expressivity is that the complexity of $RC^{max}$
is the same as the complexity of full $S4_u$ (Gabelaia et al., 2005a):


THEOREM 9.7 The satisfiability problem for $RC^{max}$-formulas is PSPACE-complete.


This logic can also be regarded as a fragment of $S4_u$ with all variables
interpreted by regular closed sets.


$S4_u$ with component counting. There are many ways of increasing the
expressive power of $S4_u$ itself. For instance, Pratt-Hartmann (2002) proposes
an extension with component counting. We remind the reader that a subset
$X$ of a topological space $(U, \mathbf{I})$ is said to be connected if there do not exist
two sets $Y_1, Y_2 \subseteq U$ such that $X \subseteq Y_1 \cup Y_2$, $X \cap Y_i \neq \emptyset$, for $i = 1, 2$, and
$X \cap \mathbf{C}Y_1 \cap \mathbf{C}Y_2 = \emptyset$. Intuitively, connected sets can be thought of as consisting
of ‘one piece.’ Then a component of a set $X$ is a maximal connected subset of
$X$. For example, the subset $X$ of the Euclidean plane $(\mathbb{R}^2, \mathbf{I})$ in Fig. 9.2 has
only one component and so is connected, whereas its regular closure $\mathbf{CI}X$ is
not connected and has two components.

The language TCC of Pratt-Hartmann, 2002 extends the set of atomic spatial
formulas of $S4_u$ with the following construct:


$c^{\le k}\tau$,
where $\tau$ is a spatial term (as defined in (9.9)) and $k \in \mathbb{N}$. The formula $c^{\le k}\tau$ is true iff
the interpretation of $\tau$ has at most $k$ components. In particular, $c^{\le 1}\tau$ is true iff $\tau$
is connected and $\neg c^{\le k}\tau$ is true iff $\tau$ has at least $k + 1$ components (sometimes
denoted by $c^{\ge k+1}\tau$). This extension turns out to be quite expressive: for
example, the TCC-formula


$\bigl(c^{\le 1}p_1 \wedge c^{\le 1}p_2 \wedge (p_1 \sqcap p_2 \neq \bot)\bigr) \to c^{\le 1}(p_1 \sqcup p_2)$
says that the union of two connected intersecting sets is also connected (here
$\varphi_1 \to \varphi_2$ is an abbreviation for $\neg\varphi_1 \vee \varphi_2$). As usual, the increased
expressivity results in higher complexity. The following was proved by
Pratt-Hartmann (2002):


THEOREM 9.8 The satisfiability problem for TCC-formulas in topological 
models is NEXPTIME-complete for the binary coding of the numerical 
parameters. 


To conclude this section, we summarise the inclusions between the (propositional)
spatial languages introduced above:


$\text{RCC-8} \subsetneq \text{BRCC-8} \subsetneq \text{RC} \subsetneq RC^{max} \subsetneq S4_u \subsetneq \text{TCC}$.


3.3 Logics of distance spaces 


Suppose now that we are interested in spatial logics that are capable of
reasoning about spatial models based on various distance spaces, i.e., models
of the form


(9.17)   $\mathfrak{M} = (\mathfrak{D}, p_1^{\mathfrak{M}}, p_2^{\mathfrak{M}}, \dots)$,


where $\mathfrak{D} = (A, d)$ is a distance space introduced in Sec. 3.1. If $\mathfrak{D}$ is actually
a metric space then we can still use $S4_u$ or its fragments interpreted in the
topological space induced by $\mathfrak{D}$. However, the topological interior and closure
operators $\mathbf{I}_d$ and $\mathbf{C}_d$ only deal with points that are ‘infinitely close’ to the
given spatial object (cf. the definitions in Sec. 3.1). Being equipped with the
distance function over the space, we can extend (or replace) qualitative topological
reasoning by means of reasoning about distances between spatial objects.
In addition to (or instead of) operators interpreted by the topological interior
and closure, we can introduce operators capable of expressing, say, that the
distance from a region $X$ to a region $Y$ is not more than 17.

Following the ‘operator-based’ approach from topological logic, we then arrive
at languages with ‘bounded quantifiers’ like $\exists^{<a}$ ‘somewhere at distance
$< a$’ or $\forall^{<b}_{>a}$ ‘everywhere within distance $d$ for $a < d < b$’, where $a$ and $b$ are
some numbers from $\mathbb{R}^{\ge 0}$ (or rather $\mathbb{Q}^{\ge 0}$ to avoid the problem of representing
the reals).

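Given a spatial model $\mathfrak{M}$ of the form (9.17), we interpret such operators in
the natural way:


$(\exists^{<a}\tau)^{\mathfrak{M}} = \{x \in A \mid \exists y\ (d(x,y) < a \wedge y \in \tau^{\mathfrak{M}})\}$,

$(\exists^{>a}\tau)^{\mathfrak{M}} = \{x \in A \mid \exists y\ (d(x,y) > a \wedge y \in \tau^{\mathfrak{M}})\}$,

$(\forall^{<b}_{>a}\tau)^{\mathfrak{M}} = \{x \in A \mid \forall y\ (a < d(x,y) < b \to y \in \tau^{\mathfrak{M}})\}$,
etc.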








Before introducing formal languages based on these operators, it is worth having
a closer look at some of them. One might be tempted to assume that the
‘doughnut’ operator $\exists^{<b}_{>a}$ can be expressed via $\exists^{<b}$ and $\exists^{>a}$ by the equivalence
$\exists^{<b}_{>a}\tau = \exists^{<b}\tau \sqcap \exists^{>a}\tau$. Fig. 9.6 shows that this is not the case. In the figure,
we depict the regions $\exists^{<b}X$, $\exists^{>a}X$ and $\exists^{<b}_{>a}X$, for fixed values of $a$ and $b$, for the
region $X$ consisting of the two black boxes. In particular, of all points on the plane
only those in the white diamond in Fig. 9.6 (b) do not belong to $\exists^{>a}X$, and
$\exists^{<b}_{>a}X$ is $\exists^{<b}X$ without the three white areas in Fig. 9.6 (c). As follows from this example,
$\exists^{<b}_{>a}X \subsetneq \exists^{>a}X \sqcap \exists^{<b}X$.
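The failure of the equivalence can be reproduced on a small finite metric space. The sketch below is an illustration under stated assumptions, not the configuration of Fig. 9.6: the space consists of the integer points 0 to 10 on a line with $d(x,y) = |x - y|$, the set `X` and the parameters `a`, `b` are chosen by hand, and the function names are hypothetical.

```python
def somewhere_lt(pts, d, X, b):
    """Points having some y in X at distance < b."""
    return {x for x in pts if any(d(x, y) < b for y in X)}

def somewhere_gt(pts, d, X, a):
    """Points having some y in X at distance > a."""
    return {x for x in pts if any(d(x, y) > a for y in X)}

def doughnut(pts, d, X, a, b):
    """Points having a single y in X with a < d(x, y) < b."""
    return {x for x in pts if any(a < d(x, y) < b for y in X)}

def d(x, y):
    return abs(x - y)

# Hypothetical finite metric space and parameters.
pts = set(range(11))
X = {0, 10}
a, b = 2, 5

lhs = doughnut(pts, d, X, a, b)
rhs = somewhere_gt(pts, d, X, a) & somewhere_lt(pts, d, X, b)
```

The point 0 belongs to `rhs` because it is close to one element of `X` (itself) and far from another (the point 10), but no single element of `X` lies at a distance strictly between `a` and `b` from it, so it is not in `lhs`.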

In our discussion of languages for distance spaces we will formulate most 
results for metric spaces only. The reader is invited to consult the literature cited 
below to obtain detailed information about the behaviour of those languages 
over more general distance spaces and over Euclidean spaces. 




















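Full ‘modal’ logic of distance spaces. The logic $\mathcal{MS}$ of distance spaces
with the operators $\exists^{=a}$, $\exists^{<a}$, $\exists^{>a}$, $\exists^{<b}_{>a}$ (and their duals $\forall^{=a}$, $\forall^{<a}$, etc.) interpreted
as defined above was introduced and analysed in (Kutz et al., 2003).
Formally the spatial terms of this logic are defined as follows:


$\tau ::= p_i \mid \{\ell_i\} \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \exists^{=a}\tau \mid \exists^{<a}\tau \mid \exists^{>a}\tau \mid \exists^{<b}_{>a}\tau$,


where $a, b \in \mathbb{Q}^{\ge 0}$ with $a < b$, and the $\ell_i$ are location constants (or nominals)
interpreted by single points, so that the $\{\ell_i\}$ are interpreted by singleton sets. As
before, the formulas are constructed from atoms of the form $\tau_1 \sqsubseteq \tau_2$ using the
Booleans ($\neg$, $\wedge$, etc.); we use $\tau_1 = \tau_2$ as an abbreviation for $(\tau_1 \sqsubseteq \tau_2) \wedge (\tau_2 \sqsubseteq \tau_1)$.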

Figure 9.6. Distance operators.


Considering first the expressive power of $\mathcal{MS}$, one can show that over models
of the form (9.17) based on metric spaces it is as expressive as the two-variable
fragment of first-order logic with equality, individual constants, unary predicate
symbols $p_i(x)$ corresponding to spatial variables, and binary relation symbols


$d(x,y) < a$, $d(x,y) = a$,


for $a \in \mathbb{Q}^{\ge 0}$, which are interpreted in metric spaces in the obvious way
(Kutz et al., 2003). Moreover, the translation between the two languages is
effective.

This expressive completeness result indicates already that $\mathcal{MS}$ is indeed
quite expressive. Analysing its computational properties, Kutz et al. (2003)
proved that the satisfiability problem for $\mathcal{MS}$-formulas over arbitrary metric
spaces is undecidable. In fact, the following much stronger theorem holds:


THEOREM 9.9 No algorithm can decide whether an arbitrarily given $\mathcal{MS}$-formula
all of whose distance operators are of the form $\exists^{<a}_{>0}$, for $a \in \mathbb{N}^{>0}$, is
satisfiable in a model based on a metric space.


The proof of this result is based on the observation that one can ‘enforce’ the
$\mathbb{N} \times \mathbb{N}$ grid using the ‘punctured’ centres of circles provided by $\exists^{<a}_{>0}$.

It is worth noting that in contrast to the undecidability result above, the
satisfiability problem for $\mathcal{MS}$-formulas in arbitrary distance spaces and symmetric
distance spaces is decidable. This observation follows from the standard
translation of $\mathcal{MS}$ into the two-variable fragment of first-order logic (which
is decidable in NEXPTIME) and the fact that reflexivity and symmetry of relations
can be expressed in first-order logic using two variables only. This
argument does not work for satisfiability in metric spaces because the triangle
inequality cannot be expressed in first-order logic with two variables.








The logic with $\exists^{\le a}$ and $\exists^{>a}$. Without the doughnut operators $\mathcal{MS}$ often
becomes decidable and has the finite model property with respect to the
intended models, that is, a formula satisfiable in a (possibly infinite) metric
model is satisfiable in a finite metric model. For example, denote by $\mathcal{MS}^{\le,>}$
the fragment of $\mathcal{MS}$ with spatial terms of the form


$\tau ::= p_i \mid \{\ell_i\} \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \exists^{\le a}\tau \mid \exists^{>a}\tau$,


where $a \in \mathbb{Q}^{\ge 0}$. Kutz et al. (2003) proved that this logic has the finite model
property and that the satisfiability problem for its formulas is decidable in
NEXPTIME under the unary coding of parameters. Actually, this result was
improved in Wolter and Zakharyaschev, 2005b:


THEOREM 9.10 The satisfiability problem for $\mathcal{MS}^{\le,>}$-formulas in metric
spaces is EXPTIME-complete under the unary coding of numeric parameters
in distance operators.


The complexity of $\mathcal{MS}^{\le,>}$-satisfiability under the binary coding of parameters
remains an open research problem.








The logic with $\exists^{<a}$ and $\exists^{\le a}$. Another interesting fragment of $\mathcal{MS}$ is based
on the operators $\exists^{<a}$ and $\exists^{\le a}$ (Wolter and Zakharyaschev, 2003). The spatial
terms of the resulting logic $\mathcal{MS}^{<,\le}$ are defined as follows:


$\tau ::= p_i \mid \{\ell_i\} \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \exists^{<a}\tau \mid \exists^{\le a}\tau$,


where $a \in \mathbb{Q}^{>0}$ (by including 0 in the parameter set we would not increase
the expressive power of the language, but some formulations may become
awkward). The logic $\mathcal{MS}^{<,\le}$ has the finite model property, and EXPTIME-completeness
can now be proved even for the binary coding of parameters:


THEOREM 9.11 The satisfiability problem for $\mathcal{MS}^{<,\le}$-formulas in metric
spaces is EXPTIME-complete under both unary and binary coding of parameters
in distance operators.


The crucial observation in the proof of this result is that (modulo the interpretation
of nominals) the logic turns out to be complete with respect to tree metric spaces,
a feature not shared by the languages considered above. Completeness with
respect to tree metric spaces makes this language also amenable to tableau-based
decision procedures (Wolter and Zakharyaschev, 2003) which are not yet
available for the language $\mathcal{MS}^{\le,>}$. An intriguing fact is that the fragments
with only strict operators $\exists^{<a}$ and only non-strict ones $\exists^{\le a}$ behave similarly in
the following sense:


THEOREM 9.12 Let $\varphi$ be a formula whose only distance operators are of the
form $\exists^{<a}$. Let $\varphi'$ be the result of replacing occurrences of $\exists^{<a}$ in $\varphi$ with $\exists^{\le a}$.
Then $\varphi$ is satisfiable in a metric space iff $\varphi'$ is satisfiable in a metric space.











Of course, in the theorem above one cannot always choose the same metric space. In fact, it is worth noting that the language $\mathcal{MS}^{\leq,<}$ is properly more expressive than its fragments with only the operators $\exists^{\leq a}$ and $\exists^{<a}$, respectively. Namely, using both operators we can say that the distance between two sets $p$ and $q$ is precisely $a$:

$$(p \sqcap \exists^{\leq a} q \neq \bot) \land (p \sqcap \exists^{<a} q = \bot).$$
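To make the role of the two operators concrete, here is a minimal Python sketch that evaluates them in a finite metric space and checks the condition "some point of $p$ lies within distance $\leq a$ of $q$, but none lies within distance $< a$". The function names and the four-point example space are assumptions made for illustration; they do not come from the cited papers.

```python
def exists_lt(points, d, a, tau):
    """Extension of the strict operator: points with some tau-point at distance < a."""
    return {x for x in points if any(d(x, y) < a for y in tau)}


def exists_le(points, d, a, tau):
    """Extension of the non-strict operator: points with some tau-point at distance <= a."""
    return {x for x in points if any(d(x, y) <= a for y in tau)}


def distance_is_exactly(points, d, a, p, q):
    """Some p-point is within <= a of q, and no p-point is within < a of q."""
    return bool(p & exists_le(points, d, a, q)) and not (p & exists_lt(points, d, a, q))


# Hypothetical finite metric space: four points of the real line.
points = {0, 1, 3, 7}
d = lambda x, y: abs(x - y)
p, q = {0, 1}, {3, 7}

# The closest pair is (1, 3), so the distance between p and q is 2.
print(distance_is_exactly(points, d, 2, p, q))
```

With only one of the two operators, the test for $a = 2$ could not separate "at most 2" from "less than 2", which is the point of the remark above.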























‘Modal’ logics of metric and topology. The logics of metric spaces we have considered so far can represent certain knowledge about distances between spatial objects, but are not suitable for reasoning about the induced topology. To see this for $\mathcal{MS}^{\leq,>}$ and $\mathcal{MS}^{\leq,<}$, recall that both of them have the finite model property: every satisfiable formula is satisfiable in a finite metric space. Thus, these languages cannot distinguish between finite and infinite metric spaces. On the other hand, every finite metric space induces the trivial topology in which every set is both closed and open. It follows that every satisfiable formula is satisfiable in a metric space with a trivial topology, and therefore the languages cannot represent anything interesting about the topology induced by a metric space. A similar argument can be used to show that $\mathcal{MS}$ itself cannot be used for representing topological knowledge.
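The claim that a finite metric space induces the trivial topology can be checked mechanically. The following Python sketch (the helper names and the four-point sample space are assumptions for illustration) computes the interior of every subset using open balls and confirms that each subset equals its own interior, so every set is open and hence also closed.

```python
import math
from itertools import chain, combinations


def open_ball(points, d, x, r):
    """Open ball of radius r around x."""
    return {y for y in points if d(x, y) < r}


def interior(points, d, subset):
    """Points of `subset` that have some open ball contained in `subset`."""
    positive = [d(x, y) for x in points for y in points if x != y]
    # Balls shrink with the radius, so testing the smallest positive
    # distance is enough: if any ball fits, this one does.
    r = min(positive, default=1.0)
    return {x for x in subset if open_ball(points, d, x, r) <= subset}


# Hypothetical finite metric space: four points of the Euclidean plane.
points = [(0, 0), (1, 0), (0, 2), (3, 3)]
subsets = chain.from_iterable(combinations(points, k) for k in range(len(points) + 1))
all_open = all(interior(points, math.dist, set(s)) == set(s) for s in subsets)
print(all_open)
```

The key step is that a ball whose radius is the smallest positive distance contains only its centre, so every singleton is open.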

To be able to reason about both metric and topology we can combine one of the metric logics above with one of the topo-logics considered in Sec. 3.2. Only one such combination has been investigated in detail so far: the extension of $\mathcal{S}4_u$ with the metric operators $\exists^{<a}$ and $\exists^{\leq a}$ of $\mathcal{MS}^{\leq,<}$ (Wolter and Zakharyaschev, 2005a). The terms of the resulting language, which we call $\mathcal{MT}$, are defined as follows:























$$\tau ::= p_i \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \exists^{<a}\tau \mid \exists^{\leq a}\tau \mid \mathbf{C}\tau,$$

where $a \in \mathbb{Q}^{>0}$. Notice that $\mathcal{MT}$ does not contain nominals $\{\ell_i\}$. Although it would definitely be useful to have nominals in the language, we do not include them in the signature because nothing is known about the algorithmic properties of $\mathcal{MT}$ extended by nominals. Unlike its parts $\mathcal{S}4_u$ and $\mathcal{MS}^{\leq,<}$, the logic $\mathcal{MT}$ does not have the finite model property with respect to metric spaces because the topology induced by a finite metric space is trivial. For example, the term

$$\overline{p} \sqcap \mathbf{C}p$$

is not satisfiable in any finite metric model, yet it is satisfiable in every Euclidean space.

It turns out, however, that the intended metric models for this logic can be represented in the form of relational structures (or Kripke frames), which can be regarded as partial descriptions of metric models. This representation theorem (in fact a generalisation of the McKinsey and Tarski (1944) representation theorem for topological spaces) reduces reasoning with infinite metric models to reasoning with finite relational models and can be used to show the following:


THEOREM 9.13 The satisfiability problem for $\mathcal{MT}$-formulas in metric spaces is EXPTIME-complete under the binary coding of parameters.


To understand the interaction between the topological and distance operators, 
it is worth taking a look at the axioms required to describe this interaction. It 
turns out that to axiomatise the MT -formulas that are valid in all metric models, 
we need the axioms governing the behaviour of the distance operators, those 
for the topological operators, and only two axioms where both are involved: 


Cr lia et 
The logic $\mathcal{MT}$ is also decidable over the real line, where it has been considered in the framework of reasoning about real-time systems (Hirshfeld and Rabinovich, 1999). It becomes undecidable, however, when we take $\mathbb{R}^2$ as the intended metric space.


Closer operator. The representation of knowledge about distances in (fragments of) $\mathcal{MS}$ is restricted to absolute distances. In particular, in $\mathcal{MS}$ it is not possible to compare distances between spatial objects without estimating the absolute values of the distances. A purely comparative approach to representing and reasoning about distance spaces would need predicates like ‘X is closer to Y than it is to Z’, which are quite common in everyday life (‘the body was in the middle of the room, rather closer to the door than to the window’). In the framework of spatial logics we have considered so far, this predicate can be represented using the binary closer operator $\leftleftarrows$ with the following interpretation in distance models $\mathfrak{M} = (\mathfrak{D}, p_1^{\mathfrak{M}}, p_2^{\mathfrak{M}}, \dots)$:


(9.18) $(\tau_1 \leftleftarrows \tau_2)^{\mathfrak{M}} = \{x \in \Delta \mid d(x, \tau_1^{\mathfrak{M}}) < d(x, \tau_2^{\mathfrak{M}})\}.$


In other words, $\tau_1 \leftleftarrows \tau_2$ is (interpreted by) the set containing those objects of $\Delta$ that are ‘closer’ (or ‘more similar’) to $\tau_1$ than to $\tau_2$. Formally, the terms of the language $\mathcal{CSL}$ of comparative distances (or similarity) are defined as follows:

$$\tau ::= p_i \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \tau_1 \leftleftarrows \tau_2.$$

The language $\mathcal{CSL}$ turns out to be quite powerful. Using it we can express the interior (and so the closure) operator by taking

$$\mathbf{I}\tau = \top \leftleftarrows \overline{\tau}.$$

Indeed, by the definition above, we have

$$(\mathbf{I}\tau)^{\mathfrak{M}} = \{x \in \Delta \mid d(x, \Delta - \tau^{\mathfrak{M}}) > 0\}.$$


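We can also express the existential (and so the universal) modality:

$$\exists\tau = \tau \leftleftarrows \bot,$$

because $d(x, \emptyset) = \infty$. Thus, $\mathcal{CSL}$ contains $\mathcal{S}4_u$ and can be regarded as a qualitative spatial formalism for reasoning about comparative distances and topology. One more interesting operator is

$$\tau_1 \leftrightarrows \tau_2 = \overline{(\tau_1 \leftleftarrows \tau_2)} \sqcap \overline{(\tau_2 \leftleftarrows \tau_1)},$$

which defines the set of points located at the same distance from $\tau_1$ and $\tau_2$.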
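As a small illustrative example, consider the formula

(9.19) $(p \sqsubseteq (q \leftleftarrows r)) \land (q \sqsubseteq (r \leftleftarrows p)) \land (r \sqsubseteq (p \leftleftarrows q)) \land (p \neq \bot).$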


One can readily check that it is satisfiable in a three-point non-symmetric model, say, in the one depicted below, where the distance from $x$ to $y$ is the length of the shortest directed path from $x$ to $y$.

[Figure: a directed graph on three points labelled p, q and r.]

On the other hand, it can be satisfied in the following subspace of $\mathbb{R}$:

[Figure: points on the real line labelled q, r, p, q, r, p.]




The following result has been obtained in Sheremet et al., 2006:


THEOREM 9.14 The satisfiability problem for $\mathcal{CSL}$-formulas in metric spaces is EXPTIME-complete.


Investigating the algorithmic properties of the combination of $\mathcal{CSL}$ with fragments of $\mathcal{MS}$, in order to facilitate reasoning about topology, comparative distances, and absolute distances in one formalism, is a challenging research problem. Define the terms of the language $\mathcal{CMS}$ by taking

$$\tau ::= p \mid \{\ell\} \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \exists^{<a}\tau \mid \exists^{\leq a}\tau \mid \tau_1 \leftleftarrows \tau_2,$$

where $a \in \mathbb{Q}^{\geq 0}$. $\mathcal{CSL}$ enriched with nominals can represent Voronoi tessellations of various spaces. For example, let location constants $\ell_1, \ell_2, \ell_3$ be interpreted by the points $a_1, a_2, a_3$ of $\mathbb{R}^2$ in Fig. 9.7. Then the $\mathcal{CMS}$-terms

$$\{\ell_i\} \leftleftarrows (\{\ell_j\} \sqcup \{\ell_k\}), \quad \text{for } \{i, j, k\} = \{1, 2, 3\},$$

define the Voronoi tessellation of $\mathbb{R}^2$ corresponding to the set $\{a_1, a_2, a_3\}$.
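The Voronoi reading of these terms can be tried out on a finite sample of the plane. In the Python sketch below, the grid, the coordinates of the three sites and all names are assumptions chosen for illustration (the coordinates of Fig. 9.7 are not known). Each cell is computed as the set of grid points strictly closer to one site than to the other two.

```python
import math
from itertools import product


def dist_to_set(x, s):
    """Euclidean distance from point x to the set s (infinity if s is empty)."""
    return min((math.dist(x, y) for y in s), default=math.inf)


def closer(domain, t1, t2):
    """Points of `domain` strictly closer to t1 than to t2."""
    return {x for x in domain if dist_to_set(x, t1) < dist_to_set(x, t2)}


# Assumed finite sample of the plane and hypothetical sites a1, a2, a3.
grid = set(product(range(7), repeat=2))
sites = {1: (0, 0), 2: (6, 0), 3: (3, 6)}

# Cell i: points closer to site i than to the union of the other two sites.
cells = {
    i: closer(grid, {sites[i]}, {sites[j] for j in sites if j != i})
    for i in sites
}

# Points in no cell lie on a boundary: they are equally close to two nearest sites.
boundary = grid - set().union(*cells.values())
```

Because the comparison is strict, the cells are pairwise disjoint and the boundary points belong to none of them.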








Figure 9.7. Closer operator and Voronoi tessellation. 


Nothing is known about the algorithmic properties of $\mathcal{CMS}$ interpreted over arbitrary metric spaces. However, if one considers metric spaces satisfying the min condition

$$d(X, Y) = \min\{d(x, y) \mid x \in X,\ y \in Y\},$$

for all sets $X$ and $Y$, then the topology induced by the metric space is trivial again, and $\mathcal{CMS}$ can represent knowledge about comparative and absolute distances only (note that, by definition, $d(X, Y) = \inf\{d(x, y) \mid x \in X,\ y \in Y\}$). Then we have the following result (Sheremet et al., 2005a, 2005b):


THEOREM 9.15 The satisfiability problem for $\mathcal{CMS}$-formulas in metric spaces with the min-condition is EXPTIME-complete under the binary coding of parameters.


Rather unexpectedly, over the real line $\mathbb{R}$ the logic $\mathcal{CSL}$ turns out to be undecidable, which can be proved by reduction of the (undecidable) Hilbert's tenth problem on the existence of an algorithm solving arbitrary Diophantine equations; see, e.g., Barwise, 1977 and references therein. A proof can be found in Sheremet et al., 2005b.


4. Temporal logics


Now we briefly remind the reader of the two basic propositional temporal logics that will be used for speaking about the temporal dimension of spatio-temporal models introduced in Sec. 6: the linear temporal logic $\mathcal{LTL}$ and its branching time extension $\mathcal{BTL}$ (a variant of the well-known computation tree logic $\mathcal{CTL}^*$).


4.1 Linear temporal logic $\mathcal{LTL}$


Temporal logic, as opposed to first-order logic, is an approach to reasoning 
about time (and computation) using temporal connectives and without explicit 
quantification over time. Its most popular variant, the propositional linear temporal logic $\mathcal{LTL}$, is successfully applied in model checking as well as program verification and specification (e.g., Clarke et al., 2000; Manna and Pnueli, 1992; Manna and Pnueli, 1995).

The intended flow of time for $\mathcal{LTL}$ is any strict linear order $(W, <)$ with time points $w \in W$ and the precedence relation $<$. In what follows we will be mainly interested in $(\mathbb{N}, <)$ and arbitrary finite flows of time. $\mathcal{LTL}$-formulas are constructed from propositional variables $p_0, p_1, \dots$ using the Booleans and the binary temporal operator $\mathcal{U}$ (‘until’), the intended meaning of which is as follows:


■ $\varphi \,\mathcal{U}\, \psi$ stands for ‘$\varphi$ holds true until $\psi$ holds.’











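Other temporal connectives like $\Diamond_F$ (‘sometime in the future’), $\Box_F$ (‘always in the future’), and $\bigcirc$ (‘at the next moment’) can be defined via $\mathcal{U}$:

$$\Diamond_F \varphi = \top \,\mathcal{U}\, \varphi, \qquad \Box_F \varphi = \neg\Diamond_F \neg\varphi \qquad \text{and} \qquad \bigcirc\varphi = \bot \,\mathcal{U}\, \varphi.$$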





It should be noted that we adopt the ‘strict’ interpretation of temporal operators, i.e., $\Diamond_F$, $\Box_F$ and $\mathcal{U}$ do not include the present. We will use the abbreviations $\Box^+_F \varphi$ and $\Diamond^+_F \varphi$ for $\Box_F \varphi \land \varphi$ and $\Diamond_F \varphi \lor \varphi$, respectively. (Note also that ‘past’ operators like ‘since’ and ‘sometime in the past’ can be added to the language of $\mathcal{LTL}$ as well. Here we only deal with the ‘future fragment’ of $\mathcal{LTL}$, as this restriction does not influence any of the results throughout.)

To evaluate $\mathcal{LTL}$-formulas in a flow of time $\mathfrak{F} = (W, <)$, we have to specify first at which time points the propositional variables hold. An $\mathcal{LTL}$-model is a structure of the form

$$\mathfrak{M} = (\mathfrak{F}, p_0^{\mathfrak{M}}, p_1^{\mathfrak{M}}, \dots),$$

where $p_i^{\mathfrak{M}} \subseteq W$ for all $i$. The truth-relation $(\mathfrak{M}, w) \models \varphi$, or simply $w \models \varphi$ if $\mathfrak{M}$ is understood (which says that an $\mathcal{LTL}$-formula $\varphi$ holds at moment $w$ in $\mathfrak{M}$), is defined as follows (we omit the clauses for the Booleans):

■ $w \models p_i$ iff $w \in p_i^{\mathfrak{M}}$,

■ $w \models \varphi \,\mathcal{U}\, \psi$ iff there is $v > w$ such that $v \models \psi$ and $u \models \varphi$ for all $u \in (w, v)$,


where $(w, v) = \{u \in W \mid w < u < v\}$. Other temporal operators (expressible via $\mathcal{U}$) are interpreted according to their meaning. For example,

■ $w \models \bigcirc\varphi$ iff $w + 1 \models \varphi$ (where $w + 1$ is the immediate successor of $w$),

■ $w \models \Diamond_F \varphi$ iff there is $v > w$ such that $v \models \varphi$.





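A formula $\varphi$ is satisfiable if there is a model $\mathfrak{M}$ over $(\mathbb{N}, <)$ and a time point $n \in \mathbb{N}$ such that $(\mathfrak{M}, n) \models \varphi$. We say that $\varphi$ is finitely satisfiable if there is a finite strict linear order $\mathfrak{F}$ and a model $\mathfrak{M}$ over it such that $(\mathfrak{M}, n) \models \varphi$ for some $n$ in $\mathfrak{F}$.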
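The truth conditions above translate directly into a small evaluator for finite flows of time. The following Python sketch is an illustration under stated assumptions: formulas are encoded as nested tuples, a model is a list whose entry at index `w` is the set of variables true at moment `w`, and all names are hypothetical. Only the strict ‘until’ is primitive; the other operators are defined from it as in the text.

```python
TRUE = ("true",)
FALSE = ("not", TRUE)


def var(name):
    return ("var", name)


def until(f, g):
    return ("until", f, g)


def eventually(f):          # sometime in the future: true until f
    return until(TRUE, f)


def always(f):              # always in the future: not eventually not f
    return ("not", eventually(("not", f)))


def nxt(f):                 # at the next moment: false until f
    return until(FALSE, f)


def holds(model, w, f):
    """Truth of formula f at moment w of a finite linear model."""
    op = f[0]
    if op == "true":
        return True
    if op == "var":
        return f[1] in model[w]
    if op == "not":
        return not holds(model, w, f[1])
    if op == "and":
        return holds(model, w, f[1]) and holds(model, w, f[2])
    if op == "until":
        # Strict reading: look for v > w with f[2] at v and f[1] strictly between.
        for v in range(w + 1, len(model)):
            if holds(model, v, f[2]):
                return True
            if not holds(model, v, f[1]):
                return False
        return False
    raise ValueError(f"unknown operator {op!r}")


model = [{"p"}, {"p"}, {"q"}, set()]
p, q = var("p"), var("q")
print(holds(model, 0, until(p, q)))
```

Because the operators are strict, `always(f)` holds vacuously at the last moment of a finite flow and `nxt(f)` never holds there.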

The following results are due to Sistla and Clarke (1985): 





THEOREM 9.16 The satisfiability problem for $\mathcal{LTL}$-formulas is PSPACE-complete. The problem whether an $\mathcal{LTL}$-formula is finitely satisfiable is PSPACE-complete as well.


This complexity result might suggest that the expressive power of $\mathcal{LTL}$ is rather limited. Surprisingly enough, this is not the case. According to the famous Kamp theorem (Kamp, 1968), the propositional temporal language with both ‘until’ and ‘since’ is as expressive as the monadic first-order language over $(\mathbb{N}, <)$ (which of course is considerably more succinct than $\mathcal{LTL}$).

We will also consider the fragment $\mathcal{LTL}_\Box$ of $\mathcal{LTL}$ containing only $\Box_F$ and $\Diamond_F$ as its temporal operators. The following results are due to Ono and Nakamura (1980) and Sistla and Clarke (1985):


THEOREM 9.17 The satisfiability problem for $\mathcal{LTL}_\Box$-formulas is NP-complete. The problem whether an $\mathcal{LTL}_\Box$-formula is finitely satisfiable is NP-complete as well.









































4.2 Branching time temporal logic BTL 


The temporal logic considered above is not able to express the following statements (due to Aristotle):

■ it is necessary that there will be a sea-battle tomorrow,

■ it is possible that there will be a sea-battle tomorrow.

$\mathcal{LTL}$ can only say

■ $\bigcirc$sea-battle, i.e., there will be a sea-battle tomorrow.

In other words, it does not distinguish between possible, actual, or necessary future developments. A natural way to formalise assertions of this sort is to add two more operators $\mathsf{A}$ and $\mathsf{E}$ to the temporal language and understand them as quantifiers over ‘possible histories.’ For example, by interpreting $\mathsf{E}$ as ‘it is possible that’ and $\mathsf{A}$ as ‘it is necessary that,’ we can express Aristotle's two statements by the formulas $\mathsf{A}\bigcirc$sea-battle and $\mathsf{E}\bigcirc$sea-battle, respectively.

Numerous extensions of $\mathcal{LTL}$ by operators of this kind have been introduced in various disciplines, in particular, computer science and artificial intelligence (Lamport, 1980; Clarke and Emerson, 1981; Emerson and Halpern, 1986) or philosophy (Prior, 1968); for more references and discussions see Thomason, 1984 and Gabbay et al., 2000. Here we only outline the essential ideas using the simple extension of $\mathcal{LTL}$ with $\mathsf{A}$ and $\mathsf{E}$; it will be called $\mathcal{BTL}$, branching temporal logic.

Having fixed the language, we need to choose time structures that allow for non-trivial interpretations. Clearly, if the flow of time is linear then at every moment the future is fixed, and so both $\mathsf{A}\varphi$ and $\mathsf{E}\varphi$ are equivalent to $\varphi$. The flows of time we need should be able to represent different evolutions of history. Since, on the other hand, it is natural to assume that, in contrast to the future, the past is fixed, trees as defined below appear to be perfect structures for modelling different histories (in particular, they correspond to the discrete tree of evolutions (9.5) of spatial transition systems).

A tree is a flow of time $\mathfrak{F} = (W, <)$ containing a point $r$, called the root of $\mathfrak{F}$, for which $W = \{v \mid r < v\} \cup \{r\}$, and such that for every $w \in W$, the set $\{v \mid v < w\}$ is well-founded and (strictly) linearly ordered by $<$. A history in $\mathfrak{F}$ is a maximal linearly $<$-ordered subset of $W$. Finally, an $\omega$-tree is a tree in which every history is order isomorphic to $(\mathbb{N}, <)$.

By a branching time model we understand a structure

$$\mathfrak{B} = (\mathfrak{F}, \mathcal{H}, p_0^{\mathfrak{B}}, p_1^{\mathfrak{B}}, \dots),$$

where $\mathfrak{F} = (W, <)$ is an $\omega$-tree, $\mathcal{H}$ is a set of histories in $\mathfrak{F}$ (the set of possible flows of time in the model), and $p_i^{\mathfrak{B}} \subseteq W$ for all $i$. Formulas are evaluated relative to pairs $(h, w)$ consisting of an actual history $h \in \mathcal{H}$ and a time point $w \in h$. In such a pair $(h, w)$, the temporal operators are interpreted along the actual history $h$ as in the linear time framework, while the operators $\mathsf{E}$ and $\mathsf{A}$ quantify over the set of all histories

$$\mathcal{H}(w) = \{h' \in \mathcal{H} \mid w \in h'\}$$

coming through $w$. More precisely, the truth-relation $\models$ between models $\mathfrak{B}$ with pairs $(h, w)$ and $\mathcal{BTL}$-formulas $\varphi$ is defined inductively in the following way (we omit the clauses for the Booleans):

■ $(h, w) \models p_i$ iff $w \in p_i^{\mathfrak{B}}$,

■ $(h, w) \models \varphi \,\mathcal{U}\, \psi$ iff there is $v \in h$ such that $v > w$, $(h, v) \models \psi$ and $(h, u) \models \varphi$ for all $u \in (w, v)$,

■ $(h, w) \models \mathsf{E}\varphi$ iff there is $h' \in \mathcal{H}(w)$ such that $(h', w) \models \varphi$,

■ $(h, w) \models \mathsf{A}\varphi$ iff $(h', w) \models \varphi$ for all $h' \in \mathcal{H}(w)$.


Note that propositional variables are assumed to have no temporal aspect in the sense that their truth-values at $(h, w)$ do not depend on the actual history $h$. We say that a $\mathcal{BTL}$-formula $\varphi$ is satisfiable if there exists a branching time model $\mathfrak{B}$ such that $(\mathfrak{B}, h, w) \models \varphi$ for some history $h \in \mathcal{H}$ and some time point $w \in h$.

The branching time model defined above reflects the ‘Ockhamist view’ of time. We refer the reader to Burgess, 1979, Zanardo, 1996, Gabbay et al., 2000 and Reynolds, 2002 for more information about this and related approaches. Here we only note that our branching time logic is closely related to the computational tree logics $\mathcal{CTL}$ and $\mathcal{CTL}^*$ that are widely used in model checking and program verification and specification (Clarke and Emerson, 1981; Emerson and Halpern, 1986; Clarke et al., 2000).

It might seem more natural to quantify with $\mathsf{E}$ and $\mathsf{A}$ over the set of all histories in the tree rather than its subset $\mathcal{H}$. But then we would be forced to accept possibly unintended histories in $\mathfrak{F}$ as possible flows of time. Here is an example of a formula satisfiable in a branching time model as defined above, but not in a branching time model in which $\mathcal{H}$ is the set of all histories. The formula is a conjunction of the following three $\mathcal{BTL}$-formulas:

(9.20) $\mathsf{P}(\mathit{Scotland}, \mathit{UK})$,
(9.21) $\mathsf{A}\Diamond_F\Box_F\,\mathsf{EC}(\mathit{Scotland}, \mathit{UK})$,
(9.22) $\mathsf{A}\Box^+_F\big(\mathsf{P}(\mathit{Scotland}, \mathit{UK}) \to \mathsf{E}\bigcirc\mathsf{P}(\mathit{Scotland}, \mathit{UK})\big)$.


The first formula means that at present Scotland is part of the U.K. The second 
says that in all possible histories, there will be a time starting from which 
Scotland will be externally connected to the U.K. And the last formula claims 
that in all possible histories, it is always the case that if Scotland is part of 
the U.K. then it is still possible that it will remain in U.K. for at least one 
more day. (Since we do not have a combined spatio-temporal language yet, the 
RCC-8 predicates P (Scotland, UK) and EC (Scotland, UK) should be regarded 
as a propositional variable and its negation, respectively.) 

The following result can be obtained using a reduction to satisfiability in $\mathcal{CTL}^*$ (Hodkinson et al., 2001):


THEOREM 9.18 The satisfiability problem for $\mathcal{BTL}$-formulas is decidable in 2EXPTIME.


It seems that the lower bound for the computational complexity of this prob- 
lem is still unknown. 


REMARK 9.19 Similarly to RCC-8, instead of time points one can take extended time entities, i.e., intervals, as primitives. This approach to temporal representation and reasoning reflects the fact that certain assertions can be evaluated only at periods of time (e.g., ‘John often drinks beer’). It was developed by Allen (1983; 1984), who observed, in particular, that the relative positions of any two intervals $i$ and $j$ of a strict linear order can be described by precisely one of the thirteen basic interval relations: before(i, j), meets(i, j), overlaps(i, j), during(i, j), starts(i, j), finishes(i, j), their inverses (before(j, i), meets(j, i), etc.), and equal(i, j).
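That exactly one of the thirteen relations holds for any two intervals can be seen from a direct classification by endpoints. The Python sketch below is illustrative only: it assumes intervals are pairs `(start, end)` of comparable values with `start < end`, and the function names and output strings are hypothetical.

```python
def _basic(x, y):
    """Return the basic relation holding from x to y, or None if there is none."""
    (a, b), (c, d) = x, y
    if b < c:
        return "before"
    if b == c:
        return "meets"
    if a < c < b < d:
        return "overlaps"
    if c < a and b < d:
        return "during"
    if a == c and b < d:
        return "starts"
    if c < a and b == d:
        return "finishes"
    return None


def allen_relation(i, j):
    """Name the unique basic interval relation between proper intervals i and j."""
    if i == j:
        return "equal(i, j)"
    # Try the six basic relations first, then their inverses.
    for x, y, args in ((i, j, "(i, j)"), (j, i, "(j, i)")):
        name = _basic(x, y)
        if name is not None:
            return name + args
    raise ValueError("expected proper intervals (start < end)")


print(allen_relation((0, 2), (1, 3)))
```

Enumerating all pairs of intervals with endpoints in a four-element linear order already produces all thirteen answers.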

We will not consider interval temporal logics in this chapter and refer the 
interested reader to Vilain et al., 1989, Blackburn, 1992, Gabbay et al., 2000 
and Goranko et al., 2004. 


5. Combination principles


We have defined what the intended models of spatio-temporal logics (yet to be constructed) should look like. We have also identified a stock of available spatial and temporal logics to be integrated into spatio-temporal formalisms. However, we have not yet discussed how the component logics are supposed to interact with each other.

The expressive power (and consequently the computational complexity) of 
combined spatio-temporal formalisms obviously depends on three parameters: 


■ the expressiveness of the spatial component,

■ the expressiveness of the temporal component, and

■ the interaction between the two components allowed in the combined logic.


Regardless of the chosen component languages, the minimum requirement for a spatio-temporal combination to be useful is the following:


(PC) The language should be able to express changes in time of the truth-values of purely spatial propositions.


Languages satisfying (PC) can capture, for instance, some aspects of the continuity of change principle (e.g., Cohn, 1997) such as example (A) from Sec. 2: ‘if two clouds are disconnected now, then at the next moment they either remain disconnected or become externally connected.’ A natural way to express this principle is to encode it into the following ‘spatio-temporal formula’:


(A) $\mathsf{DC}(\mathit{cloud}_1, \mathit{cloud}_2) \to \bigcirc\mathsf{DC}(\mathit{cloud}_1, \mathit{cloud}_2) \lor \bigcirc\mathsf{EC}(\mathit{cloud}_1, \mathit{cloud}_2).$


We may also need to impose some constraints on possible movements of spatial objects by comparing their positions at different moments of time. For example, the continuity principle above can be further refined by saying that the current cloud's position overlaps with its positions at the next two moments, which requires a spatio-temporal formula of the form

(9.23) $\mathsf{O}(\mathit{cloud}, \bigcirc\mathit{cloud}) \land \mathsf{O}(\mathit{cloud}, \bigcirc\bigcirc\mathit{cloud}),$

where the predicate $\mathsf{O}(r, s)$ means that regions $r$ and $s$ have at least one common interior point; it can be expressed as a disjunction of all RCC-8 relations but DC and EC (see Fig. 9.8, where $\bigcirc X$ at moment $n$ denotes the state of $X$ at moment $n + 1$).

Figure 9.8. Temporal operators on regions.

The difference between (A) and (9.23) is that in the former case we apply temporal operators to spatial formulas, while in the latter to regions.

Consider now example (G) from Sec. 2: ‘it will be raining over every part of England ever and ever again.’ This gives rise to the formula

(G) $\mathsf{P}(\mathit{England}, \Box_F\Diamond_F \mathit{Rain}),$

where $\mathsf{P}(r, s) = \mathsf{TPP}(r, s) \lor \mathsf{NTPP}(r, s) \lor \mathsf{EQ}(r, s)$. Formula (G) can be understood as follows: all bits (points) of England will infinitely often occur in region Rain, but not necessarily all at the same time. Note that the formula

$$\Box_F\Diamond_F\, \mathsf{P}(\mathit{England}, \mathit{Rain})$$

means that it will be raining over the whole of England ever and ever again.
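The difference between the two readings shows up already in a very small model. The Python sketch below rests on simplifying assumptions that are not in the text: the rain pattern repeats forever with a fixed period, regions are plain sets of named places, and ‘part of’ is read as set inclusion. Under periodicity, a place is rained on infinitely often exactly when it is rained on at some phase of the period, and a proposition holds infinitely often exactly when it holds at some phase.

```python
def infinitely_often(period):
    """Places covered at some phase of a pattern that repeats forever."""
    return set().union(*period)


def part_of(r, s):
    """Simplified 'part of': plain set inclusion."""
    return r <= s


england = {"north", "south"}
rain = [{"north"}, {"south"}]   # hypothetical pattern: rain alternates day by day

# Reading (G): every bit of England is rained on again and again.
g_holds = part_of(england, infinitely_often(rain))

# Second reading: again and again, the whole of England is under rain at once.
whole_holds = any(part_of(england, state) for state in rain)

print(g_holds, whole_holds)
```

In this model the first reading is true and the second is false, since the north and the south are never under rain on the same day.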

There is an essential difference between examples (9.23) and (G). In the former, we want to control the movements of objects over a fixed finite number of steps, while in the latter example we impose restrictions on their ‘asymptotic’ behaviour. This leads us to two fundamental principles, which will be called the local spatial object change principle (LOC) and the asymptotic spatial object change principle (AOC).


(LOC) The language should be able to express changes or evolutions of spatial objects over some fixed finite periods of time.


(AOC) The language should be able to express changes or evolutions of spatial objects over the whole duration of time.


In logical terms, (PC) refers to the change of truth-values of propositions, while (LOC) and (AOC) refer to the change of extensions of predicates.

As we shall see later on in this chapter, different combination principles 
result in spatio-temporal logics of different expressive power and computational 
complexity. 


6. Combining topo-logics with temporal logics 


In this section we introduce and discuss various ways of combining topo-logics and temporal logics. First we consider combinations with (fragments of) linear temporal logic $\mathcal{LTL}$ and then with branching time temporal logic $\mathcal{BTL}$.


6.1 Combinations with linear temporal logic $\mathcal{LTL}$


First we construct ‘maximal’ combinations with (fragments of) $\mathcal{LTL}$ meeting all three combination principles (PC), (LOC) and (AOC), and see that such a straightforward approach results in undecidable logics. Then we systematically weaken the component languages and their interaction. The result is a hierarchy of spatio-temporal logics whose complexity ranges from NP via PSPACE, EXPSPACE and 2EXPSPACE to undecidable. All omitted proofs and further details can be found in Gabelaia et al., 2005a.

As outlined in the introduction, we represent the motion of spatial objects in time using the following kind of ‘snapshot’ models. A topological-temporal model (a tt-model, for short) is a pair of the form $\mathfrak{M} = (\mathfrak{T}, \mathfrak{V})$, where $\mathfrak{T} = (U, \mathbb{I})$ is a topological space, and $\mathfrak{V}$, a valuation, is a map associating with every spatial variable $p$ and every time point $n \in \mathbb{N}$ a set $\mathfrak{V}(p, n) \subseteq U$, the ‘space’ occupied by $p$ at moment $n$. Such a pair $\mathfrak{M} = (\mathfrak{T}, \mathfrak{V})$ is simply a shorthand for the representation (9.3) of spatio-temporal models as a sequence of spatial models:

$$\mathfrak{M}_0 = (\mathfrak{T}, \mathfrak{V}(p_0, 0), \mathfrak{V}(p_1, 0), \dots), \quad \mathfrak{M}_1 = (\mathfrak{T}, \mathfrak{V}(p_0, 1), \mathfrak{V}(p_1, 1), \dots), \quad \dots$$

Combinations with (PC), (LOC) and (AOC). A ‘maximalist’ approach to constructing spatio-temporal logics is to allow unrestricted applications of the Booleans, the topological and the temporal operators to form spatio-temporal terms.

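Denote by $\mathcal{LTL} \times \mathcal{S}4_u$ the spatio-temporal language given by the following definition:

$\tau ::= p \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \tau_1 \sqcup \tau_2 \mid \mathbf{I}\tau \mid \mathbf{C}\tau \mid \tau_1 \,\mathcal{U}\, \tau_2,$
(9.24) $\varphi ::= \tau_1 \sqsubseteq \tau_2 \mid \neg\varphi \mid \varphi_1 \land \varphi_2 \mid \varphi_1 \lor \varphi_2 \mid \varphi_1 \,\mathcal{U}\, \varphi_2.$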





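Expressions of the form $\tau$ and $\varphi$ will be called $\mathcal{LTL} \times \mathcal{S}4_u$ terms and formulas, respectively. Most of the languages we consider in this subsection are fragments of $\mathcal{LTL} \times \mathcal{S}4_u$.

As before, we can introduce the temporal operators $\Diamond_F$, $\Box_F$ and $\bigcirc$ applicable to $\mathcal{LTL} \times \mathcal{S}4_u$ formulas. Moreover, these operators can now be used to form $\mathcal{LTL} \times \mathcal{S}4_u$ terms: for example,

$$\Diamond_F \tau = \top \,\mathcal{U}\, \tau, \qquad \Box_F \tau = \overline{\Diamond_F \overline{\tau}} \qquad \text{and} \qquad \bigcirc\tau = \bot \,\mathcal{U}\, \tau,$$

where the intended meaning of $\bot$ and $\top$ is the empty set and the whole space, respectively.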

$\mathcal{LTL} \times \mathcal{S}4_u$ formulas are supposed to represent propositions speaking about moving spatial objects represented by $\mathcal{LTL} \times \mathcal{S}4_u$ terms. The intended truth-values of propositions in tt-models can vary in time, but do not depend on points of spaces. But how are we to understand ‘temporalised’ terms?

The meaning of $\bigcirc\tau$ should be clear: at moment $n$, it denotes the space occupied by $\tau$ at the next moment $n + 1$ (see (9.23) and Fig. 9.8). The formula


(F) $\mathsf{EQ}(\bigcirc\bigcirc\mathit{EU}, \mathit{EU} \sqcup \mathit{Romania} \sqcup \mathit{Bulgaria})$


formalises sentence (F) from Sec. 2. It says that in two years the EU (as it is today) will be extended with Romania and Bulgaria. Note that the formula $\bigcirc\bigcirc\mathsf{EQ}(\mathit{EU}, \mathit{EU} \sqcup \mathit{Romania} \sqcup \mathit{Bulgaria})$ has a different meaning because the EU may expand or shrink in a year. It is also not hard to formalise sentences (D), (E) and (H):

(D) EQ(OX,Y) > 7AEQ(Y,OY), 

(E) pEQ(OEurope, Europe), 

(H) 7(EQ(Earth, W U L) AEC(W, L)) A P(W, OW) > P(OL, L). 
































The intended interpretation of terms of the form $\Diamond_F \tau$ and $\Box_F \tau$ is a bit more sophisticated. It reflects the standard temporal meanings of propositions ‘$\Diamond_F\, x \in \tau$’ and ‘$\Box_F\, x \in \tau$’, for all points $x$ in the topological space:

■ at moment $n$, term $\Diamond_F \tau$ is interpreted as the union of all spatial extensions of $\tau$ at moments $m > n$;

■ at moment $n$, term $\Box_F \tau$ is interpreted as the intersection of all spatial extensions of $\tau$ at moments $m > n$.





For example, consider Fig. 9.8 with moving cloud $X$ depicted in it at three consecutive moments of time, and suppose $X$ does not change after $n + 2$. Then $\Diamond_F X$ at $n$ is the union of $\bigcirc X$ and $\bigcirc\bigcirc X$ at $n$, and $\Box_F X$ at $n$ is the intersection of $\bigcirc X$ and $\bigcirc\bigcirc X$ at $n$ (i.e., $\bigcirc X$).

As another example, let us interpret the term $\Box_F\Diamond_F \mathit{Rain}$ occurring in formula (G):


























■ $\Diamond_F \mathit{Rain}$ at moment $n$ occupies the space where it will be raining at some time points $m > n$ (which may be different for different places). $\Box_F \mathit{Rain}$ at $n$ occupies the space where it will always be raining after $n$.

■ $\Box_F\Diamond_F \mathit{Rain}$ at $n$ is the space where it will be raining ever and ever again after $n$, while $\Diamond_F\Box_F \mathit{Rain}$ comprises all places where it will always be raining starting from some future moments of time.














Now, what can be the meaning of $\mathit{Rain} \,\mathcal{U}\, \mathit{Snow}$? Similarly to the readings of $\Box_F \tau$ and $\Diamond_F \tau$ above, we adopt the following definition:

■ at moment $n$, the spatial extension of $\tau_1 \,\mathcal{U}\, \tau_2$ consists of those points $x$ of the topological space for which there is $m > n$ such that $x$ belongs to $\tau_2$ at moment $m$ and $x$ is in $\tau_1$ at all $k$ whenever $n < k < m$.


The past counterpart of $\mathcal{U}$, i.e., the operator ‘since’ $\mathcal{S}$, can be used to say that the part of Russia that has remained Russian since 1917 is not connected to the part of Germany (Königsberg) that became Russian after the Second World War (Kaliningrad):


$$\mathsf{DC}(\mathit{Russia} \,\mathcal{S}\, \mathit{Russian\ Empire},\ \mathit{Russia} \,\mathcal{S}\, \mathit{Germany}).$$


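Summing up, the valuation $\mathfrak{V}$ in tt-models can be inductively extended to arbitrary $\mathcal{LTL} \times \mathcal{S}4_u$ terms:

■ $\mathfrak{V}(\overline{\tau}, n) = U - \mathfrak{V}(\tau, n)$, $\quad \mathfrak{V}(\tau_1 \sqcap \tau_2, n) = \mathfrak{V}(\tau_1, n) \cap \mathfrak{V}(\tau_2, n)$,

■ $\mathfrak{V}(\mathbf{I}\tau, n) = \mathbb{I}\,\mathfrak{V}(\tau, n)$, $\quad \mathfrak{V}(\mathbf{C}\tau, n) = \mathbb{C}\,\mathfrak{V}(\tau, n)$,

■ $\mathfrak{V}(\tau_1 \,\mathcal{U}\, \tau_2, n) = \bigcup_{m > n} \Big( \mathfrak{V}(\tau_2, m) \cap \bigcap_{k \in (n, m)} \mathfrak{V}(\tau_1, k) \Big)$.

Then we also have:

■ $\mathfrak{V}(\Diamond_F \tau, n) = \bigcup_{m > n} \mathfrak{V}(\tau, m)$, $\quad \mathfrak{V}(\Box_F \tau, n) = \bigcap_{m > n} \mathfrak{V}(\tau, m)$,

■ $\mathfrak{V}(\bigcirc\tau, n) = \mathfrak{V}(\tau, n + 1)$.

The truth-values of $\mathcal{LTL} \times \mathcal{S}4_u$ formulas in tt-models are defined as follows:

■ $(\mathfrak{M}, n) \models \tau_1 \sqsubseteq \tau_2$ iff $\mathfrak{V}(\tau_1, n) \subseteq \mathfrak{V}(\tau_2, n)$,

■ $(\mathfrak{M}, n) \models \neg\varphi$ iff $(\mathfrak{M}, n) \not\models \varphi$,

■ $(\mathfrak{M}, n) \models \varphi_1 \land \varphi_2$ iff $(\mathfrak{M}, n) \models \varphi_1$ and $(\mathfrak{M}, n) \models \varphi_2$,

■ $(\mathfrak{M}, n) \models \varphi_1 \,\mathcal{U}\, \varphi_2$ iff there is $m > n$ such that $(\mathfrak{M}, m) \models \varphi_2$ and $(\mathfrak{M}, k) \models \varphi_1$ for all $k \in (n, m)$.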











An $\mathcal{LTL} \times \mathcal{S}4_u$ formula $\varphi$ is called satisfiable if there exists a tt-model $\mathfrak{M}$ such that $(\mathfrak{M}, n) \models \varphi$ for some time point $n \in \mathbb{N}$.
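The inductive extension of the valuation to temporalised terms can be tried out on a small example. The Python sketch below makes several simplifying assumptions that are not part of the definitions above: the flow of time is cut down to the finite segment $0, \dots, N$, the universe is a finite set, the topological operators are omitted, and terms are encoded as nested tuples with hypothetical names. Only ‘until’ is primitive; the other temporal operators are defined from it.

```python
def ext(model, term, n):
    """Extension of `term` at moment n in a finite model (U, val, N)."""
    U, val, N = model
    op = term[0]
    if op == "top":
        return set(U)
    if op == "var":
        return set(val[term[1]][n])
    if op == "comp":
        return U - ext(model, term[1], n)
    if op == "meet":
        return ext(model, term[1], n) & ext(model, term[2], n)
    if op == "until":
        # prefix = intersection of term[1] over all k strictly between n and m
        result, prefix = set(), set(U)
        for m in range(n + 1, N + 1):
            result |= ext(model, term[2], m) & prefix
            prefix &= ext(model, term[1], m)
        return result
    raise ValueError(f"unknown operator {op!r}")


TOP = ("top",)
BOT = ("comp", TOP)


def dia(t):                 # sometime in the future: top until t
    return ("until", TOP, t)


def box(t):                 # always in the future: complement of dia of complement
    return ("comp", dia(("comp", t)))


def nxt(t):                 # at the next moment: bottom until t
    return ("until", BOT, t)


# Hypothetical moving cloud X on a five-cell universe, at moments 0, 1, 2.
U = {0, 1, 2, 3, 4}
val = {"X": [{0, 1}, {1, 2}, {1, 2, 3}]}
model = (U, val, 2)
X = ("var", "X")

print(ext(model, dia(X), 0), ext(model, box(X), 0))
```

In this example the union of the two future positions is `{1, 2, 3}` and their intersection is `{1, 2}`, which coincides with the position at the next moment, in line with the cloud example discussed earlier.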

Observe that $\mathcal{LTL} \times \mathcal{S}4_u$ contains both $\mathcal{LTL}$ and $\mathcal{S}4_u$. At first sight it may appear that the computational properties of this combination should not be too bad: after all, its spatial and temporal components are PSPACE-complete. It turns out, however, that this is very far from being the case:


THEOREM 9.20 The satisfiability problem for $\mathcal{LTL} \times \mathcal{S}4_u$ formulas in tt-models is $\Sigma_1^1$-complete.


It follows from Theorem 9.20 that if we strengthen the topological component to $\mathcal{TCC}$ (by allowing terms of the form $c^{\leq k}\tau$, see Sec. 3.2), then the satisfiability problem for the resulting language $\mathcal{LTL} \times \mathcal{TCC}$ is also $\Sigma_1^1$-hard. However, Theorem 9.20 is proved by a reduction of the $\Sigma_1^1$-complete recurrent tiling problem (Gabelaia et al., 2005b), and the terms used in its proof can denote arbitrary (i.e., not necessarily connected) sets. It would be interesting to know the complexity of the satisfiability problem for $\mathcal{LTL} \times \mathcal{TCC}$ formulas in tt-models where spatial variables can be interpreted at each time point by connected sets or sets containing at most $k$ connected components for some fixed $k$.

One might conjecture that it is the use of the infinitary operators $\mathcal{U}$, $\Box_F$ and $\Diamond_F$ in the construction of $\mathcal{LTL} \times \mathcal{S}4_u$ terms that makes logics like $\mathcal{LTL} \times \mathcal{S}4_u$ ‘over-expressive.’ Moreover, the whole idea of tt-models based on an infinite flow of time may look counterintuitive in the context of spatio-temporal representation and reasoning (unlike, say, models used to represent the behaviour of reactive computer systems).

There are different approaches to avoid infinity in tt-models:

■ The most radical one is to allow only finite flows of time. A finite tt-model is a triple of the form $\mathfrak{M} = (\mathfrak{T}, \mathfrak{V}, N)$, where $\mathfrak{T}$ is a topological space, $N \in \mathbb{N}$, and $\mathfrak{V}$ is a map associating with every spatial variable $p$ and every time point $n \leq N$ a subset $\mathfrak{V}(p, n)$ of the topological space $\mathfrak{T}$.

■ A more cautious approach is to impose the following finite state assumption on models:

FSA Every spatial variable may have only finitely many possible states (although it may change its states infinitely often).


Say that a (possibly infinite) tt-model $(\mathfrak{T}, \mathfrak{V})$ satisfies FSA if, for every spatial variable $p$, there are finitely many sets $A_1, \dots, A_m$ in the space $\mathfrak{T}$ such that $\{\mathfrak{V}(p, n) \mid n \in \mathbb{N}\} = \{A_1, \dots, A_m\}$. (Such models can be used, for instance, to capture periodic fluctuations due to season or climate changes, say, a daily tide.)

One can actually show (Gabelaia et al., 2005a) that an $\mathcal{LTL} \times \mathcal{S}4_u$ formula is satisfiable in a model with FSA iff it is satisfiable in a model based on a finite (Aleksandrov) topological space.

Unfortunately, none of these ‘finitising’ approaches improves the computational behaviour of the combinations very much. We can even try to weaken the temporal component to $\mathcal{LTL}_\Box$ (by allowing only the temporal operators $\Box_F$ and $\Diamond_F$ in terms and formulas), and still we have:


THEOREM 9.21 (i) The satisfiability problem for $\mathcal{LTL}_\Box \times \mathcal{S}4_u$ formulas in (arbitrary) finite tt-models is undecidable.

(ii) The satisfiability problem for $\mathcal{LTL}_\Box \times \mathcal{S}4_u$ formulas in tt-models satisfying FSA is undecidable.









































However, if we weaken the spatial component further, the combinations can
become decidable, with high but gradually decreasing complexity. Recall the
hierarchy of topo-logics from Sec. 3.2. It suggests that next we should
consider $\mathcal{RC}^{max}$ as the spatial component. In this case
$\mathcal{LTL} \times \mathcal{RC}^{max}$ terms $\tau$ are defined by

    $\tau \; ::= \; \mathbf{CI}p \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \tau_1 \sqcup \tau_2 \mid \mathbf{I}\tau \mid \mathbf{C}\tau \mid \tau_1 \,\mathcal{U}\, \tau_2,$

and $\mathcal{LTL} \times \mathcal{RC}^{max}$ formulas as in (9.24).
Unfortunately, it is an open problem whether the satisfiability problems for
$\mathcal{LTL} \times \mathcal{RC}^{max}$ formulas in arbitrary or in finite
tt-models, or in tt-models satisfying FSA, are decidable.


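Let us move one more step down and denote by
$\mathcal{LTL} \times \mathcal{RC}$ the language given by the following
definition:

    $\varrho \; ::= \; \mathbf{CI}p \mid \mathbf{CI}\overline{\varrho} \mid \mathbf{CI}(\varrho_1 \sqcap \varrho_2) \mid \mathbf{CI}(\varrho_1 \sqcup \varrho_2) \mid \mathbf{CI}(\varrho_1 \,\mathcal{U}\, \varrho_2),$
    $\tau \; ::= \; \varrho \mid \mathbf{I}\varrho \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \tau_1 \sqcup \tau_2,$

and formulas as in (9.24). Expressions of the form $\varrho$ will be called
$\mathcal{LTL} \times \mathcal{RC}$ region terms. Now the complexity of
reasoning decreases indeed: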


THEOREM 9.22 The satisfiability problem for LTL x RC formulas in tt- 
models satisfying FSA, and in those based on (arbitrary) finite flows of time is 
2EXPSPACE-complete. 


The existence of a 2EXPSPACE decision algorithm follows from the fact
that, similarly to the case of the topo-logic $\mathcal{RC}$ (without a
temporal component), it is enough to deal with Aleksandrov topological spaces.
More precisely, it can be shown that an $\mathcal{LTL} \times \mathcal{RC}$
formula is satisfiable in a tt-model with FSA iff it is satisfiable in a
tt-model $(\mathfrak{T}, \mathfrak{V})$ where $\mathfrak{T}$ is an Aleksandrov
space induced by a finite disjoint union of finite brooms (cf. Theorem 9.5).
The lower bound is established by showing that ‘yardsticks’ of
double-exponential length (similar to those used by Stockmeyer, 1974 and
Halpern and Vardi, 1989) can be encoded by $\mathcal{LTL} \times \mathcal{RC}$
formulas of polynomial length. These yardsticks can then be used to encode any
Turing machine computation over double-exponential space.


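By restricting the language further we obtain
$\mathcal{LTL} \times \mathcal{BRCC}$-8:

    $\varphi \; ::= \; \mathsf{Q}(\varrho_1, \varrho_2) \mid \neg\varphi \mid \varphi_1 \wedge \varphi_2 \mid \varphi_1 \vee \varphi_2 \mid \varphi_1 \,\mathcal{U}\, \varphi_2,$

where the $\varrho_i$ are $\mathcal{LTL} \times \mathcal{RC}$ region terms and
$\mathsf{Q}$ ranges over (the translations of) the eight RCC-8 predicates.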


THEOREM 9.23 The satisfiability problem for LTL x BRCC-8 formulas in 
tt-models with FSA, and those that are based on (arbitrary) finite flows of time 
is EXPSPACE-complete. 


The exponential decrease in the complexity is due to the fact that now we
can have a bound (linear in the size of the given formula $\varphi$) on the
size of the brooms inducing the underlying Aleksandrov space of a tt-model in
which an $\mathcal{LTL} \times \mathcal{BRCC}$-8 formula $\varphi$ is
satisfied. The lower bound can be proved by reduction of a $2^n$-corridor
tiling problem.

Finally, by replacing the available region terms of
$\mathcal{LTL} \times \mathcal{BRCC}$-8 with

    $\varrho \; ::= \; \mathbf{CI}p \mid \mathbf{CI}(\varrho_1 \,\mathcal{U}\, \varrho_2)$

we obtain the product $\mathcal{LTL} \times \mathcal{RCC}$-8. The exact
complexity of the satisfiability problem for
$\mathcal{LTL} \times \mathcal{RCC}$-8 formulas in tt-models satisfying FSA,
and in (arbitrary) finite tt-models, is not known. These problems are
PSPACE-hard by Theorem 9.16 and in EXPSPACE by Theorem 9.23.

It is also an open problem whether satisfiability of
$\mathcal{LTL} \times \mathcal{L}$ formulas in (arbitrary) tt-models is
decidable, whenever
$\mathcal{L} \in \{\mathcal{RCC}\text{-}8, \mathcal{BRCC}\text{-}8, \mathcal{RC}, \mathcal{RC}^{max}\}$.


Combinations with (PC) and (LOC). We can try to obtain decidable
spatio-temporal combinations over infinite time lines by omitting the (AOC)
principle and allowing only ‘local control’ of evolutions of spatial objects.
To begin with, let us consider the fragment
$\mathcal{LTL} \circ \mathcal{S}4_u$ of $\mathcal{LTL} \times \mathcal{S}4_u$
with terms of the form:

    $\tau \; ::= \; p \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \tau_1 \sqcup \tau_2 \mid \mathbf{I}\tau \mid \mathbf{C}\tau \mid \bigcirc\tau.$

In other words, $\mathcal{LTL} \circ \mathcal{S}4_u$ does not allow
applications of temporal operators different from $\bigcirc$ to form terms
(but they are still available as formula constructors). This means that the
language still satisfies (LOC), but (AOC) is no longer available.

This fragment is definitely less expressive than full
$\mathcal{LTL} \times \mathcal{S}4_u$. For instance, on the one hand one can
show that $\mathcal{LTL} \circ \mathcal{S}4_u$ formulas do not distinguish
between arbitrary tt-models and those based on Aleksandrov topological spaces.
On the other hand, the set of $\mathcal{LTL} \times \mathcal{S}4_u$ formulas
satisfiable in tt-models based on Aleksandrov spaces is a proper subset of
those satisfiable in arbitrary tt-models. Consider, for example, the
$\mathcal{LTL} \times \mathcal{S}4_u$ formula

    $\Box_F \mathbf{I}p \sqsubseteq \mathbf{I}\Box_F p.$

One can readily see that it is true in every tt-model based on an
Aleksandrov space, but its negation can be satisfied in a tt-model: it
suffices to take the topological space
$\mathfrak{T} = (\mathbb{R}, \mathbb{I})$ with the standard interior operator
$\mathbb{I}$ on the real line, select a sequence $X_n$ of open sets such that
$\bigcap_{n \in \mathbb{N}} X_n$ is not open, e.g., $X_n = (-1/n, 1/n)$, and
put $\mathfrak{V}(p, n) = X_n$.
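
The counterexample can be spelled out as a short calculation. The sketch below
assumes that $\Box_F$ is read as an intersection over all strictly future
moments $n > 0$ and evaluates both sides at moment 0; the symbols are those
used in the text, and the calculation is only an illustration of the argument,
not part of the original proof.

```latex
\begin{align*}
\mathfrak{V}(\Box_F \mathbf{I}p, 0)
  &= \bigcap_{n > 0} \mathbb{I}\,\mathfrak{V}(p, n)
   = \bigcap_{n > 0} \left(-\tfrac{1}{n}, \tfrac{1}{n}\right)
   = \{0\}, \\
\mathfrak{V}(\mathbf{I}\Box_F p, 0)
  &= \mathbb{I} \bigcap_{n > 0} \mathfrak{V}(p, n)
   = \mathbb{I}\,\{0\}
   = \emptyset .
\end{align*}
```

So the left-hand side is not contained in the right-hand side. In an
Aleksandrov space, by contrast, arbitrary intersections of open sets are open,
so $\bigcap_{n>0} \mathbb{I}\,\mathfrak{V}(p, n)$ is an open subset of
$\bigcap_{n>0} \mathfrak{V}(p, n)$ and is therefore included in its interior.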

However, even this seemingly weak interaction between topological and
temporal operators turns out to be dangerous:


THEOREM 9.24 The satisfiability problem for
$\mathcal{LTL} \circ \mathcal{S}4_u$ formulas in tt-models is undecidable. It
is undecidable as well for tt-models satisfying FSA, and for (arbitrary)
finite tt-models.


We can try to weaken the topological component again. The language
$\mathcal{LTL} \circ \mathcal{RC}^{max}$ can be obtained from
$\mathcal{LTL} \times \mathcal{RC}^{max}$ by replacing the constructor
$\tau_1 \,\mathcal{U}\, \tau_2$ with $\bigcirc\tau$ in the definition of
terms. It is not known whether this helps, that is, whether the satisfiability
problem for $\mathcal{LTL} \circ \mathcal{RC}^{max}$ formulas in tt-models or
in (arbitrary) finite tt-models is decidable.


If we weaken the spatial component even further, then this kind of
combination turns out to be decidable. Consider the languages
$\mathcal{LTL} \circ \mathcal{L}$, for
$\mathcal{L} \in \{\mathcal{RC}, \mathcal{BRCC}\text{-}8, \mathcal{RCC}\text{-}8\}$,
which differ from $\mathcal{LTL} \times \mathcal{L}$ only in the following
aspect: in the corresponding definition of region terms $\varrho$ the
constructor $\mathbf{CI}(\varrho_1 \,\mathcal{U}\, \varrho_2)$ is replaced with
$\mathbf{CI}\bigcirc\varrho$. Then again we have a hierarchy of gradually
decreasing complexity:


THEOREM 9.25 The satisfiability problem for LTL o RC formulas in 
tt-models is 2EXPSPACE-complete. It is 2EXPSPACE-complete as well for 
tt-models satisfying FSA and for (arbitrary) finite tt-models. 


THEOREM 9.26 The satisfiability problem for LT L o BRCC-8 formulas in tt- 
models is EXPSPACE-complete. It is EXPSPACE-complete as well for 
tt-models satisfying FSA and for (arbitrary) finite tt-models. 


The proofs of Theorems 9.25 and 9.26 are essentially the same as those
of Theorems 9.22 and 9.23. The difference is that now the correspondence
between arbitrary satisfiability and satisfiability in tt-models based on
Aleksandrov spaces holds not only for tt-models satisfying FSA, but for
arbitrary tt-models as well.


THEOREM 9.27 The satisfiability problem for
$\mathcal{LTL} \circ \mathcal{RCC}$-8 formulas in tt-models is
PSPACE-complete.


The idea of the proof is to separate the topological and temporal parts of a
given formula, and then use available satisfiability checking algorithms for
the component logics (see also Theorem 9.28 below). In order to take into
account the interaction between the topological and temporal parts, one has to
use the so-called ‘completion property’ of RCC-8 (cf. Balbiani and Condotta,
2002) with respect to a certain class $\mathcal{C}$ of models: given a
satisfiable set $\Phi$ of RCC-8 formulas and a model in $\mathcal{C}$
satisfying a subset of $\Phi$, one can extend this ‘partial’ model to a model
in $\mathcal{C}$ satisfying the whole of $\Phi$.

The exact complexity of the satisfiability problem for
$\mathcal{LTL} \circ \mathcal{RCC}$-8 formulas in tt-models satisfying FSA,
and in (arbitrary) finite tt-models, is not known. These problems are
PSPACE-hard by Theorem 9.16 and in EXPSPACE by Theorem 9.26.


Combinations with (PC) only. If we want to keep the complexity low but
to use an expressive topological component, then the interaction between space
and time has to be weakened. One way of doing this is to consider combined
languages in which the temporal operators can be applied to spatial formulas
but not to spatial terms. The resulting combinations will satisfy (PC), but
neither (LOC) nor (AOC) is expressible. (This way of ‘temporalising’ a logic
was first introduced by Finger and Gabbay, 1992.)

Denote by $\mathcal{LTL}[\mathcal{S}4_u]$ the spatio-temporal language given
by the following definition:

    $\tau \; ::= \; p \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \tau_1 \sqcup \tau_2 \mid \mathbf{I}\tau \mid \mathbf{C}\tau,$
    $\varphi \; ::= \; \tau_1 \sqsubseteq \tau_2 \mid \neg\varphi \mid \varphi_1 \wedge \varphi_2 \mid \varphi_1 \vee \varphi_2 \mid \varphi_1 \,\mathcal{U}\, \varphi_2.$

Note that the definition of $\mathcal{LTL}[\mathcal{S}4_u]$ terms coincides
with the definition of spatial terms in $\mathcal{S}4_u$, which reflects the
fact that $\mathcal{LTL}[\mathcal{S}4_u]$ cannot capture (LOC) or (AOC). We
have imposed no restrictions upon the temporal operators in formulas, so the
combined language still contains $\mathcal{LTL}$. (Clearly, $\mathcal{S}4_u$
is a fragment of $\mathcal{LTL}[\mathcal{S}4_u]$.)


THEOREM 9.28 The satisfiability problem for $\mathcal{LTL}[\mathcal{S}4_u]$
formulas in tt-models and in (arbitrary) finite tt-models is PSPACE-complete.


The proof of this theorem is based on the fact that the interaction between
spatial and temporal components of $\mathcal{LTL}[\mathcal{S}4_u]$ is rather
limited. In fact, for every $\mathcal{LTL}[\mathcal{S}4_u]$ formula $\varphi$
one can construct an $\mathcal{LTL}$ formula $\varphi^*$ by replacing every
occurrence of a (spatial) subformula $\tau_1 \sqsubseteq \tau_2$ in $\varphi$
with a fresh propositional variable $p_{\tau_1,\tau_2}$. Then, given an
$\mathcal{LTL}$-model $\mathfrak{N}$ for $\varphi^*$ (based on
$(\mathbb{N}, <)$ or a finite flow of time) and a moment $n$, we take the set

    $\Phi_n = \{\tau_1 \sqsubseteq \tau_2 \mid (\mathfrak{N}, n) \models p_{\tau_1,\tau_2}\} \cup \{\neg(\tau_1 \sqsubseteq \tau_2) \mid (\mathfrak{N}, n) \models \neg p_{\tau_1,\tau_2}\}$

of spatial formulas. It is not hard to see that if $\Phi_n$ is satisfiable for
every $n$, then there is a tt-model satisfying $\varphi$ (simply because
extensions of a spatial variable at different time moments are independent).
Now, to check whether $\varphi$ is satisfiable, it suffices to use a suitable
nondeterministic algorithm (e.g., Sistla and Clarke, 1985) which guesses an
$\mathcal{LTL}$-model for $\varphi^*$ and then, for each time point $n$, to
check satisfiability of $\Phi_n$. This can be done using polynomial space
in the length of $\varphi$.
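
The two syntactic steps of this argument, building the propositional
abstraction and reading off the set of spatial formulas required at a moment,
can be sketched as follows. This is a minimal illustration under stated
assumptions, not the decision procedure itself: formulas are encoded as nested
tuples, spatial terms are treated as opaque strings, and the names `abstract`
and `phi_n` are hypothetical. Guessing the temporal model and checking the
spatial sets for satisfiability are left out.

```python
def abstract(formula, table):
    """Replace every spatial atom ("sub", t1, t2) by a propositional variable.

    `table` maps each pair (t1, t2) to the name of its fresh variable and is
    filled in as atoms are encountered.
    """
    op = formula[0]
    if op == "sub":
        atom = (formula[1], formula[2])
        name = table.setdefault(atom, f"p{len(table)}")
        return ("var", name)
    # Boolean and temporal connectives are copied unchanged.
    return (op,) + tuple(abstract(sub, table) for sub in formula[1:])


def phi_n(table, true_vars):
    """Spatial formulas required at a moment where exactly the variables in
    `true_vars` are true in the guessed temporal model."""
    result = set()
    for (t1, t2), name in table.items():
        atom = ("sub", t1, t2)
        result.add(atom if name in true_vars else ("not", atom))
    return result


# Example: (p is a subset of q) Until not(q is a subset of Ip).
table = {}
phi = ("until", ("sub", "p", "q"), ("not", ("sub", "q", "Ip")))
phi_star = abstract(phi, table)

# At a moment where only the first variable holds:
required = phi_n(table, {"p0"})
```

Because the sets obtained for different moments constrain the spatial
variables independently, each of them can be handed to a purely spatial
satisfiability checker on its own.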

Theorem 9.28 (together with Theorem 9.16) shows that the satisfiability
problem for each of the spatio-temporal logics of the form
$\mathcal{LTL}[\mathcal{L}]$, where
$\mathcal{L} \in \{\mathcal{RCC}\text{-}8, \mathcal{BRCC}\text{-}8, \mathcal{RC}, \mathcal{RC}^{max}\}$,
is also PSPACE-complete.

However, if, instead of $\mathcal{LTL}$, we consider its NP-complete fragment
$\mathcal{LTL}_\Box$, the complexity of ‘temporalisations’ can be even lower.
On the one hand, by Theorems 9.1, 9.7 and 9.28,
$\mathcal{LTL}_\Box[\mathcal{S}4_u]$ and
$\mathcal{LTL}_\Box[\mathcal{RC}^{max}]$ are still PSPACE-complete. On the
other, by considering NP-complete topological components, the same argument as
in the proof of Theorem 9.28 gives us:


THEOREM 9.29 The satisfiability problem for $\mathcal{LTL}_\Box[\mathcal{RC}]$
formulas in tt-models is NP-complete.


It follows from Theorem 9.29 that the satisfiability problems for the weaker
$\mathcal{LTL}_\Box[\mathcal{RCC}\text{-}8]$ and
$\mathcal{LTL}_\Box[\mathcal{BRCC}\text{-}8]$ are NP-complete as well.


6.2 Combinations with branching time temporal logic $\mathcal{BTL}$


In the framework of linear time spatio-temporal logics, we can say, for
instance, that the U.K. will join the euro-zone:
$\Diamond_F \mathsf{P}(UK, Eurozone)$. We can also say that this will never
happen. But we are not able to convey the reality, viz., that both variants
are possible, that is, something like

(9.25)    $\mathsf{E}\Diamond_F \mathsf{P}(UK, Eurozone) \wedge \mathsf{E}\neg\Diamond_F \mathsf{P}(UK, Eurozone).$

In this section we summarise the results of Wolter and Zakharyaschev, 2002 on
the combinations of the branching time temporal logic $\mathcal{BTL}$ with the
topo-logic $\mathcal{BRCC}$-8.

The combined languages are interpreted in the following modification of
tt-models. A branching time topological model (a btt-model, for short) is a
quadruple $\mathfrak{M} = (\mathfrak{F}, \mathcal{H}, \mathfrak{T}, \mathfrak{V})$,
where $\mathfrak{F} = (W, <)$ is an $\omega$-tree, $\mathcal{H}$ a set of
histories in $\mathfrak{F}$, $\mathfrak{T} = (U, \mathbb{I})$ a topological
space, and $\mathfrak{V}$, a valuation, is a map associating with every
spatial variable $p$ and every time point $w \in W$ a set
$\mathfrak{V}(p, w) \subseteq U$. (Observe that according to this definition,
$\mathfrak{V}(p, w)$, the ‘space’ occupied by $p$ at moment $w$, does not
depend on the actual history of events.)

As concerns the languages, for each choice of topological/linear-time
combination
$\mathcal{L} \in \{\mathcal{LTL}[\mathcal{BRCC}\text{-}8], \mathcal{LTL} \circ \mathcal{BRCC}\text{-}8, \mathcal{LTL} \times \mathcal{BRCC}\text{-}8\}$,
we have
two options: to allow applications of $\mathsf{A}$ and $\mathsf{E}$ to $\mathcal{L}$-formulas only, or to both
£-formulas and £-region terms. The resulting languages will be denoted by £? 
(the former option) and L” (the latter one). 

For example, (9.21), (9.22) and (9.25) are LT L[BRCC-8]°-formulas. The 
following (CTL o BRCC-8)” -formula 





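    $\mathsf{A}\Box_F^{+}\bigl(\mathsf{EQ}(Europe, \bigcirc Europe) \wedge \mathsf{P}(EU, Europe)\bigr) \wedge \mathsf{P}(Europe, \mathsf{E}\bigcirc EU) \wedge \mathsf{P}(\mathsf{A}\bigcirc EU, EU)$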











says that, whatever happens, the region occupied by Europe will always remain
the same and the EU will be part of Europe; moreover, every part of Europe has
a possibility to join the EU next year, while, on the other hand, what will
certainly belong to the EU next year is only part of the EU as it is today.

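Now the valuation $\mathfrak{V}$ in btt-models can be inductively extended to
arbitrary region terms in a way similar to the linear case: we only have to
add a history as a parameter. Given a region term $\varrho$, a history
$h \in \mathcal{H}$, and a time point $w \in h$, define the value
$\mathfrak{V}(\varrho, h, w)$ of $\varrho$ at $w$ relative to $h$ inductively
by taking

■ $\mathfrak{V}(\mathbf{CI}p, h, w) = \mathbb{CI}\,\mathfrak{V}(p, w)$, $p$ a spatial variable,

■ $\mathfrak{V}(\mathbf{CI}\overline{\varrho}, h, w) = \mathbb{CI}(U - \mathfrak{V}(\varrho, h, w))$,

■ $\mathfrak{V}(\mathbf{CI}(\varrho_1 \sqcap \varrho_2), h, w) = \mathbb{CI}(\mathfrak{V}(\varrho_1, h, w) \cap \mathfrak{V}(\varrho_2, h, w))$,

■ $\mathfrak{V}(\mathbf{CI}(\varrho_1 \sqcup \varrho_2), h, w) = \mathbb{CI}(\mathfrak{V}(\varrho_1, h, w) \cup \mathfrak{V}(\varrho_2, h, w))$,

■ $\mathfrak{V}(\mathbf{CI}(\varrho_1 \,\mathcal{U}\, \varrho_2), h, w) = \mathbb{CI} \bigcup_{v > w,\, v \in h} \Bigl( \mathfrak{V}(\varrho_2, h, v) \cap \bigcap_{u \in (w, v)} \mathfrak{V}(\varrho_1, h, u) \Bigr)$,

■ $\mathfrak{V}(\mathbf{CI}\mathsf{E}\varrho, h, w) = \mathbb{CI} \bigcup_{h' \in \mathcal{H}(w)} \mathfrak{V}(\varrho, h', w)$,

■ $\mathfrak{V}(\mathbf{CI}\mathsf{A}\varrho, h, w) = \mathbb{CI} \bigcap_{h' \in \mathcal{H}(w)} \mathfrak{V}(\varrho, h', w)$.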


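Now, for a formula $\varphi$ and a pair $(h, w)$, the truth-value of $\varphi$
at $(h, w)$ in $\mathfrak{M}$ is defined inductively as follows:

■ $(\mathfrak{M}, h, w) \models \mathsf{Q}(\varrho_1, \varrho_2)$ iff
  $\mathsf{Q}(\mathfrak{V}(\varrho_1, h, w), \mathfrak{V}(\varrho_2, h, w))$
  holds in $\mathfrak{T}$, for RCC-8 predicates $\mathsf{Q}$,

■ $(\mathfrak{M}, h, w) \models \neg\varphi$ iff
  $(\mathfrak{M}, h, w) \not\models \varphi$,

■ $(\mathfrak{M}, h, w) \models \varphi_1 \wedge \varphi_2$ iff
  $(\mathfrak{M}, h, w) \models \varphi_1$ and
  $(\mathfrak{M}, h, w) \models \varphi_2$,

■ $(\mathfrak{M}, h, w) \models \varphi_1 \,\mathcal{U}\, \varphi_2$ iff there
  exists $v > w$, for $v \in h$, such that
  $(\mathfrak{M}, h, v) \models \varphi_2$ and
  $(\mathfrak{M}, h, u) \models \varphi_1$ for all $u \in (w, v)$,

■ $(\mathfrak{M}, h, w) \models \mathsf{E}\varphi$ iff there is
  $h' \in \mathcal{H}(w)$ such that $(\mathfrak{M}, h', w) \models \varphi$,

■ $(\mathfrak{M}, h, w) \models \mathsf{A}\varphi$ iff for all
  $h' \in \mathcal{H}(w)$, we have $(\mathfrak{M}, h', w) \models \varphi$.

A formula $\varphi$ is called satisfiable if there is a btt-model
$\mathfrak{M}$ such that $(\mathfrak{M}, h, w) \models \varphi$ for some
history $h \in \mathcal{H}$ and time point $w \in h$.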


THEOREM 9.30 The satisfiability problem for (LTL o BRCC-8)° formulas 
in btt-models is decidable. 


No significant result on the computational complexity of this satisfiability 
problem has been obtained yet. 

As to satisfiability of £” -formulas, we again face the problem of infinitary 
temporal operations on region terms. Now, besides the linear temporal opera- 
tors, the region terms can also be affected by the ‘branch’ operators A and E. 
In fact, at least for discrete topological spaces (i.e., spaces T = (U, I) in which 
I is the identity function) we have the following negative result: 


THEOREM 9.31 The satisfiability problem for (LT L x BRCC-8)” formulas 
in btt-models based on discrete topological spaces is undecidable. 


We conjecture that the satisfiability problem for (CTL x BRCC-8)"" for- 
mulas in btt-models based on arbitrary topological and Euclidean spaces is 
undecidable as well. 

A natural way to search for decidable variants of the undecidable logics 
discussed above is to restrict the class of btt-models to those having finite sets 
of histories and where each history satisfies the finite state assumption. We 
conjecture that the satisfiability problem for (CTL x BRCC-8)"" formulas in 
this kind of btt-models is decidable. 


REMARK 9.32 Temporalisations of RCC-8 and BRCC-8 with the help of 
Allen’s interval calculus (see Remark 9.19) were considered by Bennett et al., 
2002, Gerevini and Nebel, 2002 and Gabbay et al., 2003. 


7. Combining distance logics with temporal logics 


Unfortunately, not much is known about temporal extensions of logics of
distance spaces. Of course, some of the ‘negative’ results from Sec. 6 hold
for similar combinations with those distance logics that contain
$\mathcal{S}4_u$ as a sub-logic (for example, $\mathcal{MT}$ or
$\mathcal{CSL}$). The technique of the proof of Theorem 9.28 can be used to
show that the temporalisations $\mathcal{LTL}[\mathcal{L}]$ of the logics
$\mathcal{L}$ of distance spaces from Sec. 3.3 (where the temporal operators
can only be applied to formulas but not to spatial terms) inherit the
complexity of $\mathcal{L}$ (see the end of this section). And finally, some
of the methods developed to deal with products of modal (in particular,
temporal) logics (see Gabbay et al., 2003 and references therein) can be
applied to analyse the computational behaviour of combinations of
$\mathcal{LTL}$ with distance logics like $\mathcal{MS}^{\le}$ which only
contain distance operators of the form $\exists^{\le a}$ (and their duals).

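Denote by $\mathcal{LTL} \times \mathcal{MS}^{\le}$ the spatio-temporal
language satisfying the (PC), (LOC) and (AOC) principles and given by the
following definition:

    $\tau \; ::= \; p_i \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \tau_1 \sqcup \tau_2 \mid \exists^{\le a}\tau \mid \tau_1 \,\mathcal{U}\, \tau_2,$
    $\varphi \; ::= \; \tau_1 \sqsubseteq \tau_2 \mid \neg\varphi \mid \varphi_1 \wedge \varphi_2 \mid \varphi_1 \vee \varphi_2 \mid \varphi_1 \,\mathcal{U}\, \varphi_2.$

As before, expressions of the form $\tau$ and $\varphi$ are called
$\mathcal{LTL} \times \mathcal{MS}^{\le}$ terms and formulas, respectively.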

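A metric temporal model (mt-model, for short) is a pair of the form
$\mathfrak{M} = (\mathfrak{D}, \mathfrak{V})$, where
$\mathfrak{D} = (\Delta, d)$ is a metric space and $\mathfrak{V}$, a
valuation, is a map associating with each spatial variable $p$ and each time
instant $n$ a set $\mathfrak{V}(p, n) \subseteq \Delta$. The valuation can be
inductively extended to arbitrary $\mathcal{LTL} \times \mathcal{MS}^{\le}$
terms in a straightforward way:

■ $\mathfrak{V}(\overline{\tau}, n) = \Delta - \mathfrak{V}(\tau, n)$, $\quad \mathfrak{V}(\tau_1 \sqcap \tau_2, n) = \mathfrak{V}(\tau_1, n) \cap \mathfrak{V}(\tau_2, n)$,

■ $\mathfrak{V}(\exists^{\le a}\tau, n) = \{x \in \Delta \mid \exists y\, (d(x, y) \le a \wedge y \in \mathfrak{V}(\tau, n))\}$,

■ $\mathfrak{V}(\tau_1 \,\mathcal{U}\, \tau_2, n) = \bigcup_{m > n} \Bigl( \mathfrak{V}(\tau_2, m) \cap \bigcap_{k \in (n, m)} \mathfrak{V}(\tau_1, k) \Bigr)$.

The truth-values of $\mathcal{LTL} \times \mathcal{MS}^{\le}$ formulas in
mt-models are defined in precisely the same way as for spatio-temporal logics
from Sec. 6.1. As before, we freely use the temporal operators $\bigcirc$,
$\Diamond_F$ and $\Box_F$ (as well as their non-strict versions
$\Diamond_F^{+}$ and $\Box_F^{+}$).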

As an example of an $\mathcal{LTL} \times \mathcal{MS}^{\le}$ formula,
consider the following formalisation of (I) from Sec. 2:

    $\bigwedge_{i=1,2} \Bigl( (desert_i \neq \bot) \wedge \Box_F^{+} \bigl( \exists^{\le a} desert_i \sqsubseteq \bigcirc desert_i \bigr) \Bigr) \rightarrow \Diamond_F (desert_1 \sqcap desert_2 \neq \bot).$

It says that two nonempty deserts (say, the Kalahari and the Sahara)
increasing their size in all directions by at least some
$a \in \mathbb{Q}^{>0}$ each year will eventually intersect. Notice that this
formula is valid in mt-models based on Euclidean spaces, but not in models
based on disconnected or discrete metric spaces.
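
The distance operator and the desert example can be tried out on a finite
metric space. The sketch below is only an illustration under stated
assumptions: the space is the set of integer points 0..10 with the usual
distance (a discrete space, not a Euclidean one), each desert grows by exactly
$a$ per year (the minimal growth allowed by the premise), and the function
names `exists_within` and `years_until_overlap` are hypothetical.

```python
def exists_within(points, dist, a, region):
    """Extension of the term 'exists^{<=a} region': all points that have
    some point of `region` at distance at most `a`."""
    return {x for x in points if any(dist(x, y) <= a for y in region)}


def years_until_overlap(points, dist, a, desert1, desert2, horizon):
    """Grow both deserts by `a` each year and return the first year in
    1..horizon in which they intersect, or None if they never do."""
    d1, d2 = set(desert1), set(desert2)
    for year in range(1, horizon + 1):
        d1 = exists_within(points, dist, a, d1)
        d2 = exists_within(points, dist, a, d2)
        if d1 & d2:
            return year
    return None


points = range(11)                      # the points 0, 1, ..., 10


def dist(x, y):
    return abs(x - y)


ball = exists_within(points, dist, 2, {5})
meet = years_until_overlap(points, dist, 1, {0}, {10}, horizon=50)
stuck = years_until_overlap(points, dist, 0.5, {0}, {10}, horizon=50)
```

With $a = 1$ the two deserts meet after finitely many years, whereas with
$a = 0.5$ no point of this discrete space lies within reach, so the deserts
never grow. This is the kind of situation behind the remark that the formula
is not valid over discrete metric spaces.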

Unfortunately, the complexity of this combination of a PSPACE-complete
and an EXPTIME-complete logic turns out to be too high. Using an almost
straightforward encoding of the recurring tiling problem (see, e.g., the proof
of Theorem 11.1 in Gabbay et al., 2003) one can prove the following:


THEOREM 9.33 The satisfiability problem for
$\mathcal{LTL} \times \mathcal{MS}^{\le}$ formulas in mt-models is
$\Sigma_1^1$-complete.


This result might suggest that combinations of $\mathcal{LTL}$ with
$\mathcal{MS}^{\le}$ have the same computational properties as combinations
with $\mathcal{S}4_u$ in Sec. 6.1. However, this is not the case. To see the
difference, let us consider the problem of term satisfiability for both
languages: a term $\tau$ is satisfiable if there is a model $\mathfrak{M}$ for
the language where $\mathfrak{V}(\tau, n)$ is not empty for some time moment
$n$. It can be shown (similarly to the proof of Theorem 9.20) that the
satisfiability problem for $\mathcal{LTL} \times \mathcal{S}4_u$ terms is
$\Sigma_1^1$-complete. In the case of $\mathcal{MS}^{\le}$ the picture is
slightly better: the problem is decidable, but not in time bounded by any
‘tower’ of exponents:


THEOREM 9.34 The satisfiability problem for
$\mathcal{LTL} \times \mathcal{MS}^{\le}$ terms in mt-models is decidable, but
not in elementary time.


The proof requires three ingredients. First, one can show (similarly to the
proof of Theorem 9.11) that any satisfiable
$\mathcal{LTL} \times \mathcal{MS}^{\le}$ term is satisfiable in an mt-model
based on a tree metric space. This observation makes it possible to apply the
methods developed to analyse the product modal logic
$\mathbf{PTL}_{\Box\bigcirc} \times \mathbf{K}$. In particular, the
decidability result is proved analogously to Theorem 13.6 from Gabbay et al.,
2003: first, mt-models are represented in the form of quasimodels, and then
the existence of a quasimodel for a given term is encoded in monadic
second-order logic. The non-elementary lower bound can be established by a
polynomial reduction of the satisfiability problem for
$\mathbf{PTL}_{\Box\bigcirc} \times \mathbf{K}$ (which is non-elementary by
Theorem 6.37 and Claim 6.25 of Gabbay et al., 2003) to satisfiability of
$\mathcal{LTL} \times \mathcal{MS}^{\le}$ terms.

It is worth noting that the language of
$\mathcal{LTL} \times \mathcal{MS}^{\le}$ terms is ‘local’ in the sense that
every term refers to a bounded area of the metric space, and the size of this
area can be effectively computed. (In particular, statement (I) refers to the
whole space, and so cannot be expressed in the language of
$\mathcal{LTL} \times \mathcal{MS}^{\le}$ terms.) In fact, this is the crucial
observation required for the decidability result.
$\mathcal{LTL} \times \mathcal{MS}^{\le}$ formulas, on the contrary, can speak
about the whole space, which makes it possible to simulate tilings.

In the same way one can explain the computational behaviour of the
combination $\mathcal{LTL} \circ \mathcal{MS}^{\le}$ satisfying both (PC) and
(LOC), but not (AOC). It is defined analogously to the spatio-temporal case by
replacing $\tau_1 \,\mathcal{U}\, \tau_2$ with $\bigcirc\tau$ in the
definition of terms:

    $\tau \; ::= \; p_i \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \tau_1 \sqcup \tau_2 \mid \exists^{\le a}\tau \mid \bigcirc\tau.$

The formulas of $\mathcal{LTL} \circ \mathcal{MS}^{\le}$ are defined in the
same way as above. As Theorem 9.24 might suggest, the same negative result
holds even for this restricted combination:


THEOREM 9.35 The satisfiability problem for
$\mathcal{LTL} \circ \mathcal{MS}^{\le}$ formulas in mt-models is undecidable.


On the other hand, $\mathcal{LTL} \circ \mathcal{MS}^{\le}$ terms are
basically ‘harmless’ because they can only speak about limited time, at most
$\ell(\tau)$ time moments, to be more precise (where $\ell(\tau)$ is the
length of $\tau$). So we have the following:


THEOREM 9.36 The satisfiability problem for
$\mathcal{LTL} \circ \mathcal{MS}^{\le}$ terms in mt-models is
EXPTIME-complete under both unary and binary coding of parameters in distance
operators.


Finally, the temporalisations of distance logics satisfying only the (PC) prin- 
ciple, and so containing no temporal operators in spatial terms, inherit the higher 
complexity of the spatial component, which is proved similarly to the proof of 
Theorem 9.28: 


THEOREM 9.37 The satisfiability problem for LT LIMS*<], LTL|MT], 
and LT L|CMS] formulas in mt-models is EXP TIME-complete for both unary 
and binary coding of parameters. 


8. Logics for dynamical systems 


The snapshot models and the corresponding spatio-temporal logics discussed 
above are a convenient tool for representing and reasoning about evolutions of 
spatial configurations of regions such as the political (geographical, weather, 
etc.) map of the changing world, where we are interested in keeping track of 
the relations between regions. 

On the other hand, if we want to model how an object moves over an oth- 
erwise stable space and keep track of its asymptotic trajectory then different 
models of space and time may be preferable, namely models corresponding to 
dynamical systems (e.g., Brown, 1976; Katok and Hasselblatt, 1995). 

A dynamical model is a pair of the form

    $(\mathfrak{M}, g),$

where $\mathfrak{M} = (\mathfrak{S}, p_0^{\mathfrak{M}}, p_1^{\mathfrak{M}}, \dots)$
is a spatial model and $g$ is a total function on the space $\mathfrak{S}$.
Often $g$ is required to satisfy certain constraints depending on the
structure of $\mathfrak{S}$. For example, if $\mathfrak{S}$ is a topological
space, then $g$ is often required to be continuous or even a bijective
continuous and open mapping (that is, a homeomorphism).

In the framework of such models, we are interested in the orbits

    $Orb_g(w) = \{g(w), g^2(w), \dots\}$

of certain points $w$ from $\mathfrak{S}$ (representing moving objects). The
model $\mathfrak{M}$ describes a spatial environment in which $w$ moves
according to the rule (law) $g$. A typical question in the framework of
dynamical models is whether a point from a region $p_0^{\mathfrak{M}}$ will
eventually reach $p_1^{\mathfrak{M}}$ without visiting $p_2^{\mathfrak{M}}$,
or whether the rule $g$ is such that $w$ will be returning to
$p_1^{\mathfrak{M}}$ infinitely often.

The aim of this section is to discuss the existing logics capable of talking 
about some aspects of dynamical models. In particular, we consider the relation 
between the spatio-temporal logics above and logics for dynamical models. 
Before reading this section the reader is advised to have a look at Ch. 10.

Let us begin with two illuminating examples.


A physical system. Consider a physical system with a single degree of
freedom, say, a body having mass $m$ and moving along some axis. The movement
of the body in a force field $f(x, t)$ can be described by the following
system of differential equations:

    $\dot{x}(t) = v(t),$
    $\dot{v}(t) = f(x, t)/m,$

where $x(t)$ and $v(t)$ are, respectively, the position and the velocity of
the body at time $t$. For every initial point $(x_0, v_0) \in \mathbb{R}^2$,
the differential equations determine the trajectory $T_{(x_0, v_0)}(t)$ of the
body (more precisely, its position and velocity) that starts with the velocity
$v_0$ at the position $x_0$ and moves according to the above equations. The
collection of all those trajectories for different initial conditions forms
the phase portrait of the differential equation (depicted in the left-hand
side of Fig. 9.9).

Now consider the function $\phi((x, v), t)$, called the flow of the equations,
defined by taking $\phi((x, v), t) = T_{(x, v)}(t)$. Note that

■ $\phi((x, v), 0) = (x, v)$ and
■ $\phi((x, v), t + s) = \phi(\phi((x, v), t), s)$.

The graph of this function represents trajectories in
$\mathbb{R}^2 \times \mathbb{R}$ and the phase portrait can be considered as
the projection of $\phi((x, v), t)$ onto $\mathbb{R}^2$; see Fig. 9.9. Note
also that $\phi((x, v), t)$ is continuous in all coordinates.
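
The two properties of the flow can be checked numerically on a concrete
system. The sketch below assumes, for illustration only, a time-independent
force $f(x) = -m\omega^2 x$ (a harmonic oscillator), for which the flow is
known in closed form; the function names `flow` and `g` and the step size are
hypothetical choices, not taken from the text.

```python
import math


def flow(state, t, omega=1.0):
    """Closed-form flow of x' = v, v' = -omega**2 * x after time t."""
    x, v = state
    c, s = math.cos(omega * t), math.sin(omega * t)
    return (x * c + (v / omega) * s, -x * omega * s + v * c)


def g(state, delta=0.1):
    """Discrete-time map obtained by sampling the flow with step delta."""
    return flow(state, delta)


start = (1.0, 0.0)

# phi(., 0) is the identity.
at_zero = flow(start, 0.0)

# phi(., t + s) agrees with phi(phi(., t), s), up to rounding error.
direct = flow(start, 0.7 + 0.5)
composed = flow(flow(start, 0.7), 0.5)

# Ten applications of g correspond to the flow at time 10 * delta.
state = start
for _ in range(10):
    state = g(state)
ten_steps = state
```

Sampling the flow at a fixed step, as `g` does here, is one simple way to pass
from continuous time to the iteration of a single function.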

Given such a physical system, we usually want to know answers to the
following standard questions. Suppose that the initial conditions (position
and velocity) of the body are restricted by some set
$I \subseteq \mathbb{R}^2$. Is it the case that starting from any point of $I$
the body will eventually reach some point in another set $F$? Will it be
visiting $F$ infinitely often? Is it the case that the body will never hit
some ‘danger zone’ $D \subseteq \mathbb{R}^2$?

A dynamical model $(\mathfrak{M}, g)$ for the differential equations above can
be defined as follows. The underlying space $\mathfrak{S}$ of $\mathfrak{M}$
is the Euclidean plane $\mathbb{R}^2$. Let $g(x, v) = \phi((x, v), \delta)$,
for some fixed small time unit $\delta > 0$. The predicates
$p_i^{\mathfrak{M}}$ can model the initial and final conditions $I$ and $F$,
the danger zone $D$, etc. As $g$ is easily seen to be continuous,
$(\mathfrak{M}, g)$ is a dynamical model with a continuous function on
$\mathbb{R}^2$. The three questions above can then be formalised as whether we
have

■ $I \subseteq \bigcup_{i > 0} g^{-i}(F)$,

■ $I \subseteq \bigcap_{j > 0} \bigcup_{i > j} g^{-i}(F)$,

■ $I \cap \bigcup_{i > 0} g^{-i}(D) = \emptyset$.

Figure 9.9. Dynamical system.

It is to be noted that, on the other hand, the flow $\phi((x, v), t)$ can be
regarded as a snapshot spatio-temporal model

    $\mathfrak{M}_0, \mathfrak{M}_1, \dots$

where $\mathfrak{M}_i = (\mathfrak{S}, g^{-i}(p_0^{\mathfrak{M}}), g^{-i}(p_1^{\mathfrak{M}}), \dots)$,
for $i \ge 0$. The intuition behind this definition is as follows: a point
$(x, v)$ belongs to a set $Y \subseteq \mathbb{R}^2$ at time point $i$ iff
$(x, v)$ is moved to $Y$ by $i$ consecutive applications of $g$, that is,
$(x, v) \in g^{-i}(Y)$.

Game of Life. Our second example is the Game of Life invented by
J.H. Conway in the 1970s (e.g., Allouche et al., 2001). The game is defined
as follows. We have a finite $\{1, \dots, n\} \times \{1, \dots, n\}$ or an
infinite $\mathbb{Z} \times \mathbb{Z}$ board. Each point on the board is
either occupied or vacant (living or dead). At each regular time step the
points of the board simultaneously change according to the following rules:


(birth) a vacant point with exactly three occupied neighbours becomes an 
occupied cell, 


(survival) an occupied point with two or three occupied neighbours stays 
occupied, 


(death) in all other cases, the point becomes or remains vacant. 
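
The three rules translate directly into code. The sketch below is a minimal
illustration, assuming the infinite board $\mathbb{Z} \times \mathbb{Z}$, the
usual eight-point neighbourhood, and a state represented by the finite set of
its occupied points (every other point is vacant); the names `neighbours` and
`step` are hypothetical.

```python
from itertools import product


def neighbours(cell):
    """The eight points adjacent to `cell` on the board Z x Z."""
    x, y = cell
    return {(x + dx, y + dy)
            for dx, dy in product((-1, 0, 1), repeat=2)
            if (dx, dy) != (0, 0)}


def step(occupied):
    """One synchronous step of the game on a finite set of occupied points."""
    occupied = set(occupied)
    # Only occupied points and their neighbours can be occupied next.
    candidates = occupied | {n for c in occupied for n in neighbours(c)}
    result = set()
    for cell in candidates:
        count = len(neighbours(cell) & occupied)
        # birth (vacant, exactly three) or survival (occupied, two or three)
        if count == 3 or (count == 2 and cell in occupied):
            result.add(cell)
    return result


blinker = {(0, -1), (0, 0), (0, 1)}
block = {(0, 0), (0, 1), (1, 0), (1, 1)}

after_one = step(blinker)        # the vertical bar becomes horizontal
after_two = step(after_one)      # and returns to its original position
```

Since `step` is a function of the current state alone, iterating it from a
given initial state produces exactly one sequence of states.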


Thus, at each step $i \ge 0$ the state of the game can be represented by the
spatial model

(9.27)    $\mathfrak{M}_i = (\mathfrak{S}, o^{\mathfrak{M}_i}, v^{\mathfrak{M}_i}),$

where $\mathfrak{S}$ is the board, $o^{\mathfrak{M}_i}$ is the set of occupied
points at step $i$ and $v^{\mathfrak{M}_i}$ the set of vacant ones.

The Game of Life can be represented by the spatial transition system which
consists of all possible models $\mathfrak{M}$ of the form (9.27), and
$\mathfrak{M} \to \mathfrak{M}'$ holds iff $\mathfrak{M}'$ is obtained from
$\mathfrak{M}$ by one step of the game according to the rules above. As the
Game of Life is deterministic (for every $\mathfrak{M}$ there is exactly one
$\mathfrak{M}'$ such that $\mathfrak{M} \to \mathfrak{M}'$), there is exactly
one evolution for any spatial transition system representing it. In other
words, for every initial state of the Game we obtain exactly one snapshot
model.

The Game of Life (on, say, $\mathbb{Z} \times \mathbb{Z}$) can also be
formalised as a dynamical model

    $\mathfrak{N} = ((\mathfrak{T}, p_0^{\mathfrak{N}}, p_1^{\mathfrak{N}}, \dots), g).$

The underlying space $\mathfrak{T}$ consists of all functions from
$\mathbb{Z} \times \mathbb{Z}$ into $\{o, v\}$ representing distributions of
occupied and vacant points, that is,
$\mathfrak{T} = \{o, v\}^{\mathbb{Z} \times \mathbb{Z}}$. The function $g$
maps every $\eta \in \{o, v\}^{\mathbb{Z} \times \mathbb{Z}}$ to the function
$g(\eta) \in \{o, v\}^{\mathbb{Z} \times \mathbb{Z}}$ representing the next
distribution of occupied and vacant points. In other words, the underlying
space can be regarded as the set of all models
$(\mathfrak{S}, o^{\mathfrak{M}}, v^{\mathfrak{M}})$ with the function $g$
given by the transition relation (rule) $\to$. Finally, define a metric $d$ on
$\{o, v\}^{\mathbb{Z} \times \mathbb{Z}}$ so that $g$ becomes a continuous
function for the induced topology $\mathbb{I}_d$ as follows. Set, for
$\eta_1, \eta_2 \in \{o, v\}^{\mathbb{Z} \times \mathbb{Z}}$,

    $d(\eta_1, \eta_2) = \frac{1}{k}$

iff $\eta_1$ and $\eta_2$ agree on all points within the $k \times k$ square

    $I_k = \{(n, m) \in \mathbb{Z} \times \mathbb{Z} \mid \max\{n, m\} < k\}$

but disagree on at least one point in $I_{k+1}$. One can show that the metric
$d$ defines a compact topological space on
$\{o, v\}^{\mathbb{Z} \times \mathbb{Z}}$ with respect to which $g$ is
continuous.

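Notice that in this dynamical model predicates are not subsets of the board
$\mathbb{Z} \times \mathbb{Z}$ but of
$\{o, v\}^{\mathbb{Z} \times \mathbb{Z}}$. We can take, for instance, some
interesting set $p_0^{\mathfrak{N}}$ of initial states (i.e., models
$(\mathfrak{S}, o^{\mathfrak{M}}, v^{\mathfrak{M}})$), say, those with
precisely $N$ living points, and check whether all of them (or at least one of
them) will eventually ‘die out,’ that is, reach the singleton set
$p_1^{\mathfrak{N}} = \{(\mathfrak{S}, o^{\mathfrak{M}}, v^{\mathfrak{M}})\}$
where $o^{\mathfrak{M}}$ is empty.

The resulting dynamical model
$\mathfrak{N} = ((\mathfrak{T}, p_0^{\mathfrak{N}}, p_1^{\mathfrak{N}}, \dots), g)$
can be ‘unravelled’ into the transition system $s_0 \to s_1 \to \dots$ where

    $\mathfrak{M}(s_n) = (\mathfrak{T}, g^{-n}(p_0^{\mathfrak{N}}), g^{-n}(p_1^{\mathfrak{N}}), \dots).$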


8.1 Dynamic topological logics 


We start our discussion of languages for reasoning about dynamical systems 
by considering dynamical models based on various topological spaces. 
A dynamic topological model (DTM, for short) is a pair

    $\mathfrak{A} = (\mathfrak{M}, g),$

where $\mathfrak{M} = (\mathfrak{T}, p_0^{\mathfrak{M}}, p_1^{\mathfrak{M}}, \dots)$
is a topological model based on a topological space
$\mathfrak{T} = (U, \mathbb{I})$ and $g$ is a function on $\mathfrak{T}$. The
minimum requirement imposed on $g$ in dynamical systems is its continuity. We
remind the reader that a function $g$ on $\mathfrak{T}$ is called continuous
if $g^{-1}(X)$ is open whenever $X \subseteq U$ is open. If $g(X)$ is open
whenever $X$ is open, then $g$ is called open. Another important type of
functions is homeomorphisms, that is, bijective continuous and open functions
on $\mathfrak{T}$. (It is also usually assumed that the underlying topological
spaces are compact. We will not make this assumption in general, but point out
when our results hold for compact topological spaces.)

The language we consider for representing and reasoning about dynamic 
topological systems is slightly different from most of the languages for snap- 
shot models because, as we have already seen, in dynamical systems we are 
more interested in following the orbit of an object in space and time rather 
than in comparing the relative positions of regions in space. That is why the 
language $\mathcal{DTL}$ for reasoning about topological systems is ‘local’ in the sense
that we see the space from the windows of our moving ‘car’ as opposed to 
the ‘global’ language of spatio-temporal logics from Sec. 6.1 where we could 
observe all moving ‘cars’ and their relative positions. Formally, this means that 
we represent knowledge about the evolution of objects by means of terms and 
do not consider formulas constructed from them. 

The set of $\mathcal{DTL}$-terms $\tau$ is defined as follows:

    $\tau \; ::= \; p \mid \overline{\tau} \mid \tau_1 \sqcap \tau_2 \mid \mathbf{I}\tau \mid \bigcirc\tau \mid \Box_F\tau \mid \Diamond_F\tau.$

It is worth noting that the addition of the operator $\mathcal{U}$ for ‘until’
to the set of constructors for terms would not affect any of the results
presented below. We have omitted ‘until’ to keep the language as simple as
possible.

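In a dynamic topological model $\mathfrak{A} = \langle \mathfrak{M}, g \rangle$, terms $\tau$ are interpreted as sets
$\tau^{\mathfrak{A}} \subseteq U$, where $\mathfrak{M}$ is based on the topological space $\langle U, \mathbb{I} \rangle$. Clearly, $p_i^{\mathfrak{A}} = p_i^{\mathfrak{M}}$
for every spatial variable $p_i$. The Boolean operators and the operator $\mathbf{I}$ are
interpreted as before. The interpretation of the temporal operators on terms
should become clear from the following consideration: for a point $w \in U$ and
a term $\tau$, we have


(9.28) $w \in (\bigcirc\tau)^{\mathfrak{A}}$ iff $g(w) \in \tau^{\mathfrak{A}}$ iff $w \in g^{-1}(\tau^{\mathfrak{A}})$.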


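Roughly, a time point $n$ in a snapshot model corresponds to $n$ applications of
the function $g$. If we understand $w \in (\Diamond_F\tau)^{\mathfrak{A}}$ as ‘eventually $w$ will be moved
by $g$ to $\tau^{\mathfrak{A}}$’ and $w \in (\Box_F\tau)^{\mathfrak{A}}$ as ‘$g$ will always keep $w$ in $\tau^{\mathfrak{A}}$,’ then


(9.29) $w \in (\Diamond_F\tau)^{\mathfrak{A}}$ iff $Orb_g(w) \cap \tau^{\mathfrak{A}} \neq \emptyset$ iff $w \in \bigcup_{i>0} g^{-i}(\tau^{\mathfrak{A}})$,

(9.30) $w \in (\Box_F\tau)^{\mathfrak{A}}$ iff $Orb_g(w) \subseteq \tau^{\mathfrak{A}}$ iff $w \in \bigcap_{i>0} g^{-i}(\tau^{\mathfrak{A}})$.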














For example, $w \in (p_1 \sqcap \Diamond_F p_2)^{\mathfrak{A}}$ means that $w$ is in $p_1^{\mathfrak{A}}$ and reaches $p_2^{\mathfrak{A}}$ by a
finite number of iterations of $g$.
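
To make clauses (9.28)-(9.30) concrete, the following Python sketch computes the three temporal operators on a small finite set of points. It is only an illustration under stated assumptions: the model is finite, the topology is ignored, and the names `preimage`, `next_op`, `eventually` and `always`, as well as the four-point example, are hypothetical and not taken from the chapter.

```python
def preimage(g, universe, target):
    """Return g^{-1}(target): the points of `universe` that g sends into `target`."""
    return frozenset(w for w in universe if g[w] in target)


def next_op(g, universe, tau):
    """Clause (9.28): the 'next' operator is the preimage under g."""
    return preimage(g, universe, tau)


def eventually(g, universe, tau):
    """Clause (9.29): union of g^{-i}(tau) over all i > 0.

    On a finite universe the union stabilises, so we iterate
    X := g^{-1}(tau | X) from the empty set until nothing changes.
    """
    result = frozenset()
    while True:
        new = preimage(g, universe, tau | result)
        if new == result:
            return result
        result = new


def always(g, universe, tau):
    """Clause (9.30): intersection of g^{-i}(tau) over all i > 0,
    computed as the complement of 'eventually' applied to the complement."""
    return universe - eventually(g, universe, universe - tau)


# Hypothetical example: g shifts each point up by one and fixes the point 3.
U = frozenset({0, 1, 2, 3})
g = {0: 1, 1: 2, 2: 3, 3: 3}
p1 = frozenset({0})
p2 = frozenset({3})

# Points that are in p1 and reach p2 after finitely many iterations of g.
reaches = p1 & eventually(g, U, p2)
```

Because the operators quantify over strictly positive iterates only, a point that lies in a set but never returns to it is not in the `eventually` set of that set.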

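In this section, we are interested in the satisfiability and validity problem for
$\mathcal{DTL}$-terms in some important classes of dynamic topological models:


■ A $\mathcal{DTL}$-term $\tau$ is satisfiable in a class $\mathcal{M}$ of DTMs iff there exists $\mathfrak{A} \in \mathcal{M}$
such that $\tau^{\mathfrak{A}} \neq \emptyset$.


■ A $\mathcal{DTL}$-term $\tau$ is valid in a class $\mathcal{M}$ of DTMs iff $\overline{\tau}$ is not satisfiable in
$\mathcal{M}$, i.e., iff $\tau^{\mathfrak{A}}$ coincides with the whole space for every $\mathfrak{A} \in \mathcal{M}$.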


DTMs with homeomorphisms. We first connect satisfiability in certain 
dynamic topological models with satisfiability in snapshot topological temporal 
models. The discussion of the two examples above indicates already how 
one can go back and forth between snapshot topological models and dynamic 
topological models. More precisely, one can show the following: 


THEOREM 9.38 Let $\mathcal{M}$ be any of the following classes of dynamic topological
models:


■ DTMs based on Aleksandrov spaces with homeomorphisms;
■ DTMs based on topological spaces with homeomorphisms;
■ DTMs based on $\mathbb{R}^n$ with homeomorphisms, for $n > 1$;
■ DTMs based on the $n$-dimensional unit ball with a measure preserving
homeomorphism, for $n > 1$.


Then a $\mathcal{DTL}$-term $\tau$ is satisfiable in $\mathcal{M}$ iff the formula $\neg(\tau = \bot)$ is satisfiable
in a snapshot tt-model based on a topological space underlying some model
from the class $\mathcal{M}$.


It should not come as a surprise now that reasoning with $\mathcal{DTL}$-terms about
these classes of DTMs can be extremely complex. The following result was
proved in Konev et al., 2006b by reduction of Post’s correspondence problem:


THEOREM 9.39 Let $\mathcal{M}$ be any of the classes of DTMs mentioned in
Theorem 9.38. Then the set of $\mathcal{DTL}$-terms that are valid in models from $\mathcal{M}$ is not
recursively enumerable.


It is worth noting that the four sets of terms that are valid in the classes 
of models mentioned in Theorem 9.38 are all different. As was shown by 
Slavnov (2003), the term 

IOp(p N CIP) 





is not satisfiable in any DTM based on $(\mathbb{R}^n, g)$, while it is clearly satisfiable in
some DTM. According to Kremer and Mints (2005), the term


$\mathbf{I}p \sqsupset \mathbf{C}\Diamond_F\mathbf{I}p$,


where $\tau_1 \sqsupset \tau_2 = \overline{\tau_1} \sqcup \tau_2$, is valid in all unit balls, but refuted in a DTM based
on an Aleksandrov space and a DTM based on $\mathbb{R}^n$ with the homeomorphism
$(x_1, \dots, x_n) \mapsto (x_1, \dots, x_{n-1}, x_n + 1)$. Finally, the $\mathcal{DTL}$-term


$\Box_F\mathbf{I}p \sqsupset \mathbf{I}\Box_F p$


is valid in DTMs based on Aleksandrov spaces, but refuted in the classes of 
DTMs based on Euclidean spaces and unit balls. 


DTMs with continuous functions. Theorem 9.38 shows that DTMs with 
homeomorphisms behave similarly to topological snapshot models. This situ- 
ation changes drastically for DTMs with continuous functions (which are not 
necessarily open). In this case, no corresponding snapshot tt-models have been 
developed. To clarify—at least to some extent—the relation between the two 
kinds of models, let us consider DTMs based on Aleksandrov spaces. 
Suppose that an Aleksandrov topological space $\mathfrak{T}_{\mathfrak{G}} = \langle W, \mathbb{I}_{\mathfrak{G}} \rangle$ is induced
by the quasi-order $\mathfrak{G} = \langle W, R \rangle$ (see Sec. 3.1). Then it is easy to check that a
function $g\colon W \to W$ is a continuous function on $\mathfrak{T}_{\mathfrak{G}}$ iff for all $u, v \in W$,


$uRv$ implies $g(u)Rg(v)$.

(A bijection $f$ is a homeomorphism on $\mathfrak{T}_{\mathfrak{G}}$ iff both the above implication and
its converse hold.)
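
The equivalence between continuity and monotonicity can be checked by brute force on a small quasi-order. The sketch below is a hedged illustration, not part of the chapter: it assumes a finite set of points, takes the open sets to be the subsets closed under $R$-successors, and all function names and the three-point chain are hypothetical.

```python
from itertools import combinations, product


def is_open(subset, W, R):
    """A set is open in the Aleksandrov topology iff it is closed under R-successors."""
    return all(v in subset for u in subset for v in W if (u, v) in R)


def all_subsets(W):
    items = sorted(W)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def is_continuous(g, W, R):
    """Topological definition: the preimage of every open set is open."""
    for X in all_subsets(W):
        if is_open(X, W, R):
            pre = frozenset(w for w in W if g[w] in X)
            if not is_open(pre, W, R):
                return False
    return True


def is_monotone(g, W, R):
    """Order-theoretic condition: uRv implies g(u)Rg(v)."""
    return all((g[u], g[v]) in R for (u, v) in R)


# Hypothetical example: the three-element chain 0 R 1 R 2 (reflexive, transitive).
W = frozenset({0, 1, 2})
R = frozenset({(0, 0), (1, 1), (2, 2), (0, 1), (1, 2), (0, 2)})
shift = {0: 1, 1: 2, 2: 2}   # order preserving
swap = {0: 2, 1: 1, 2: 0}    # order reversing

# Enumerate every function on W and compare the two notions.
points = sorted(W)
agree_everywhere = all(
    is_continuous(dict(zip(points, images)), W, R)
    == is_monotone(dict(zip(points, images)), W, R)
    for images in product(points, repeat=len(points))
)
```

The exhaustive comparison covers only this one frame, so it illustrates the stated equivalence rather than proving it.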

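This observation suggests that DTMs based on Aleksandrov spaces with
continuous functions correspond to what may be called Aleksandrov snapshot
models with expanding domains. Indeed, suppose that $\mathfrak{A} = \langle \mathfrak{M}, g \rangle$ is a DTM
where $\mathfrak{M} = \langle \mathfrak{T}_{\mathfrak{G}}, p_0^{\mathfrak{M}}, p_1^{\mathfrak{M}}, \dots \rangle$, $\mathfrak{G} = \langle W, R \rangle$ is as above and $g$ is a continuous
and surjective map on $\mathfrak{T}_{\mathfrak{G}}$. Consider the sequence of models


(9.31) $\mathfrak{M}_0 = \mathfrak{M}$, $\mathfrak{M}_1 = \langle \mathfrak{T}_{\mathfrak{G}_1}, p_0^{\mathfrak{M}_1}, \dots \rangle$, $\mathfrak{M}_2 = \langle \mathfrak{T}_{\mathfrak{G}_2}, p_0^{\mathfrak{M}_2}, \dots \rangle$, $\dots$

where

■ $\mathfrak{G}_n = \langle W, R_n \rangle$,

■ $uR_nv$ iff $g^n(u)Rg^n(v)$ for any $u, v \in W$,

■ $u \in p_i^{\mathfrak{M}_n}$ iff $g^n(u) \in p_i^{\mathfrak{M}}$.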


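The temporal and topological operators on this sequence of models can be
interpreted in exactly the same way as in Sec. 6.1. In particular,


■ $u \in (\mathbf{C}\tau)^{\mathfrak{M}_n}$ iff there is $v \in W$ such that $uR_nv$ and $v \in \tau^{\mathfrak{M}_n}$,
■ $u \in (\Diamond_F\tau)^{\mathfrak{M}_n}$ iff there is $m > n$ such that $u \in \tau^{\mathfrak{M}_m}$.


We then obtain that, for every $\mathcal{DTL}$-term $\tau$, every $w \in W$ and every $n \geq 0$,


$g^n(w) \in \tau^{\mathfrak{A}}$ iff $w \in \tau^{\mathfrak{M}_n}$,


and so $\tau$ is satisfiable in $\mathfrak{A}$ iff $\tau$ is satisfiable in (9.31).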
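
The relations $R_n$ of (9.31) are easy to compute on a finite quasi-order. The following sketch is illustrative only, and its names and three-point example are hypothetical. It also deliberately drops the surjectivity assumption made in the text: on a finite set a surjective monotone map is a bijection whose inverse is monotone as well, so all $R_n$ would coincide with $R$ and nothing would grow.

```python
def iterate(g, w, n):
    """Apply g to w exactly n times."""
    for _ in range(n):
        w = g[w]
    return w


def relation_at(g, W, R, n):
    """R_n from (9.31): u R_n v iff g^n(u) R g^n(v)."""
    return frozenset(
        (u, v) for u in W for v in W
        if (iterate(g, u, n), iterate(g, v, n)) in R
    )


def is_monotone(g, R):
    """uRv implies g(u)Rg(v), i.e. g is continuous on the induced space."""
    return all((g[u], g[v]) in R for (u, v) in R)


# Hypothetical quasi-order: reflexive pairs plus 0 R 2.
W = frozenset({0, 1, 2})
R = frozenset({(0, 0), (1, 1), (2, 2), (0, 2)})
g = {0: 0, 1: 0, 2: 2}   # monotone, but neither injective nor surjective

R0 = relation_at(g, W, R, 0)
R1 = relation_at(g, W, R, 1)
R2 = relation_at(g, W, R, 2)
```

In this example the pair (1, 2) is not in $R$ but appears in $R_1$, because $g$ sends 1 to 0 and 0 is related to 2.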

The difference between (9.31) and the snapshot models we have considered
before is that the spaces $\mathfrak{T}_{\mathfrak{G}_n}$ or, which is the same, the quasi-orders
$\mathfrak{G}_n = \langle W, R_n \rangle$ do not necessarily coincide. More precisely, using the fact that $g$ is
continuous it is easy to see that $R_n \subseteq R_{n+1}$ for every $n \geq 0$; see Fig. 9.10.
That is why we call these models snapshot models with expanding domains.
Fig. 9.10 also shows that the term


$\bigcirc\mathbf{C}\tau \sqsupset \mathbf{C}\bigcirc\tau$


is not valid in all DTMs with continuous functions, while it is clearly valid in
all DTMs with homeomorphisms. For more details on the connection between
such models and DTMs based on Aleksandrov spaces with continuous functions
see Gabelaia et al., 2006.

Figure 9.10. Model with expanding domains (axes: space against the time points $n$, $n+1$, $n+2$).


It is known (e.g., Gabbay et al., 2003) that satisfiability in models with
expanding domains can be reduced to satisfiability in models with constant
domains, but not the other way round, as we shall see a bit later. So in principle one
could expect that the dynamic topological logics interpreted in DTMs based on
arbitrary, Aleksandrov or Euclidean topological spaces with continuous
functions behave ‘better’ than their counterparts with homeomorphisms. Indeed,
a fine-grained complexity analysis reveals interesting differences between the
logic of homeomorphisms and the logic of continuous functions. We begin
with the following ‘negative’ theorem proved in Konev et al., 2005 and Konev
et al., 2006a:


THEOREM 9.40 Let $\mathcal{M}$ be any of the following classes of dynamic topological
models:


■ DTMs based on Aleksandrov spaces with continuous functions;
■ DTMs based on topological spaces with continuous functions;
■ DTMs based on $\mathbb{R}^n$ with continuous functions, for $n > 1$.

Then the satisfiability problem for $\mathcal{DTL}$-terms in $\mathcal{M}$ is undecidable.


Note that, in contrast to DTMs with homeomorphisms, it is still not clear
whether any of these logics is recursively enumerable or even finitely
axiomatisable. However, the first exciting difference between the algorithmic behaviour
of the two models can be observed by considering the fragment of $\mathcal{DTL}$ in
which the topological operators are not applied to formulas containing the
‘infinitary’ temporal operators $\Diamond_F$ and $\Box_F$. This language is still very
expressive and the undecidability/non-axiomatisability results of Theorems 9.39
and 9.40 still hold for it. However, the set of formulas from this fragment that
are valid in DTMs based on Aleksandrov spaces or arbitrary topological spaces
with continuous functions is recursively enumerable. This is proved in Konev
et al., 2006a by an application of Kruskal’s tree theorem.

The proof of Theorem 9.40 proceeds by a rather involved reduction of the
$\omega$-reachability problem for lossy channel systems (Schnoebelen, 2002). It
essentially uses the fact that the number of function iterations is infinite. This
observation opens a second possibility for a fine-grained complexity analysis:
what happens if we consider DTMs where only finitely (but unboundedly) many
function iterations are allowed? In this case the interpretation of $\mathcal{DTL}$-terms
containing temporal operations depends of course on the iteration step of $g$.

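More precisely, let $\mathfrak{A} = \langle \mathfrak{M}, g \rangle$ be a DTM based on a topological space
$\langle U, \mathbb{I} \rangle$, let $N > 0$ be the allowed number of iterations of $g$, and let $n \leq N$. Given a
$\mathcal{DTL}$-term $\tau$, we define $\tau^{\mathfrak{A},n,N}$, the extension of $\tau$ after $n$ steps in the DTM $\mathfrak{A}$
with $N$ iterations, inductively as follows:


■ $p_i^{\mathfrak{A},n,N} = p_i^{\mathfrak{M}}$;

■ the Boolean operators and $\mathbf{I}$ are interpreted as before at each step, e.g. $(\mathbf{I}\tau)^{\mathfrak{A},n,N} = \mathbb{I}\,\tau^{\mathfrak{A},n,N}$;

■ $(\bigcirc\tau)^{\mathfrak{A},n,N} = \emptyset$ for $n = N$, and $(\bigcirc\tau)^{\mathfrak{A},n,N} = g^{-1}(\tau^{\mathfrak{A},n+1,N})$ otherwise;

■ $(\Diamond_F\tau)^{\mathfrak{A},n,N} = \bigcup_{m=n+1}^{N} g^{-(m-n)}(\tau^{\mathfrak{A},m,N})$.


Say that $\tau$ is satisfiable in DTMs from a class $\mathcal{M}$ with finite iterations, or
fi-satisfiable in $\mathcal{M}$, for short, if there exist a DTM $\mathfrak{A}$ in $\mathcal{M}$ and $N > 0$ such that
$\tau^{\mathfrak{A},0,N} \neq \emptyset$.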
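
A small evaluator helps to see how the bound on the number of iterations changes the meaning of a term. The sketch below is an assumption-laden illustration rather than the authors' definition. It reads the clauses as follows: the 'next' of a term is empty at the last step $N$, and the 'eventually' of a term at step $n$ collects the points that $g$ moves into the term at some later step $m \leq N$. The operator $\mathbf{I}$ is omitted, terms are encoded as nested tuples, and all names are hypothetical.

```python
def iterate(g, w, n):
    """Apply g to w exactly n times."""
    for _ in range(n):
        w = g[w]
    return w


def extension(term, n, N, g, universe, val):
    """Extension of `term` after n steps when only N iterations of g are allowed."""
    kind = term[0]
    if kind == "var":
        return frozenset(val[term[1]])
    if kind == "not":
        return universe - extension(term[1], n, N, g, universe, val)
    if kind == "and":
        return (extension(term[1], n, N, g, universe, val)
                & extension(term[2], n, N, g, universe, val))
    if kind == "next":
        if n == N:                      # no further iteration is available
            return frozenset()
        inner = extension(term[1], n + 1, N, g, universe, val)
        return frozenset(w for w in universe if g[w] in inner)
    if kind == "ev":                    # 'eventually', restricted to steps n+1..N
        result = set()
        for m in range(n + 1, N + 1):
            inner = extension(term[1], m, N, g, universe, val)
            result |= {w for w in universe if iterate(g, w, m - n) in inner}
        return frozenset(result)
    raise ValueError(f"unknown term constructor: {kind!r}")


# Hypothetical example: g shifts each point up by one and fixes the point 3.
U = frozenset({0, 1, 2, 3})
g = {0: 1, 1: 2, 2: 3, 3: 3}
val = {"p": {3}}
P = ("var", "p")
triple_next = ("next", ("next", ("next", P)))
```

With two iterations the point 0 cannot reach `p`, while with three iterations it can, so whether a term has a nonempty extension at step 0 depends on the chosen $N$.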

It is not hard to see that the reduction of Post’s correspondence problem from
the proof of Theorem 9.39 can also be used to prove the following:


THEOREM 9.41 Let $\mathcal{M}$ be any of the classes of DTMs mentioned in
Theorem 9.38. Then fi-satisfiability of $\mathcal{DTL}$-terms in $\mathcal{M}$ is undecidable.


By contrast, if we consider the class of DTMs based on arbitrary topological
spaces with continuous functions, then one can first reduce fi-satisfiability
in this class to fi-satisfiability in DTMs based on finite Aleksandrov spaces
with continuous functions, and then use Kruskal’s tree theorem to prove the
following:


THEOREM 9.42 Let $\mathcal{M}$ be one of the following classes:
■ DTMs based on Aleksandrov spaces with continuous functions,
■ DTMs based on topological spaces with continuous functions.


Then fi-satisfiability of $\mathcal{DTL}$-terms in $\mathcal{M}$ is decidable, but not in primitive
recursive time.


The non-primitive recursive lower bound is proved by reduction of the reach- 
ability problem for lossy channel systems. All details can be found in Gabelaia 
et al., 2006. 


8.2 Dynamic metric logics 


In this section we consider our fourth formal model: dynamic metric
systems. Similarly to dynamic topological models from Sec. 8.1, a dynamic metric
model (DMM, for short) is a pair of the form


$\mathfrak{A} = \langle \mathfrak{M}, g \rangle$,


where $\mathfrak{M} = \langle \mathfrak{D}, p_0^{\mathfrak{M}}, p_1^{\mathfrak{M}}, \dots \rangle$ is a metric model based on a metric space
$\mathfrak{D} = \langle \Delta, d \rangle$ and $g$ is a function on $\mathfrak{D}$. We will only consider isometric functions,
i.e., bijections on $\Delta$ such that $d(x, y) = d(g(x), g(y))$, for all $x, y \in \Delta$. For
instance, the translation $x \mapsto x + 1$ and the reflection $x \mapsto -x$ on $\mathbb{R}$, and the
rotations $g_\alpha$ of the two-dimensional unit ball $B^2 = \{(x_1, x_2) \in \mathbb{R}^2 \mid x_1^2 + x_2^2 < 1\}$
by the angle $\alpha$ around $(0, 0)$, are isometric automorphisms on the respective
spaces.
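
The isometry condition can be spot-checked numerically. The sketch below is only a sanity check on hand-picked sample points, so it can refute the condition but never prove it; the helper names and the samples are hypothetical.

```python
import math


def preserves_distances(g, d, samples, tol=1e-9):
    """Check d(x, y) == d(g(x), g(y)) on every pair of sample points."""
    return all(
        math.isclose(d(x, y), d(g(x), g(y)), abs_tol=tol)
        for x in samples for y in samples
    )


def d_line(x, y):
    """The usual metric on the real line."""
    return abs(x - y)


def rotation(alpha):
    """Rotation of the plane by the angle alpha around (0, 0)."""
    c, s = math.cos(alpha), math.sin(alpha)
    return lambda p: (c * p[0] - s * p[1], s * p[0] + c * p[1])


line_samples = [-2.0, -0.5, 0.0, 1.0, 3.25]
disc_samples = [(0.0, 0.0), (0.5, 0.0), (0.0, -0.5), (0.3, 0.4), (-0.6, 0.2)]

translation_ok = preserves_distances(lambda x: x + 1, d_line, line_samples)
reflection_ok = preserves_distances(lambda x: -x, d_line, line_samples)
doubling_ok = preserves_distances(lambda x: 2 * x, d_line, line_samples)
rotation_ok = preserves_distances(rotation(math.pi / 3), math.dist, disc_samples)
```

The doubling map is included as a contrast: it is a bijection on the real line but changes distances, so it is not an admissible function for a DMM.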

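We only consider the simplest language $\mathcal{DML}^{\leq}$ of dynamic metric logic,
which is defined in the same way as $\mathcal{DTL}$ with the exception that the topological
operators are now replaced by the metric operators $\exists^{\leq a}$ and $\forall^{\leq a}$, for $a \in \mathbb{Q}^{\geq 0}$.
Formally, $\mathcal{DML}^{\leq}$-terms are


$\tau \ ::= \ p \ \mid \ \overline{\tau} \ \mid \ \tau_1 \sqcap \tau_2 \ \mid \ \exists^{\leq a}\tau \ \mid \ \bigcirc\tau \ \mid \ \Diamond_F\tau \ \mid \ \Box_F\tau$.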

















Again, we omit the ‘until’ operator to keep our language simple, although all 
the results can be extended to the language including ‘until.’ 

In a dynamic metric model $\mathfrak{A} = \langle \mathfrak{M}, g \rangle$, terms $\tau$ are interpreted as sets
$\tau^{\mathfrak{A}} \subseteq \Delta$, where $\mathfrak{M}$ is based on the metric space $\mathfrak{D} = \langle \Delta, d \rangle$. For spatial
variables we have $p_i^{\mathfrak{A}} = p_i^{\mathfrak{M}}$; the Boolean operators are interpreted as usual,
the metric operators $\exists^{\leq a}$ and $\forall^{\leq a}$ as in Sec. 3.3, and the temporal operators
as in Sec. 8.1.

The notions of satisfiability and validity of $\mathcal{DML}^{\leq}$-terms are defined in
the standard way. The next theorem connects satisfiability in DMMs with
satisfiability in snapshot models from Sec. 7:


THEOREM 9.43 A $\mathcal{DML}^{\leq}$-term $\tau$ is satisfiable in a DMM with an isometric
function iff the formula $\neg(\tau = \bot)$ is satisfiable in a metric snapshot model
based on the same metric space.


As was the case with metric temporal logics in Sec. 7, dynamic metric logics
are slightly simpler than their topological counterparts:


THEOREM 9.44 The set of $\mathcal{DML}^{\leq}$-terms that are valid in DMMs with
isometric functions is decidable. However, the decision problem is not elementary.


This theorem should not come as a surprise: its claim and the proof are
essentially the same as those of Theorem 9.34 (all details can be found in Konev
et al., 2006b).


9. Related ‘temporalised’ formalisms 


The logics we have considered in this chapter can be regarded as temporali- 
sations of static spatial logics. As many other ‘static’ logics have also been 
extended by a temporal dimension, for example, first-order temporal logic 
(Gabbay et al., 1994), temporal epistemic logic (Fagin et al., 1995), tempo- 
ral description logic (Gabbay et al., 2003), it makes sense to briefly discuss 
similarities and differences between these temporalisations. 

The most generic approach to the temporalisation of a static logic is of course
first-order temporal logic. In this logic, temporal operators may occur
anywhere in first-order formulas (in particular, in the scope of quantifiers), and
the intended models are flows of time where each time point is represented
by a relational structure interpreting the first-order part of the language. It has
been known since the 1960s that the resulting logics are extremely complex, mostly
$\Sigma_1^1$-complete (see, e.g., Gabbay et al., 1994, Gabbay et al., 2003 and references
therein). For example, the two-variable fragment, the monadic fragment, and
the guarded fragment of first-order temporal logic over the natural numbers and
with constant or expanding domains are $\Sigma_1^1$-complete.

Only recently the so-called monodic fragments of first-order temporal logics 
(in which temporal operators are only applied to formulas with at most one 
free variable) have been identified as expressive yet often decidable (or at least 
recursively enumerable) fragments (Hodkinson et al., 2000; Hodkinson et al., 
2001; Gabbay et al., 2003). The positive results about the monodic fragments 
rely, however, on the fact that they are not able to express that a binary relation 
does not change over time. In other words, in the monodic fragments one can 
reason about the change (or non-change) of unary predicates but not about the 
change (or non-change) of binary relations. This feature of monodic fragments
is in sharp contrast with the logics we encounter in the context of spatio-temporal
representation and reasoning: as we have seen, in this case we usually expect 
the underlying space (e.g., a metric or topological space) not to change in time. 
What changes is the extension of unary predicates. That is to say, we almost 
always have at least one constant binary relation (or higher-order operator): 
in metric spaces the relation R(x, y) defined by d(x,y) < a, in Aleksandrov 
spaces the relation R inducing the topological space, in arbitrary topological 
spaces even the higher-order interior operator, etc. For this reason, the results on 
the decidability of monodic fragments do not apply to spatio-temporal logics. In 
fact, we have seen that the straightforward combination of spatial and temporal 
formalisms almost always leads to highly undecidable logics. In the more 
abstract setting of products of modal logics this phenomenon has been recently 
investigated by Gabbay et al. (2003) and Gabelaia et al. (2005b, 2006). 

The main message to be deduced from the results on combinations of spatial 
and temporal formalisms is that a fine-tuned analysis of both the spatial logic 
and the interaction between spatial and temporal operators is required in order 
to obtain expressive and still decidable formalisms. There appears to be no 
general way of translating positive results from other temporalisations to spatio- 
temporal logics. Actually, this is also the case for temporal epistemic logic and 
temporal description logic. Again, most of the ‘positive’ results in those areas 
depend on the assumption that one cannot reason about the change (and non- 
change) of binary relations. With one exception, the results in those areas 
are therefore much closer to the results on monodic fragments of first-order 
temporal logics than to the results on spatio-temporal logics. 

The only exception from this rule we know is the decidability (in non- 
elementary time) of the satisfiability problem for terms of the metric temporal 
and dynamic logics. Although this result cannot be obtained as an instance 
of a known result from other temporalised formalisms, its proof nevertheless 
closely resembles the proofs of the following results: 


= The decidability (in non-elementary time) of the temporal epistemic logic 
with multi-modal S5 interpreted in synchronous systems with perfect 
recall and no learning (Halpern and Vardi, 1989). 


= In temporal description logic, the decidability (in non-elementary time) 
of the satisfiability problem for temporalised ALC where roles (binary 
predicates) do not change over time (Wolter and Zakharyaschev, 1999; 
Gabbay et al., 2003). 


In all these cases, we deal with models where certain relations do not change over 
time (in the epistemic case these are the equivalence relations interpreting the 
epistemic operators, in the description logic case these are the roles interpreting 
the value restrictions). The crucial property underlying the decidability proofs 
is that those constant relations can be assumed to form tree-like structures and
that the satisfaction relation is ‘local’ in the sense that the interpretation of
terms (propositional variables/concepts) at a certain distance from the root of
the tree-like structure does not influence the satisfaction relation at the root.

Notice, however, that in each case one has to consider carefully the con- 
straints on the relations. As we know from Theorem 9.21, a decidability proof 
does not go through for transitive relations (from Aleksandrov spaces) which 
do not change over time.


1. Introduction


Dynamic Topological Logic provides a context for studying the confluence 
of three research areas: the topological semantics for S4, topological dynamics, 
and temporal logic. In the topological semantics, a model is a topological space 
X together with a valuation function V assigning to each propositional variable 
a subset of X. Conjunction is interpreted as intersection, disjunction as union, 
and negation as complementation. If we interpret the necessity connective, $\Box$,
as topological interior, the resulting modal logic is S4. It has axioms


$\Box(A \supset B) \supset (\Box A \supset \Box B)$, $\Box A \supset A$, $\Box A \supset \Box\Box A$


with modus ponens and necessitation, $A/\Box A$, as inference rules. Thus S4 is a
general logic of topological spaces. For a general and comprehensive discussion
see for example Rasiowa and Sikorski, 1963.

Topological dynamics studies the asymptotic properties of continuous maps
on topological spaces (Walters, 1982, p. 118). Let a dynamic topological system
be an ordered pair $\langle X, f \rangle$ where $X$ is a topological space and $f$ is a continuous
function on $X$. We can think of the function $f$ as moving the points in $X$ in
each discrete unit of time: $x$ gets moved to $fx$ and then to $ffx$ and so on. In
Dynamic Topological Logic, the system S4 is extended to a logic of dynamic
topological systems, by adding temporal modalities suited to formalizing the
action of $f$ on $X$. In particular, we want to formalize both the transition from
one discrete moment to the next, as $f$ acts, moment by moment, on the points in
$X$; and the asymptotic behaviour of the function $f$.

We turn to $\omega$-time temporal logic with two future-looking modalities: next,
$\bigcirc$; and henceforth, $\ast$. Suppose that we ignore topological issues and represent
discrete moments as natural numbers. Let an interpretation be an assignment
of a truth value to each propositional variable at each moment. The Boolean
connectives are given their standard interpretations. As for the modalities, the
formula $\bigcirc A$ is true at the moment $m$ iff $A$ is true at the next moment $m + 1$;
and the formula $\ast A$ is true at the moment $m$ iff $A$ is true at the moment $n$, for
each $n \geq m$. Note that $\ast A$ is thus equivalent to the infinite conjunction


$A \,\&\, \bigcirc A \,\&\, \bigcirc\bigcirc A \,\&\, \bigcirc\bigcirc\bigcirc A \,\&\, \dots$


The Linear Time Temporal Logic LTL of $\bigcirc$ and $\ast$ (cf. van Benthem, 1995 for
an introduction and history) can be axiomatized by the classical tautologies; S4
axioms for $\ast$; the following axioms,


$\bigcirc(A \vee B) \equiv (\bigcirc A \vee \bigcirc B)$; $\bigcirc\neg A \equiv \neg\bigcirc A$;

$\bigcirc\ast A \equiv \ast\bigcirc A$; $\ast A \supset \bigcirc A$;

The induction axiom: $(A \,\&\, \ast(A \supset \bigcirc A)) \supset \ast A$;


and the rules of Modus Ponens, and necessitation for $\ast$ (cf. Goldblatt, 1992 for
example).

In this chapter, we combine the topological modality and the two temporal
modalities, to define trimodal logics of dynamic topological systems or
dynamical topological logics (abbreviated DTLs). Let a dynamic topological model
be an ordered triple $\langle X, f, V \rangle$, where $\langle X, f \rangle$ is a dynamic topological system
and $V$ is a valuation function assigning to each propositional variable a subset
of $X$. If we think of the subsets of $X$ as the propositions, then, as in the static
topological semantics, $\Box P = Int(P)$, for propositions $P$, where $Int$ is the
topological interior operation. We interpret the temporal modalities $\bigcirc$ and $\ast$
using the function $f$. Suppose that, at moment $m$, the proposition $P$ is true at
the point $fx$, i.e. $fx \in P$. Then after $f$ has acted on $x$ once, $P$ will be true at
$x$. In other words, at the next moment $m + 1$, the proposition $P$ is true at the
point $x$. So at moment $m$, the proposition $\bigcirc P$ is true at $x$, that is $x \in \bigcirc P$ iff
$fx \in P$ iff $x \in f^{-1}(P)$. Thus the next modality is interpreted by the preimage
under $f$:


$\bigcirc P = f^{-1}(P)$.


The interpretation of $\ast$ as an infinite conjunction leads to the definition:

$\ast P = \bigcap_{n \geq 0} f^{-n}(P)$.


Our goal is to investigate interesting classes of dynamical systems in this 
propositional framework. The restriction to a propositional framework is made 
for three reasons. First, this is the most natural way to extend the extensive work 
done on propositional modal logics: work on their axiomatizability, expressive 
power, and so on. Second, propositional systems are much more manageable 
than extensions admitting quantifiers. Third, it is interesting to investigate the 
expressive power of a purely propositional language: for example the Poincaré 
Recurrence Theorem can be stated in this language—cf. Sec. 3. In fact, this 
is the principal example that started our investigation: finding a manageable 
DTL of measure-preserving transformations is one of the principal unsolved 
problems in dynamical topological logic. We try to avoid, as far as possi- 
ble, generalizations so wide that they are unlikely, given our present state of 
knowledge, to lead to results useful in mainstream mathematics. 

As shown in (Konev et al., 2006a), the presence of $\ast$, the henceforth
connective, often leads to incompleteness for a straightforward semantics. In
particular, a significant range of DTLs are not axiomatizable: the DTL of
homeomorphisms, the DTL of homeomorphisms on $\mathbb{R}^n$ (for any fixed $n > 1$), the
DTL of homeomorphisms on Alexandrov spaces (see below), and the DTL of
measure-preserving homeomorphisms on the unit ball of dimension $n$, where
$n > 2$. Konev et al. (2006a) leave open the axiomatizability problem of DTLs
that are based on continuous functions in general, rather than homeomorphisms.

Given the results in Konev et al., 2006a, most of the positive results for
DTL are for a fragment of the trimodal language. Two extreme cases are
$\Box$-free purely temporal formulas, and purely modal formulas without temporal
connectives. In Sec. 4, we show that most DTLs are conservative over each of
these fragments.

We also have a number of completeness results for the $\Box\bigcirc$ fragment of the
language. For these results, we define two axiomatic systems: the logic of
continuous functions S4C, introduced in Artemov et al., 1997, and the logic of
homeomorphisms S4H, introduced in Kremer and Mints, 1997. S4C is
defined by the following axioms and rules:


Axioms  the classical tautologies,
        S4 axioms for $\Box$,
        $\bigcirc(A \vee B) \equiv (\bigcirc A \vee \bigcirc B)$,
        $\bigcirc\neg A \equiv \neg\bigcirc A$,
        the continuity axiom: $\bigcirc\Box A \supset \Box\bigcirc A$


Rules   Modus Ponens: $A, (A \supset B)/B$
        Necessitation for $\Box$: $A/\Box A$
        Necessitation for $\bigcirc$: $A/\bigcirc A$.


S4H is defined by the axioms and rules of S4C, plus the following:


        the converse of the continuity axiom: $\Box\bigcirc A \supset \bigcirc\Box A$


We will use S4C and S4H both for these axiomatizations and for the sets
of all formulas derivable from the axioms by the inference rules.

Our main completeness results for the $\Box\bigcirc$ fragment of the language are as
follows:


Sec. 6: S4H is sound and complete for
■ homeomorphisms on topological spaces
■ homeomorphisms on Alexandrov spaces
■ homeomorphisms on finite topological spaces
■ homeomorphisms on Cantor Space (This result is noted in Sec. 7.)
■ homeomorphisms on $\mathbb{R}$
■ homeomorphisms on the open [closed] unit interval

Sec. 7: S4C is sound and complete for
■ continuous functions on topological spaces
■ continuous functions on Alexandrov spaces, i.e. monotonic functions on
Kripke frames (see Definitions 10.2 and 10.10)
■ continuous functions on finite topological spaces
■ continuous functions on Cantor Space

Most of our completeness results for mathematically interesting spaces are
based on the topological completeness of S4 in the real line $\mathbb{R}$, a classic result
of McKinsey and Tarski, 1944. In Sec. 5, we give a simplified proof of this
classic result, based on a specially simple proof for rational numbers in the
interval (0, 1); this is followed, for real numbers in (0, 1), by a passage to the limit.
Our main use of this result is in Sec. 6: our proof that S4H is complete for
homeomorphic transformations on $\mathbb{R}$ proceeds first by giving a simple reduction
of an arbitrary formula (not containing $\ast$) to a formula that is almost $\bigcirc$-free;
this reduction allows us to appeal to the completeness of S4 in $\mathbb{R}$.

This result does not extend to the logic S4C (Sec. 7): S4C is not complete
for arbitrary continuous functions on $\mathbb{R}$, as shown by counterexamples due to
P. Kremer (2004) and S. Slavnov (2003). However, we are able to prove
completeness for Cantor Space (Sec. 7.5), an important subspace of $\mathbb{R}$. There are
also two positive results in the vicinity. Slavnov, 2005 contains a proof that S4C
is complete for arbitrary continuous functions on any $\mathbb{R}^n$, $n > 1$; to be more
precise, S4C is complete for the following set of dynamic topological systems:
$\{\langle \mathbb{R}^n, f \rangle : n > 1 \text{ and } f \text{ is a continuous function on } \mathbb{R}^n\}$. Fernandez, 2006
contains a proof that S4C is complete for arbitrary continuous functions on $\mathbb{R}^2$.
Whether the logic of continuous functions on $\mathbb{R}$ is axiomatizable remains open.

We also note the completeness of a special axiomatization of a temporal 
over topological fragment where the temporal modalities cannot occur in the 
scope of a topological modality. See Kremer and Mints, 1997 and more recent 
results on recursive axiomatizability of the topological over temporal fragment 
in Konev et al., 2006b. Ch. 9 in this Handbook contains a great deal of material 
on the interaction between spatial and temporal logic. 

The current chapter is part of a research program whose first results were 
announced in three conference abstracts (Kremer, 1997, Kremer and Mints, 
1997, and Kremer et al., 1997). An independent and closely related research 
program saw its first results published in Artemov et al., 1997, and has been 
further pursued in Davoren, 1998. 


2. Basic definitions 


We work with a trimodal language $L$ with a set $PV$ of propositional variables;
Boolean connectives $\vee$ and $\neg$; and three one-place modalities $\Box$ (interior), $\bigcirc$
(next) and $\ast$ (henceforth). We assume that $\&$, $\supset$ and $\equiv$ are defined in terms of $\vee$
and $\neg$. We use $p, q, r$ as metavariables over $PV$ and $A, B, C$ as metavariables
over formulas. The language $L^{\Box}$ is the fragment of $L$ whose only modality
is $\Box$; and the language $L^{\Box\bigcirc}$ is the fragment of $L$ whose only modalities are
$\Box$ and $\bigcirc$.


DEFINITION 10.1 A topological model is an ordered pair, $M = \langle X, V \rangle$,
where $X$ is a topological space and $V : PV \to \mathcal{P}(X)$ is a valuation function
assigning a subset of $X$ to each propositional variable. The valuation $V$ is
extended to all formulas of $L^{\Box}$ as follows:


$V(A \vee B) = V(A) \cup V(B)$,
$V(\neg B) = X - V(B)$, and
$V(\Box B) = Int(V(B))$,

where the interior of a subset $Y \subseteq X$ is denoted by $Int(Y)$. A formula $A$ is
validated by $M = \langle X, V \rangle$ iff $V(A) = X$. Notation: $M \models A$.


DEFINITION 10.2 A Kripke frame is an ordered pair $\langle W, R \rangle$ where $W$ is a
nonempty set (of worlds) and $R$ is a reflexive and transitive relation on $W$. World
$w'$ is a successor of world $w$ iff $wRw'$. $w$ is $R$-equivalent to $w'$ (in symbols,
$w \equiv_R w'$) iff $wRw'$ and $w'Rw$.


DEFINITION 10.3 Given a Kripke frame $\langle W, R \rangle$, a subset $S$ of $W$ is open iff
$S$ is closed under $R$: for every $x, y \in W$, if $x \in S$ and $xRy$ then $y \in S$. The
family of open sets forms a topology. Thus, for every Kripke frame $\langle W, R \rangle$, a
topological space is defined by imposing that topology on the set $W$. We define
the interior of $X \subseteq W$:


$Int(X) =_{df} \{w \in W : \forall w' \in W, \text{ if } wRw' \text{ then } w' \in X\}$.


Note that, in these spaces, the intersection of arbitrary open sets is open: thus 
they are Alexandrov spaces (cf. Alexandrov, 1937) as defined in Artemov et al., 
1997. In fact, as noted in Artemov et al., 1997 every Alexandrov space is 
generated in a natural way by a Kripke frame, so we can identify Kripke frames 
and Alexandrov spaces. 
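
The interior operation of Definition 10.3 is straightforward to compute on a finite Kripke frame. The following sketch is an illustration under assumptions: the frame is a hypothetical three-element chain, and the names `interior`, `subsets` and `opens` are not taken from the chapter.

```python
from itertools import combinations


def interior(X, W, R):
    """Int(X): the worlds all of whose R-successors lie in X."""
    return frozenset(w for w in W if all(v in X for v in W if (w, v) in R))


def subsets(W):
    items = sorted(W)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


# Hypothetical frame: the chain 0 R 1 R 2, made reflexive and transitive.
W = frozenset({0, 1, 2})
R = frozenset({(0, 0), (1, 1), (2, 2), (0, 1), (1, 2), (0, 2)})

# A set is open exactly when it equals its own interior.
opens = [S for S in subsets(W) if interior(S, W, R) == S]
```

On this frame the open sets are the four upward-closed sets, and the S4 laws for the interior modality (interior is contained in the set, and taking the interior twice changes nothing) can be confirmed by enumerating all eight subsets.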

The next result (McKinsey and Tarski, 1944, Kripke, 1963) provides
completeness for purely modal formulas.


THEOREM 10.4 (MCKINSEY-TARSKI-KRIPKE) Suppose that $X$ is a
dense-in-itself metric space and $A$ is a formula that does not contain the temporal
connectives $\bigcirc$ and $\ast$. Then the following are equivalent:


(i) $A \in$ S4.
(ii) $\models A$.
(iii) $X \models A$.
(iv) $\mathbb{R} \models A$.
(v) $Y \models A$ for every finite topological space $Y$.
(vi) $Y \models A$ for every Alexandrov space $Y$.


Proof The equivalence of (i)-(v) is due to McKinsey and Tarski, 1944. For
the completeness of S4 in the real line, see a streamlined proof in Sec. 5. The
equivalence of (i) and (vi) is due, in effect, to Kripke, 1963. QED


Thus not only does the topological interpretation give a semantics for S4, but
S4 is the topological logic of a host of particular topological spaces, for
example the real line, $\mathbb{R}$; the closed unit interval, [0, 1]; and any other dense-in-itself
metric space. So the purely modal language is expressively weak, unable, for
example, to distinguish between $\mathbb{R}$ and [0, 1] despite their topological
dissimilarities. Part of the DTL project is to see whether analogues of Theorem 10.4
can be proved or disproved: see Sec. 6 and Sec. 7 below.


DEFINITION 10.5 A dynamic topological system (DTS, cf. Brown, 1976,
Furstenberg, 1981) is an ordered pair, $\langle X, f \rangle$, where $X$ is a topological space
and $f$ is a continuous function on $X$. A dynamic topological model (DTM) is
an ordered triple $M = \langle X, f, V \rangle$ where $\langle X, f \rangle$ is a DTS and $V : PV \to \mathcal{P}(X)$
is a valuation function assigning a subset of $X$ to each propositional variable.
The valuation $V$ is extended to all formulas by the clauses in Definition 10.1
plus the following:


$V(\bigcirc B) = f^{-1}(V(B))$, and
$V(\ast B) = \bigcap_{n \geq 0} f^{-n}(V(B))$.


A formula $A$ is validated by $M = \langle X, f, V \rangle$ iff $V(A) = X$. Notation: $M \models A$.


DEFINITION 10.6 Suppose that $\langle X, f\rangle$ is a DTS. Validity relations
are defined in a standard way.

$\langle X, f\rangle \models B$ iff $M \models B$ for every model $M = \langle X, f, V\rangle$.
$X \models B$ iff $\langle X, f\rangle \models B$ for every continuous function $f$.
$B$ is valid ($\models B$) iff $X \models B$ for every topological space $X$.





Our goal is to axiomatize interesting classes of topological spaces and contin- 
uous functions on these spaces. 


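DEFINITION 10.7 Suppose that $\mathcal{F}$ is a class of functions so that each
$f \in \mathcal{F}$ is a continuous function on some topological space. Suppose
that $\mathcal{T}$ is a class of topological spaces.

$\mathcal{T}, \mathcal{F} \models B$ iff for every $f \in \mathcal{F}$ and every
$X \in \mathcal{T}$, if $f$ is a continuous function on $X$ then
$\langle X, f\rangle \models B$.

$\mathcal{F} \models B$ iff for every topological space $X$ and every
$f \in \mathcal{F}$, if $f$ is a continuous function on $X$ then
$\langle X, f\rangle \models B$.

$\mathcal{T} \models B$ iff $X \models B$ for every topological space
$X \in \mathcal{T}$.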





We assume that, in specifying a particular continuous function, we specify
both the function itself as a set of ordered pairs, and the topological space on
which we are taking it to act.

We are now ready to define various Dynamic Topological Logics, or DTLs.

DEFINITION 10.8 For any class $\mathcal{T}$ of topological spaces and any class
$\mathcal{F}$ of continuous functions, we define

$DTL_{\mathcal{T},\mathcal{F}} = \{A : \mathcal{T}, \mathcal{F} \models A\}$,
$DTL_{\mathcal{T}} = \{A : \mathcal{T} \models A\}$,
$DTL_{\mathcal{F}} = \{A : \mathcal{F} \models A\}$.





Most of our completeness results for interesting spaces and classes of spaces 
are based on completeness for finite spaces of a special kind. 


DEFINITION 10.9 A dynamic Kripke frame is an ordered triple
$K = \langle W, R, f\rangle$ where $\langle W, R\rangle$ is a Kripke frame, and
$f$ is a function on $W$ that is monotonic with respect to $R$, i.e.

$wRw' \Rightarrow fwRfw'$.

The world $r \in W$ is a root world of $\langle W, R, f\rangle$ iff both
$fr = r$ and $\forall w \in W$, $rRw$. $\langle W, R, f\rangle$ is rooted iff
there is some root world $r \in W$.


Our dynamic Kripke frames are the continuous Kripke frames of Artemov
et al. (1997). They give rise to dynamic Alexandrov systems in an obvious way
(cf. Definition 10.3).


DEFINITION 10.10 A dynamic Kripke model is a quadruple
$M = \langle W, R, f, V\rangle$, where $\langle W, R, f\rangle$ is a dynamic
Kripke frame and $V$ is a valuation function assigning a subset of $W$ to each
propositional variable. The valuation $V$ is extended to all formulas as
follows:

$V(A \vee B) = V(A) \cup V(B)$
$V(A \,\&\, B) = V(A) \cap V(B)$
$V(\neg A) = W - V(A)$
$V(\Box A) = \mathrm{Int}(V(A))$
$V(\bigcirc A) = f^{-1}(V(A))$
$V(\ast A) = \bigcap_{n \ge 0} f^{-n}(V(A))$

$\langle W, R, f, V\rangle$ is rooted iff $\langle W, R, f\rangle$ is rooted. A
formula $A$ is validated by $M = \langle W, R, f, V\rangle$ iff $V(A) = W$.
Notation: $M \models A$.

This is equivalent to the familiar definition for Kripke models:

$w \in V(\Box A) \iff w' \in V(A)$ for all $w'$ such that $wRw'$

$w \in V(\bigcirc A) \iff f(w) \in V(A)$


Given a particular DTL, we will also be interested in its fragments.

DEFINITION 10.11 If $D$ is a dynamic topological logic, then the purely
topological fragment of $D$ is the $(\bigcirc, \ast)$-free fragment, that is,
the set of formulas without $\bigcirc$ and $\ast$ belonging to $D$. The purely
temporal fragment of $D$ is the $\Box$-free fragment. The next-interior
fragment of $D$ is the $\ast$-free fragment. We denote these logics as
$D^{\Box}$, $D^{\bigcirc\ast}$ and $D^{\Box\bigcirc}$, respectively.


We are especially interested in three DTLs: the DTL of all dynamic
topological systems, $DTL_0$; the DTL of homeomorphisms, $DTL_{\mathcal{H}}$;
and the DTL of measure-preserving continuous functions on the closed unit
interval, $DTL_{\mathcal{M}}$. Though mathematically less interesting, the DTL
of Alexandrov spaces, $DTL_{\mathcal{A}}$, is important as a starting point for
the investigation of other DTLs, since Alexandrov spaces are, in effect, Kripke
frames. $DTL_{\mathcal{H}}$ is nonaxiomatizable (Konev et al., 2006a), and the
question of the axiomatizability of the other systems is still open.

Though the axiomatizability of $DTL_{\mathcal{A}}$ remains an open question, it
is easy to see that $DTL_0 \subsetneq DTL_{\mathcal{A}}$. For we have (10.1)
and (10.2), below:

(10.1) $(\ast\Box p \supset \Box\ast p) \notin DTL_0$

(10.2) $(\ast\Box p \supset \Box\ast p) \in DTL_{\mathcal{A}}$.

(10.2) follows from the fact that, in an Alexandrov space, the intersection of
arbitrary open sets is open. To see (10.1), let $M = \langle \mathbb{R}, f, V\rangle$
where $f(x) = 2x$ and $V(p) = (-1, 1)$. Note that $V(\Box p) = (-1, 1)$ so that
$f^{-n}(V(\Box p)) = (-1/2^n, 1/2^n)$. Thus $V(\ast\Box p) = \{0\}$. Similarly,
$V(\ast p) = \{0\}$. So $V(\Box\ast p) = \emptyset$. So
$M \not\models (\ast\Box p \supset \Box\ast p)$.
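The counterexample for (10.1) can be checked mechanically. The following is a minimal sketch, assuming open intervals are represented as pairs of exact fractions; the helper names `preimage` and `in_star_box_p` are illustrative only and do not come from the text. Since the intersection over all $n$ cannot be computed directly, the check is truncated at a finite depth.

```python
from fractions import Fraction

def preimage(interval, n):
    """f^{-n}(interval) for f(x) = 2x: both endpoints are divided by 2^n."""
    lo, hi = interval
    return (lo / 2**n, hi / 2**n)

def in_star_box_p(x, depth):
    """Check that x lies in f^{-n}((-1, 1)) for every n = 0, ..., depth.

    This is a finite approximation of membership in V(*[]p), where
    V([]p) = V(p) = (-1, 1) because (-1, 1) is already open.
    """
    v_box_p = (Fraction(-1), Fraction(1))
    for n in range(depth + 1):
        lo, hi = preimage(v_box_p, n)
        if not (lo < x < hi):
            return False
    return True
```

The point 0 passes the check at every depth, while any nonzero point fails once $2^{-n}$ drops to its absolute value. This matches $V(\ast\Box p) = \{0\}$, a set with empty interior.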


3. Recurrence and the DTL of measure-preserving 
continuous functions on the closed unit interval 


A central motivation for this study is the phenomenon of recurrence in mea- 
sure theory and topological dynamics, and the possibility of expressing this 
phenomenon in the framework of propositional logic. In fact, we can express 
recurrence in our trimodal language. Let us restrict attention to the segment 
[0, 1] of reals which is already difficult enough (cf. Konev et al., 2006a). 

Suppose that $f$ is a function on a set $X$ and that $S \subseteq X$. Say that
a point $x \in S$ is recurrent (for $S$) if $f^n(x) \in S$ for some $n \ge 1$.
Let $\mu$ be the Lebesgue measure defined on subsets of the closed unit
interval, [0, 1]. We say that $S$ is measurable iff, for every set $S'$, we
have $\mu(S') = \mu(S \cap S') + \mu(S' - S)$. We say that a function $f$ on
[0, 1] is measure-preserving iff $\mu(f^{-1}(S)) = \mu(S)$ for every measurable
$S \subseteq [0, 1]$. Consider the following (non-essential) extension of the
Poincaré Recurrence Theorem on [0, 1] (see Walters, 1982):


THEOREM 10.12 If $f$ is a measure-preserving continuous function on [0, 1]
then the set of recurrent points of a non-empty open set $S \subseteq [0, 1]$
is dense in $S$.

In order to express recurrence in our trimodal language, define the possibility
connective $\Diamond$ as $\neg\Box\neg$, and the possibility connective $\#$ as
$\neg\ast\neg$. These represent topological closure and “some time in the
future”, respectively. Consider the formula

(10.3) $(\Box p \supset \Diamond\bigcirc\#\Box p)$.

Let $\langle X, f\rangle$ be any dynamic topological system. Note that
$\langle X, f\rangle \models$ (10.3) iff

(10.4) for every open $O \subseteq X$: $O \subseteq \mathrm{Cl}\{x : \text{there is an } n \ge 1 \text{ such that } f^n x \in O\}$.

By Theorem 10.12, (10.4) is true when $X = [0, 1]$ and $f$ is any
measure-preserving continuous function on [0, 1]. Thus
$\langle [0, 1], f\rangle \models$ (10.3) for every such $f$. So, in
some sense, (10.3) expresses the phenomenon of recurrence.
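The step from (10.3) to (10.4) can be unpacked with the valuation clauses of Definition 10.5 and the usual reading of $\Diamond$ as closure. The following is a sketch of that computation; the abbreviation $O$ for $V(\Box p) = \mathrm{Int}(V(p))$ is introduced here only for readability.

```latex
\begin{align*}
V(\# B) &= X - V(\ast\neg B) = \bigcup_{n \ge 0} f^{-n}(V(B)), \\
V(\bigcirc\#\Box p) &= f^{-1}\Bigl(\bigcup_{n \ge 0} f^{-n}(O)\Bigr)
   = \bigcup_{n \ge 1} f^{-n}(O)
   = \{x : f^n x \in O \text{ for some } n \ge 1\}, \\
V(\Diamond\bigcirc\#\Box p) &= \mathrm{Cl}\{x : f^n x \in O \text{ for some } n \ge 1\}.
\end{align*}
```

So (10.3) is validated by a model exactly when $O$ is included in this closure. Every open set arises as $\mathrm{Int}(V(p))$ for some valuation (take $V(p) = O$), which gives the quantification over all open $O$ in (10.4).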

Thus the class $\mathcal{M}$ of measure-preserving functions on [0, 1] is of
interest. As we have just shown, the formula (10.3) is in $DTL_{\mathcal{M}}$.
The results of Konev et al. (2006a) imply that the DTL of measure-preserving
homeomorphisms on [0, 1] is not axiomatizable: the axiomatizability of
$DTL_{\mathcal{M}}$ remains open. One can try to approach these systems in the
same way as first order arithmetic or second order logic: find a manageable
axiomatizable fragment, strong enough to derive most mathematically interesting
results that are statable in a given language.


3.1 A Simple Decidable DTL 


To illustrate the existence of an interesting tractable DTL we prove that the
logic $DTL_{\mathcal{HM}}$ of measure-preserving homeomorphisms of the real
interval [0, 1] is very easily reducible to S4 and hence decidable. The key
observation is the next lemma.


LEMMA 10.13 The only measure-preserving homeomorphisms on [0, 1] are
the identity $f(x) = x$ and $f(x) = 1 - x$.


Proof Since $f$ is a homeomorphism, it takes all values in [0, 1] and is
strictly monotonic, say increasing for definiteness. Then $f(0) = 0$, since
otherwise 0 is not a value of $f$. For every $x \in [0, 1]$,
$\mu(f^{-1}[0, x]) = \mu([0, x]) = x$ and $f^{-1}[0, x] = [0, f^{-1}(x)]$.
Hence $f^{-1}(x) = x$. Applying $f$ to both parts, $f(x) = x$. The decreasing
case is symmetric and yields $f(x) = 1 - x$. QED

For every formula $\varphi$ of S4C define two reducts corresponding to
$f(x) = x$ and $f(x) = 1 - x$ respectively. Note that under $f(x) = x$ one has

$\bigcirc A \leftrightarrow \ast A \leftrightarrow A$

while $f(x) = 1 - x$ for all $x$ implies $f(f(x)) = x$, hence

(10.5) $\bigcirc\bigcirc A \leftrightarrow A$, and $\ast A \leftrightarrow A \,\&\, \bigcirc A$.

Define

• $\varphi^0$ := the result of erasing all occurrences of $\bigcirc$ and $\ast$, and

• $\varphi^1$ := the result of replacing $\bigcirc$ and $\ast$ according to
(10.5), including pushing $\bigcirc$ to atomic formulas and dropping all
occurrences of $\bigcirc\bigcirc$ (cf. the operation $g$ in Sec. 6).

The following theorem provides a decision algorithm for $DTL_{\mathcal{HM}}$.
The notation $\mathrm{S4} \vdash \varphi$ for a formula containing $\bigcirc$
means that $\bigcirc p$ is treated as a new atomic formula (cf. Sec. 6).


THEOREM 10.14 $DTL_{\mathcal{HM}} \vdash \varphi$ iff $\mathrm{S4} \vdash \varphi^0 \,\&\, \varphi^1$.





Proof The direction from S4 to $DTL_{\mathcal{HM}}$ is obvious. In the opposite
direction we have only to prove that

(10.6) $\mathrm{S4} \not\vdash \varphi$ implies $\langle (0,1), f\rangle \not\models \varphi$

for $f(x) = 1 - x$ and every formula $\varphi$ where $\bigcirc$ is applied only
to atoms, that is, where $\varphi$ is constructed from propositional variables
$p$ and formulas $\bigcirc p$ by the Boolean connectives and $\Box$. We apply
the same method as in the proof of the similar result for $\mathbb{R}$ and the
logic of homeomorphisms (Theorem 10.46).

Find an S4-countermodel $M = \langle (\tfrac12, 1), V\rangle$ for $\varphi$ on
the interval $(\tfrac12, 1)$. Let $f(x) = 1 - x$, so that $f^{-1} = f$. Define
a new valuation

$V'(p) = V(p) \cup f^{-1}(V(\bigcirc p)) = V(p) \cup f(V(\bigcirc p))$

on the dynamic topological system $\langle (0,1), f\rangle$. See Fig. 10.1.

Figure 10.1. $V'(p)$ is shaded.

The same computation as in the proof of Theorem 10.46 shows that

$V(B) = I \cap V'(B)$ for all formulas $B$ and $I = (\tfrac12, 1)$.

The only thing to check anew is the implication

$x \in I$ and $x \in V'(\bigcirc^n p) \Rightarrow x \in V(\bigcirc^n p)$ for $n = 0, 1$.

We consider both cases, $n = 0$ and $n = 1$. Suppose that $n = 0$, and suppose
that $x \in I$ and $x \in V'(\bigcirc^n p)$. Then $x \in V'(p)$. So either
$x \in V(p)$ or $x \in f(V(\bigcirc p))$. But $I \cap f(V(\bigcirc p)) = \emptyset$.
So $x \notin f(V(\bigcirc p))$. Thus $x \in V(p)$, as desired.

On the other hand, suppose that $n = 1$, and suppose that $x \in I$ and
$x \in V'(\bigcirc^n p)$. Then $x \in V'(\bigcirc p)$. So $fx \in V'(p)$. So
either $fx \in V(p)$ or $fx \in f(V(\bigcirc p))$. So either
$x \in f^{-1}(V(p)) = f(V(p))$ or $x \in V(\bigcirc p)$. But
$I \cap f(V(p)) = \emptyset$. So $x \notin f(V(p))$. Thus $x \in V(\bigcirc p)$,
as desired. QED


4. Purely topological and purely temporal fragments of 
DTLs 


In work on DTL, we foresee that most of the action will be in the interaction
between the topological modality ($\Box$) and the temporal modalities
($\bigcirc$ and $\ast$). As it turns out, temporal differences often do not
affect purely topological issues (see Theorem 10.15). Furthermore, the purely
topological fragments and the purely temporal fragments of DTLs normally
coincide with previously studied logics (see Theorems 10.16 and 10.20), namely
S4 and LTL.





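THEOREM 10.15 Suppose that $\mathcal{T}$ is a class of topological spaces and
$\mathcal{F}$ is a class of continuous functions. Also suppose that for every
$X \in \mathcal{T}$, there is an $f \in \mathcal{F}$ with $\mathrm{dom}(f) = X$.
Then $DTL^{\Box}_{\mathcal{T},\mathcal{F}} = DTL^{\Box}_{\mathcal{T}}$. Thus
temporal differences do not affect purely topological issues.


Proof The inclusion $\supseteq$ is obvious. For $\subseteq$ take a modal formula
$\alpha \notin DTL_{\mathcal{T}}$ and a space $X \in \mathcal{T}$ with a
continuous function $g$ on $X$ such that $\langle X, g\rangle \not\models \alpha$.
Now replace $g$ with a continuous function $f \in \mathcal{F}$ with the domain
$X$. Since $\alpha$ is modal, $\langle X, f\rangle \models \alpha$ iff
$\langle X, g\rangle \models \alpha$. QED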








THEOREM 10.16 Suppose that $\mathcal{T}$ is a class of topological spaces and
that either

(i) every topological space is in $\mathcal{T}$,
(ii) $\mathbb{R} \in \mathcal{T}$,
(iii) some dense-in-itself metric space is in $\mathcal{T}$,
(iv) every finite topological space is in $\mathcal{T}$, or
(v) every Alexandrov space is in $\mathcal{T}$.

Then $DTL^{\Box}_{\mathcal{T}} = \mathrm{S4}$.














Proof This follows from Theorem 10.15 and the McKinsey-Tarski-Kripke
Theorem (Theorem 10.4). QED


COROLLARY 10.17
$DTL^{\Box}_0 = DTL^{\Box}_{\mathcal{H}} = DTL^{\Box}_{\mathcal{M}} = DTL^{\Box}_{\mathcal{A}} = DTL^{\Box}_{\mathbb{R}} = DTL^{\Box}_{[0,1]} = DTL^{\Box}_{\mathbb{R},\mathcal{H}} = DTL^{\Box}_{\mathcal{A},\mathcal{H}} = DTL^{\Box}_{fin} = \mathrm{S4}$,
where fin is the class of finite topological spaces.


DEFINITION 10.18 Suppose that $f$ is a continuous function and that
$X = \mathrm{dom}(f)$. For $m, n \in \omega$, $f$ has the m-n-property iff
there is some $x \in X$ such that $x, fx, \ldots, f^{m+n}x$ are all distinct
and $f^{m+n+1}x = f^m x$. $f$ has the $\omega$-property iff there is some
$x \in X$ such that $x, fx, f^2x, \ldots$ are all distinct. Suppose that
$\mathcal{F}$ is a class of continuous functions. $\mathcal{F}$ is rich iff
either (i) $\mathcal{F}$ contains some function with the $\omega$-property or
(ii) for each $m, n \in \omega$, $\mathcal{F}$ contains some function with the
m-n-property.


REMARK 10.19 The following classes of functions are rich:

(i) the class $\mathcal{H}$ of homeomorphisms;

(ii) the class $\mathcal{O}$ of open continuous functions (a function is open
iff the image of every open set is open);

(iii) the class $\mathcal{M}$ of measure-preserving continuous functions on
[0, 1]; and

(iv) the class of functions on finite topological spaces with the discrete
topology.

For (i) and (ii) it suffices to find a homeomorphism on $\mathbb{R}$ with the
$\omega$-property, for example $fx = x + 1$. For (iii), the following function
is continuous, measure-preserving and has the $\omega$-property:
$f(x) = 1 - 2x$ for $x \in [0, \tfrac12]$ and $f(x) = 2x - 1$ for
$x \in [\tfrac12, 1]$. To see that $f$ is measure-preserving consider any
measurable $S \subseteq [0, 1]$. Note that
$\mu(f^{-1}(S) \cap [0, \tfrac12]) = \mu(f^{-1}(S) \cap [\tfrac12, 1]) = \tfrac12\mu(S)$
so that $\mu(f^{-1}(S)) = \mu(S)$. See Fig. 10.2. To see that $f$ has the
$\omega$-property, let $x = \sqrt{2} - 1$. Note that $f^n(x)$ is of the form
$z \pm 2^n\sqrt{2}$, where $z$ is an integer, so that $x, fx, f^2x, \ldots$ are
all distinct. For (iv), we fix $m$ and $n$ and define a function with the
m-n-property in the given class. Let $X$ be the set $\{0, 1, 2, \ldots, m+n\}$
and let $fx = x + 1$ if $x < m + n$ and let $f(m+n) = m$.
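The two constructions for items (iii) and (iv) can be tried out with exact arithmetic. The sketch below assumes that points of the orbit of $\sqrt{2} - 1$ are stored as integer pairs $(a, b)$ standing for $a + b\sqrt{2}$; the function names are illustrative and not taken from the text.

```python
from fractions import Fraction

def le_half(a, b):
    """Exact test of a + b*sqrt(2) <= 1/2 for an integer a and a nonzero integer b."""
    c = Fraction(1, 2) - a            # the test becomes b*sqrt(2) <= c
    if b > 0:
        return c > 0 and 2 * b * b <= c * c
    return c >= 0 or 2 * b * b >= c * c

def step(a, b):
    """Apply f to x = a + b*sqrt(2): 1 - 2x on [0, 1/2], 2x - 1 on [1/2, 1]."""
    if le_half(a, b):
        return (1 - 2 * a, -2 * b)
    return (2 * a - 1, 2 * b)

def orbit(n):
    """First n points of the orbit of x = sqrt(2) - 1, as (a, b) pairs."""
    points, current = [], (-1, 1)
    for _ in range(n):
        points.append(current)
        current = step(*current)
    return points

def cycle_map(m, n):
    """The function of item (iv) on {0, ..., m+n}: x -> x + 1, except m+n -> m."""
    return lambda x: x + 1 if x < m + n else m
```

In the orbit, the coefficient of $\sqrt{2}$ doubles in absolute value at every step, which is why no two points can coincide. For `cycle_map(m, n)`, the orbit of 0 runs through $0, 1, \ldots, m+n$ and then returns to $m$, which is the m-n-property.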





THEOREM 10.20 Suppose that $\mathcal{F}$ is a rich class of continuous
functions. Then $DTL^{\bigcirc\ast}_{\mathcal{F}} = \mathrm{LTL}$.


Proof Recall the axiomatization of LTL given in the introduction. To show
that $\mathrm{LTL} \subseteq DTL^{\bigcirc\ast}_{\mathcal{F}}$, it suffices to
show that this axiomatization is sound for $DTL^{\bigcirc\ast}_{\mathcal{F}}$.
To show that $DTL^{\bigcirc\ast}_{\mathcal{F}} \subseteq \mathrm{LTL}$, we
consider two cases.

Figure 10.2. $f^{-1}(S)$ is shaded.


Case 1. $\mathcal{F}$ contains a function with the $\omega$-property. Suppose
that $A \notin \mathrm{LTL}$ where $A$ is in the language $L^{\bigcirc\ast}$.
Then there is some infinite purely temporal model falsifying $A$. To be more
precise, let an infinite purely temporal model be a function
$V : PV \times \omega \to \{0, 1\}$, where $PV$ is the set of propositional
variables; where the natural numbers represent discrete moments in time; and
where 0 and 1 represent falsity and truth. Given an infinite purely temporal
model $V$, we define $n \models B$, for each $n \in \omega$ and each formula
$B$ in the language $L^{\bigcirc\ast}$, as follows: $n \models p$ iff
$V(p, n) = 1$; $n \models \neg B$ iff $n \not\models B$; $n \models (B \vee C)$
iff $n \models B$ or $n \models C$; $n \models \bigcirc B$ iff
$n + 1 \models B$; and $n \models \ast B$ iff $m \models B$ for every
$m \ge n$. The completeness theorem for LTL tells us that since
$A \notin \mathrm{LTL}$, there is some infinite purely temporal model $V$ such
that $0 \not\models A$. Choose such a $V$.

Since $\mathcal{F}$ contains a function with the $\omega$-property, we can
choose a topological space $X$, a function $f \in \mathcal{F}$ and an
$x \in X$, such that the points $x, fx, ffx, fffx, \ldots$ are all distinct.
Choose a function $V' : PV \to \mathcal{P}(X)$ such that $f^k x \in V'(p)$ iff
$V(p, k) = 1$, for every $k \in \omega$. Define $M = \langle X, f, V'\rangle$.
By a standard induction on formulas, it can be shown that $f^k x \in V'(B)$ iff
$k \models B$ for all formulas $B$ in the language $L^{\bigcirc\ast}$ and all
$k \in \omega$. Thus $x \notin V'(A)$ since $0 \not\models A$. So
$A \notin DTL^{\bigcirc\ast}_{\mathcal{F}}$, as desired.

Case 2. $\mathcal{F}$ contains a function with the m-n-property for every
$m, n \in \omega$. Suppose that $A \notin \mathrm{LTL}$ where $A$ is in the
language $L^{\bigcirc\ast}$. Let a finite purely temporal model be an ordered
triple $M = \langle Y, g, V\rangle$ where $Y$ is a finite set; $g$ is a
function on $Y$; and $V : PV \times Y \to \{0, 1\}$. Given a finite purely
temporal model $M = \langle Y, g, V\rangle$, we define $y \models B$, for each
$y \in Y$ and each formula $B$ in the language $L^{\bigcirc\ast}$, as follows:
$y \models p$ iff $V(p, y) = 1$; $y \models \neg B$ iff $y \not\models B$;
$y \models (B \vee C)$ iff $y \models B$ or $y \models C$;
$y \models \bigcirc B$ iff $g(y) \models B$; and $y \models \ast B$ iff
$g^n(y) \models B$ for every $n \ge 0$.

Segerberg (1976) proves that LTL satisfies the finite frame property. So since
$A \notin \mathrm{LTL}$, there is some finite purely temporal model
$M = \langle Y, g, V\rangle$ and some $y \in Y$ such that $y \not\models A$.
Since $Y$ is finite, we have $g^{m+n+1}(y) = g^m(y)$, for some
$m, n \in \omega$ with the $g^i(y)$ distinct for $i \le m + n$. Choose such an
$m$ and $n$.

Choose a function $f \in \mathcal{F}$ with the m-n-property and let $X$ be the
topological space on which $f$ acts. Choose an $x \in X$ such that
$x, fx, \ldots, f^{m+n}x$ are all distinct, and such that
$f^{m+n+1}x = f^m x$. Define $V' : PV \to \mathcal{P}(X)$ as follows:

$V'(p) = \{f^k x : V(p, g^k y) = 1\}$.

And let $M' = \langle X, f, V'\rangle$. Claim: $f^k x \in V'(B)$ iff
$g^k y \models B$ for all $k \in \omega$ and formulas $B$ in the language
$L^{\bigcirc\ast}$. We prove this by induction on formulas.

Base case: For propositional variables $p$: $f^k x \in V'(p)$ iff
$V(p, g^k y) = 1$ iff $g^k y \models p$.

Inductive step $\neg$, $\vee$: standard.

Inductive step $B = \bigcirc C$: $f^k x \in V'(\bigcirc C)$ iff
$f^{k+1}x \in V'(C)$ iff $g^{k+1}y \models C$ (by IH) iff
$g^k y \models \bigcirc C$.

Inductive step $B = \ast C$: $f^k x \in V'(\ast C)$ iff
$(\forall n \ge k)(f^n x \in V'(C))$ iff $(\forall n \ge k)(g^n y \models C)$
(by IH) iff $g^k y \models \ast C$.

Thus $x \notin V'(A)$ since $y \not\models A$. So
$A \notin DTL^{\bigcirc\ast}_{\mathcal{F}}$, as desired. QED
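The satisfaction clauses for a finite purely temporal model in Case 2 translate directly into a recursive evaluator. The sketch below is only an illustration: the tuple encoding of formulas, the dictionary encoding of $g$, and the name `holds` are assumptions made here, not notation from the text.

```python
def holds(formula, y, g, V):
    """Decide y |= formula in the finite purely temporal model (Y, g, V).

    Formulas are nested tuples: ('var', p), ('not', B), ('or', B, C),
    ('next', B) for the next-time connective and ('always', B) for henceforth.
    g is a dict mapping each point of Y to its successor, and V is the set
    of pairs (p, y) at which the variable p is true.
    """
    op = formula[0]
    if op == 'var':
        return (formula[1], y) in V
    if op == 'not':
        return not holds(formula[1], y, g, V)
    if op == 'or':
        return holds(formula[1], y, g, V) or holds(formula[2], y, g, V)
    if op == 'next':
        return holds(formula[1], g[y], g, V)
    if op == 'always':
        # The orbit y, g(y), g(g(y)), ... is eventually periodic, so it is
        # enough to visit each point once until a repetition occurs.
        seen = set()
        while y not in seen:
            if not holds(formula[1], y, g, V):
                return False
            seen.add(y)
            y = g[y]
        return True
    raise ValueError(f"unknown connective: {op!r}")
```

The loop in the `'always'` case relies on the same observation as the proof: because $Y$ is finite, the orbit of $y$ eventually re-enters a point already visited, so only finitely many points need to be checked.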


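COROLLARY 10.21
$DTL^{\bigcirc\ast}_0 = DTL^{\bigcirc\ast}_{\mathcal{H}} = DTL^{\bigcirc\ast}_{\mathcal{M}} = DTL^{\bigcirc\ast}_{\mathcal{O}} = DTL^{\bigcirc\ast}_{\mathbb{R}} = DTL^{\bigcirc\ast}_{[0,1]} = DTL^{\bigcirc\ast}_{\mathbb{R},\mathcal{H}} = DTL^{\bigcirc\ast}_{\mathcal{A},\mathcal{H}} = DTL^{\bigcirc\ast}_{fin} = \mathrm{LTL}$,
where fin is the class of finite topological spaces.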


5. S4 is topologically complete for (0, 1) 


Here we combine ideas from several previous constructions into a short proof 
of topological completeness of the modal logic S4, first for the binary rational 
numbers (see below) in the interval (0, 1), and after that for real numbers in the 
same interval. Beginning with a finite, reflexive and transitive Kripke frame 
K, we present a direct definition of an open and continuous map from (0, 1) 
onto the topological space corresponding to K. Given that S4 is complete w.r.t. 
finite, reflexive and transitive Kripke frames, this suffices for the completeness 
of S4 in (0, 1). The map we define is practically the same as in Mints and
Zhang, 2005a, very close to Aiello et al., 2003, and has some common features
with Bezhanishvili and Gehrke, 2005.

Let us state a familiar general condition of propositional equivalence (more
precisely, bisimulation) of two topological spaces. Recall that a map
$f : X_1 \to X_2$ is open iff the image of any open set is open.


LEMMA 10.22 Let $X_1$, $X_2$ be two topological spaces and $f$ a continuous and
open map from $X_1$ onto $X_2$. Let $V_2$ be a valuation for topological
semantics on $X_2$ and define $V_1$ by the equation
$V_1(p) = f^{-1}(V_2(p))$. Then for any formula $A$ in the language $L^{\Box}$,

$\langle X_2, V_2\rangle \models A$ iff $\langle X_1, V_1\rangle \models A$.

In particular, if $A$ is refuted in $X_2$, then $A$ is refuted in $X_1$.


Proof Induction on formulas. QED


Let

$K = \langle W, R\rangle$

be a finite Kripke frame with a reflexive and transitive accessibility relation
$R$, a set of worlds $W = \{0, 1, \ldots, N\}$, and with root 0: $R0w$ holds
for every $w \in W$. For every $w \in W$ the submodel with the root $w$ is
denoted by $K_w$. In particular $K_0 = K$. If $w$ has no proper R-successors,
that is $Rww'$ implies $w' = w$, then $w$ is a leaf of $K$. Speaking of
R-least, R-maximal worlds we mean reading $Rww'$ as $w \le w'$. So the root 0
is the R-least element of $W$, a leaf is an R-maximal element. In general $W$
may contain clusters, that is sets of R-equivalent elements. Every element of a
cluster is R-least in the cluster.

Our plan is to define a continuous open function $W$ from $Q'$ onto $K$, where
$Q'$ is the set of binary rational numbers in the real interval (0, 1):

$Q' = \{m/2^n : 0 < m < 2^n\}$.


5.1 Partitions 


DEFINITION 10.23 The first partition $P_1(K)$, corresponding to
$K = \langle W, R\rangle$, of the open interval (0,1) of reals is defined as
follows.

If $N = 0$, that is the frame $K$ has just one world 0, there are no endpoints
and the whole interval (0,1) is marked $K$. All points $r \in Q'$ are marked by
the root 0: $W_1(r) = 0$.

If $N > 0$, then the partition has endpoints

$2^{-i}$, $i = 1, \ldots, 2N$

that determine $2N + 1$ open intervals which are marked as follows:

(10.7)
$\begin{array}{ccccccc}
(0, 2^{-2N}), & \ldots, & (2^{-2k-1}, 2^{-2k}), & (2^{-2k}, 2^{-2k+1}), & \ldots, & (2^{-2}, 2^{-1}), & (2^{-1}, 1) \\
K & \ldots & K & K_k & \ldots & K_1 & K
\end{array}$

that is $W_1((0, 2^{-2N})) = K, \ldots$. The endpoints are marked by the root
of the model $K$:

(10.8) $W_1(2^{-i}) = 0$.

Figure 10.3. The first partition for a three-element model: the endpoints
$2^{-4}, 2^{-3}, 2^{-2}, 2^{-1}$ are marked 0, and the five intervals are marked
$K, K_2, K, K_1, K$ from left to right.


DEFINITION 10.24 The first partition $P_1((a,b), K)$ of an arbitrary interval
$(a,b)$, $a < b$, corresponding to the frame $K$ is proportional to (10.7). If
$N = 0$, the whole interval is marked $K$ and $W_1(r) = 0$ for all $r \in Q'$.
If $N > 0$, the partition has endpoints

$a + (b-a)2^{-i}$, $i = 1, \ldots, 2N$

that determine $2N + 1$ open intervals marked as in (10.7):

(10.9) $K, K_N, K, K_{N-1}, \ldots, K, K_1, K$.

All endpoints are marked as in (10.8) by the root:

$W_1(a + (b-a)2^{-i}) = 0$, $i = 1, \ldots, 2N$.


DEFINITION 10.25 ((n + 1)-ST PARTITION) Assume the n-th partition $P_n(K)$ of
the interval (0,1) corresponding to any finite frame $K$ is already defined. If
$N = 0$, define $P_{n+1}(K) = P_n(K) = P_1(K)$.

Otherwise assume $P_n(K)$ consists of endpoints

$a_1 < a_2 < \ldots < a_M$

with each of the intervals $(a_j, a_{j+1})$ marked by some $K_{w_j}$,
$w_j \in W$. Then the (n + 1)-st partition $P_{n+1}(K)$ of (0,1) corresponding
to $K$ consists of all endpoints of $P_n(K)$ plus all endpoints of all
partitions

(10.10) $P_1((0, a_1), K), \ldots, P_1((a_i, a_{i+1}), K_{w_i}), \ldots, P_1((a_M, 1), K)$.

Intervals of the new partition are marked according to partitions (10.10). The
marking $W_{n+1}(e_k) \in W$ is an extension of $W_n$ defined for new endpoints
$e_k$ according to partitions (10.10):

$W_{n+1}(e_k) = 0$ if $e_k \in (0, a_1)$ or $e_k \in (a_M, 1)$,
$W_{n+1}(e_k) = w_j$ if $e_k \in (a_j, a_{j+1})$.

If $K_{w_j}$ consists only of $w_j$ (that is, $w_j$ is a leaf of $K$), then
$W_n(r) = w_j$ for all $r \in (a_j, a_{j+1}) \cap Q'$.

Figure 10.4. The first partition for a two-element model: the endpoints
$2^{-2}$ and $2^{-1}$ are marked 0, and the three intervals are marked
$K, K_1, K$ from left to right.

DEFINITION 10.26 Define $P(K)$ as the union of all $P_n(K)$: the endpoints
of $P(K)$ are all binary rational points $r \in Q'$ of the interval (0,1)
marked as in the corresponding $P_n(K)$.

$n(r)$ := the first number $n$ such that $r$ is an endpoint of $P_n(K)$ or $r$
belongs to an interval of $P_n(K)$ marked by $K_w$ for a leaf $w$,

$W(r) := W_{n(r)}(r)$.
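The marking $W(r)$ of a binary rational can be computed by following the successive partitions until $r$ becomes an endpoint or lands in an interval marked by a leaf. The sketch below works under stated assumptions: $R$ is given as a set of pairs, the root is world 0, and the k-th non-root world of the current subframe (in increasing numerical order) is placed on the interval between the cuts $2^{-2k}$ and $2^{-2k+1}$, rescaled to the current interval. That placement is one reading of (10.7), and the function names are illustrative.

```python
from fractions import Fraction

def successors(R, w):
    """Worlds of the generated subframe K_w, i.e. all v with w R v."""
    return sorted(v for (u, v) in R if u == w)

def mark(R, r):
    """Return W(r) for a binary rational r in (0, 1); the frame has root 0."""
    a, b, w = Fraction(0), Fraction(1), 0    # current interval (a, b), marked K_w
    while True:
        others = [v for v in successors(R, w) if v != w]
        if not others:                       # K_w = {w}: w is a leaf
            return w
        length = b - a
        # cuts[0] = b and cuts[i] = a + length * 2^-i for i = 1, ..., 2 * len(others)
        cuts = [a + length / 2**i for i in range(2 * len(others) + 1)]
        if r in cuts[1:]:                    # r is an endpoint: marked by the root w
            return w
        for i in range(1, len(cuts)):
            if cuts[i] < r < cuts[i - 1]:
                a, b = cuts[i], cuts[i - 1]
                if i % 2 == 0:               # even i = 2k carries K_{others[k-1]}
                    w = others[i // 2 - 1]
                break
        else:                                # leftmost interval, still marked K_w
            b = cuts[-1]
```

For the two-element frame with worlds 0 and 1 and $0 \le 1$, `mark` returns 0 at 1/2, 1/4 and 3/4 and returns 1 at 3/8 and 11/16. The loop is guaranteed to stop only for binary rationals, since every interval it visits is a standard dyadic interval whose length shrinks at each round.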
EXAMPLE 10.27 Consider a two-element Kripke frame $K = \langle \{0, 1\}, \le\rangle$,
that is, the root 0 with a single proper R-successor 1.

$K$ is used to falsify the formula $(\Box p \vee \Box\neg p)$ by setting
$V(p) = \{1\}$, so that

(10.11) $0 \not\models p$, $1 \models p$.

The first partition corresponding to the Kripke frame
$K = \langle \{0, 1\}, \le\rangle$ is shown in Fig. 10.4. This shows that
successive partitions are obtained by

1 marking the endpoints by 0,

2 removing the second 1/4 of each of the remaining intervals and marking (all
points in) the removed intervals by $K_1$; these intervals are not partitioned
further,

3 marking the remaining intervals by $K = K_0$.

Let $P$ be the union of all intervals marked $K_1$ and $\bar{P} := (0,1) - P$.
The valuation $V$ leads to a topological model with the carrier (0,1) and a
valuation

$V'(p) = P$.

The set $P$ is open, so $\mathrm{Int}(P) = P$, but
$\mathrm{Int}(\bar{P}) = \emptyset$, since every binary rational point in
$\bar{P}$ is a boundary of some interval of $P$. Hence

$V'(\Box p \vee \Box\neg p) = \mathrm{Int}(P) \cup \mathrm{Int}(\bar{P}) = P \cup \emptyset = P \ne (0,1)$.

Hence $(\Box p \vee \Box\neg p)$ is not valid, as expected.


5.2 Properties of $P_n(K)$


The next lemma lists some combinatorial properties of partitions $P_n(K)$.
The marking $w^j = w^j(r)$ in clause 3 is determined by $r$ and the partition
$P_j(K)$.


LEMMA 10.28 1 The functions $n(r)$, $W(r)$ are defined for every $r \in Q'$.

2 For $m \ge n(r)$, $W_m(r) = W_{n(r)}(r) = W(r)$. If $r$ is an endpoint of the
partition $P_n(K)$, then one of the intervals of this partition adjacent to
$r$ is marked by $K_{W(r)}$ and the other adjacent interval is marked by $K_w$
with some $w$ satisfying $RW(r)w$.

For $n(r) > 1$, $r$ belongs to an interval of $P_{n(r)-1}(K)$ marked by
$K_{W(r)}$.

3 For every $j < n(r)$ the point $r$ belongs to an interval $I$ of the
partition $P_j(K)$ marked by a $K_{w^j}$ for a world $w^j$ such that
$Rw^j w^{j+1}$ (where $w^{n(r)} = W(r)$).

4 If $r$ belongs to an interval $I$ of the partition $P_m(K)$ marked by a
$K_w$, then $RwW(r)$.


Proof Induction on $n$. Note that if $r = k/2^n$, $0 < k < 2^n$, then $W_j(r)$
is defined beginning with $j = n(r)$. QED


LEMMA 10.29 For $r \in Q'$

(10.12) $W(r)$ is an R-least $w$ such that $r = \lim_{n \to \infty} r_n$

for some sequence of $r_n \in Q'$ with $r_n \ne r$ and $W(r_n) = w$.


Proof Let $n = n(r)$. If $r$ belongs to an interval $I$ of $P_n(K)$ marked by a
leaf $w$ of $K$, then $W(r) = w = W(r')$ for all $r' \in I$. Hence $w$ is the
only world satisfying (10.12).

If $r$ is an endpoint of $P_n(K)$, then one of the intervals of this partition
adjacent to $r$ (say from the left) is marked by $K_{W(r)}$ and the other
adjacent interval is marked by $K_{w'}$ with some $w'$ satisfying $RW(r)w'$.
Hence $RW(r)W(r')$ holds for every $r'$ in the union of these intervals, so
that $r = \lim_{n \to \infty} r_n$ and $W(r_n) = w$ imply $RW(r)w$. On the
other hand, the interval left-adjacent to $r$ in any $P_m(K)$, $m \ge n$, is
marked $K_{W(r)}$. If $r_m$ is the midpoint of that interval, we have
$W_{m+1}(r_m) = W(r)$ as required. QED


Recall that the topology on $K$ is determined by open sets $K_w$ for $w \in W$.


LEMMA 10.30 The mapping $W : Q' \to W$ is continuous: for every $r \in Q'$
there is a $\delta > 0$ such that for every $r' \in Q'$, if
$|r - r'| < \delta$, then $W(r') \in K_{W(r)}$.

Proof If $W(r) = 0$, then $w \in K_{W(r)} = K$ holds for all $w \in W$.
Otherwise, take $n = n(r) > 1$. Then $r$ belongs to an interval $I$ of
$P_{n-1}(K)$ which was assigned $K_{W(r)}$ and

$W(r') \in K_{W(r)}$ for all $r' \in I \cap Q'$

as required. QED


LEMMA 10.31 The mapping $W : Q' \to W$ is open: for every $r \in Q'$, every
$w' \in K_{W(r)}$ and $\varepsilon > 0$ there is an $r' \in Q'$ with

$|r - r'| < \varepsilon$ and $W(r') = w'$.


Proof Let $n = n(r)$. Then $W(r) = W_n(r)$ and for any $m \ge n$ the number
$r$ is an endpoint of an interval which is assigned $K_{W(r)}$ in $P_m(K)$.
Since $W$ maps $Q'$-points of this interval onto $K_{W(r)}$, there is a point
$r'_m \in Q'$ in this interval such that $W(r'_m) = w'$. The sequence $r'_m$
converges to $r$. QED


THEOREM 10.32 Q’ is complete for S4. 


Proof By Lemmas 10.30,10.31 and 10.22. QED 


5.3 Extension to real numbers 


The map W is extended here to real numbers in (0,1) by continuity using 
(10.12) as a hint, so that Lemmata 10.29, 10.30, 10.31 still hold. To make an 
R-least w in (10.12) unique, let’s fix a representative p(C) € C for each cluster 
C of the model K. 


DEFINITION 10.33 For $x \in (0,1) - Q'$ define

$I(n, x)$ := the interval of the partition $P_n(K)$ containing $x$,

$W(x)$ := $p(C)$ for the unique cluster $C$ such that

(10.13) $\exists n_0 (\forall n \ge n_0)(\exists w \in C)$ $I(n, x)$ is marked by $K_w$ in $P_n(K)$.

LEMMA 10.34 $W(x)$ is defined for every $x \in (0,1) - Q'$.


Proof Let $x \in (0,1) - Q'$ be fixed. Since $x \notin Q'$, it is not an
endpoint of $I(n, x)$ for any $n$. Let the world $M_n \in W$ be the marking of
the interval $I(n, x)$ in $P_n(K)$, that is $I(n, x)$ is marked by $K_{M_n}$.
For all $n$ we have

(10.14) $RM_n M_{n+1}$.

Indeed,

(10.15) $I(n+1, x) \subseteq I(n, x)$,

since for arbitrary intervals $I \in P_n(K)$, $J \in P_{n+1}(K)$ either
$I \supseteq J$ or $I$ and $J$ are disjoint. In the former case a relation

$Rww'$

for the marking $K_w$ of $I$ [in $P_n(K)$] and the marking $K_{w'}$ of $J$ [in
$P_{n+1}(K)$] follows from the definition of $P_{n+1}(K)$, hence (10.14).

Since the model $K$ is finite, (10.14) implies that all worlds $M_n$ are
R-equivalent beginning with some $n$, that is $M_n \in C$ for one and the same
cluster $C$, which is obviously unique. QED


LEMMA 10.35 The function $W : (0,1) \to W$ is continuous.


Proof Similar to Lemma 10.30. Take an arbitrary $x \in (0,1)$. The case
$W(x) = 0$ is obvious. Assume $W(x) \ne 0$.

Case 1. $x \in Q'$. Then $n(x) > 1$, since $W(x) \ne 0$. For $n = n(x) > 1$ and
$w = W(x)$ the point $x$ belongs to an interval of $P_{n-1}(K)$ which was
marked by $K_w$. For all real numbers $y$ in that interval $I(n-1, x)$ one has
$RwW(y)$ by (10.14), as required.

Case 2. $x \notin Q'$. Let $w = W(x)$, that is for every $n \ge n_0$ the
interval $I(n, x)$ is marked by $K_{w'}$ for some $w' \sim_R w$. Then
$W(y) \in K_w$ for all $y \in I(n_0, x)$, as required. QED


LEMMA 10.36 The function $W : (0,1) \to W$ is open.


Proof For $x \in Q'$ use Lemma 10.31.

Take $x \in (0,1) - Q'$ and let $w = W(x)$. For $n \ge n_0$ the point $x$
belongs to an interval $I(n, x)$ marked by $K_{w'}$ for some $w' \sim_R w$.
These intervals stabilize or converge to $x$ and for each $w'$ with $Rww'$
each of them contains an $r \in Q'$ with $W(r) = w'$, as required. QED


THEOREM 10.37 The interval (0,1) is complete for S4. 


Proof By Lemmas 10.35, 10.36 and 10.22. QED


6. The logic of homeomorphisms


Of particular interest is the class $\mathcal{H}$ of homeomorphisms (continuous
bijections with continuous inverses). Intuitively, we keep track of time with
$f$. Although our temporal modalities are forward-looking, it seems natural to
keep track of time with functions that can look in both directions (i.e. that
are bijective) and that are continuous in both directions. Despite the fact that
our temporal modalities are forward-looking, restricting our attention to the
class $\mathcal{H}$ makes a difference that can be expressed in our trimodal
propositional language. In particular we have (10.16) and (10.17), below:

(10.16) $(\Box\bigcirc p \supset \bigcirc\Box p) \notin DTL_0$.

(10.17) $(\Box\bigcirc p \supset \bigcirc\Box p) \in DTL_{\mathcal{H}}$.

(10.17) follows from the fact that
$\mathrm{Int}(f^{-1}(S)) \subseteq f^{-1}(\mathrm{Int}(S))$ where $S$ is a
subset of a topological space $X$ on which $f$ is a homeomorphism. To see
(10.16), let $M = \langle X, f, V\rangle$ where $X = \{0, 1\}$ with open sets
$\emptyset$, $\{0\}$ and $\{0, 1\}$; and where $f(0) = f(1) = 1$ and
$V(p) = \{1\}$. The function $f$ is continuous and hence $M$ is a DTM. Also
note that $V(\Box\bigcirc p) = \{0, 1\}$ and $V(\bigcirc\Box p) = \emptyset$,
so that $M \not\models (\Box\bigcirc p \supset \bigcirc\Box p)$.
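Because the counterexample for (10.16) lives on a two-point space, it can be verified by direct computation. The sketch below assumes a finite space given by an explicit list of its open sets; the helper names `interior` and `preimage` are illustrative.

```python
def interior(S, opens):
    """Largest open subset of S: the union of all open sets contained in S."""
    result = set()
    for U in opens:
        if U <= S:
            result |= U
    return frozenset(result)

def preimage(S, f, X):
    """f^{-1}(S) for a function f given as a dict on the finite carrier X."""
    return frozenset(x for x in X if f[x] in S)

X = frozenset({0, 1})
opens = [frozenset(), frozenset({0}), X]
f = {0: 1, 1: 1}
V_p = frozenset({1})

# f is continuous iff the preimage of every open set is open.
continuous = all(preimage(U, f, X) in opens for U in opens)

box_next_p = interior(preimage(V_p, f, X), opens)   # V([] O p)
next_box_p = preimage(interior(V_p, opens), f, X)   # V(O [] p)
```

Here `box_next_p` is the whole space and `next_box_p` is empty, so the implication fails at both points. The map is continuous but not a bijection, which is consistent with (10.17).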

As mentioned in Sec. 1, Konev et al. (2006a) present a proof that
$DTL_{\mathcal{H}}$ is not axiomatizable. We do, however, have an
axiomatization of its next-interior fragment. Define the logic S4C as in
Sec. 1. It turns out that S4C is complete for the class $\mathcal{H}$, i.e.,
$\mathrm{S4C} = DTL^{\Box\bigcirc}_{\mathcal{H}}$. What's more, completeness
holds for the real line $\mathbb{R}$ and similar spaces, providing an analogue
to the McKinsey-Tarski-Kripke Theorem (Theorem 10.4).


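THEOREM 10.38 (KREMER ET AL., 1997; KREMER AND MINTS, 1997)
$\mathrm{S4C} = DTL^{\Box\bigcirc}_0 = DTL^{\Box\bigcirc}_{\mathcal{H}} = DTL^{\Box\bigcirc}_{\mathbb{R}} = DTL^{\Box\bigcirc}_{\mathbb{R},\mathcal{H}} = DTL^{\Box\bigcirc}_{[0,1]} = DTL^{\Box\bigcirc}_{[0,1],\mathcal{H}} = DTL^{\Box\bigcirc}_{\mathcal{A}} = DTL^{\Box\bigcirc}_{\mathcal{A},\mathcal{H}}$.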


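Proof (Vladimir Rybakov helped us with this proof.) The claim that
$\mathrm{S4C} \subseteq DTL^{\Box\bigcirc}_0$ is just a version of soundness,
which is proved as usual. Given this, the following inclusion relations are
obvious:

$\mathrm{S4C} \subseteq DTL^{\Box\bigcirc}_0 \subseteq DTL^{\Box\bigcirc}_{\mathcal{H}} \subseteq DTL^{\Box\bigcirc}_{\mathbb{R},\mathcal{H}}$
$\mathrm{S4C} \subseteq DTL^{\Box\bigcirc}_0 \subseteq DTL^{\Box\bigcirc}_{\mathbb{R}} \subseteq DTL^{\Box\bigcirc}_{\mathbb{R},\mathcal{H}}$
$\mathrm{S4C} \subseteq DTL^{\Box\bigcirc}_0 \subseteq DTL^{\Box\bigcirc}_{\mathcal{H}} \subseteq DTL^{\Box\bigcirc}_{[0,1],\mathcal{H}}$
$\mathrm{S4C} \subseteq DTL^{\Box\bigcirc}_0 \subseteq DTL^{\Box\bigcirc}_{[0,1]} \subseteq DTL^{\Box\bigcirc}_{[0,1],\mathcal{H}}$
$\mathrm{S4C} \subseteq DTL^{\Box\bigcirc}_0 \subseteq DTL^{\Box\bigcirc}_{\mathcal{A}} \subseteq DTL^{\Box\bigcirc}_{\mathcal{A},\mathcal{H}}$

So it suffices to show, for every formula $A$ in the language $L^{\Box\bigcirc}$:

if $[0,1], \mathcal{H} \models A$ then $\mathbb{R}, \mathcal{H} \models A$,
if $\mathbb{R}, \mathcal{H} \models A$ then $A \in \mathrm{S4C}$, and
if $\mathcal{A}, \mathcal{H} \models A$ then $A \in \mathrm{S4C}$.

See Theorems 10.39, 10.46 and 10.47, respectively. QED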


THEOREM 10.39 Jf [0,1], H H A then R, H H A. 


Proof Suppose that R, H A A. Let M = (R, f, V} be a model where f is 
a homeomorphism on R and where M {A A. Since f is a homeomorphism 
on R, f is either strictly increasing or strictly decreasing. (In fact, as we show 
in the proof of Theorem 10.46, we can take f to be f(x) = x +1. But we 
will continue with the more general case for now, since we have not yet shown 
Theorem 10.46.) Choose some strictly increasing continuous one-one function 
h from R onto the open interval (0, 1). Define f’ on [0, 1] as follows: 


f'(x) =hfht(x)if0 <£ <1 
f'(x) = x if f is strictly increasing and either x = 0 or x = 1 
f'(x) = 1 — z if f is strictly decreasing and either x = 0 or x = 1. 


And define 
V'(p) = {x € (0,1) : A(x) € V(p)}. 


f′ is one-one and onto. f′ is also continuous. For if f is strictly increasing
then lim_{x→0} f′(x) = 0 and lim_{x→1} f′(x) = 1; and if f is strictly decreasing
then lim_{x→0} f′(x) = 1 and lim_{x→1} f′(x) = 0. So M′ = ⟨[0, 1], f′, V′⟩ is a
dynamic topological model.

Notice that (0,1) ∩ V′(B) = {x ∈ (0,1) : h^{-1}(x) ∈ V(B)}, for every
formula B. The proof of this is a routine induction on formulas. So V′(A) ≠
[0, 1]. For otherwise we would have V(A) = R, which is false. So [0, 1], H ⊭
A, as desired. QED
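
The conjugation used in this proof can be tried out numerically. The sketch below is only an illustration under stated assumptions: the logistic function is one possible choice of h (the proof only requires some strictly increasing continuous bijection from R onto (0, 1)), and f(x) = x + 1 is the increasing homeomorphism mentioned in the proof. The names h, h_inv and f_prime are hypothetical.

```python
import math

def h(x: float) -> float:
    """A strictly increasing continuous bijection from R onto (0, 1).
    The logistic function is an assumed, illustrative choice."""
    return 1.0 / (1.0 + math.exp(-x))

def h_inv(y: float) -> float:
    """Inverse of h, defined on the open interval (0, 1)."""
    return math.log(y / (1.0 - y))

def f(x: float) -> float:
    """A strictly increasing homeomorphism of R (here x + 1)."""
    return x + 1.0

def f_prime(y: float) -> float:
    """f' on [0, 1]: conjugate f by h inside (0, 1); since f is increasing,
    the endpoints 0 and 1 are fixed."""
    if y == 0.0 or y == 1.0:
        return y
    return h(f(h_inv(y)))

samples = [i / 100 for i in range(101)]
values = [f_prime(y) for y in samples]
```

On this sample grid the values stay inside [0, 1] and increase strictly, which is what one expects of an increasing homeomorphism of [0, 1].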







Before we prove Theorems 10.46 and 10.47, some definitions and lemmas. 


DEFINITION 10.40 Given a formula B, let g(B) be the result of pushing all
the occurrences of ◯ to the atomic formulas. For example,
g(◯(◯□(p ∨ ◯q) ∨ ◯¬r)) = (□(◯◯p ∨ ◯◯◯q) ∨ ¬◯◯r). To be more precise, define
g(B) inductively as follows:


g(◯^n B) = ◯^n B, if B ∈ PV,
g(◯^n ¬B) = ¬g(◯^n B),
g(◯^n (B ∨ C)) = g(◯^n B) ∨ g(◯^n C), and
g(◯^n □B) = □g(◯^n B).
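
The inductive clauses for g translate directly into a recursive function. The following is a minimal sketch, assuming formulas are encoded as nested tuples with the hypothetical tags "var", "not", "or", "box" (for □) and "next" (for ◯); this encoding is not part of the text.

```python
def g(formula, n=0):
    """Push every occurrence of 'next' down to the variables.
    n counts the 'next' operators collected so far on the way down."""
    tag = formula[0]
    if tag == "var":
        out = formula
        for _ in range(n):          # re-attach the n collected operators
            out = ("next", out)
        return out
    if tag == "next":
        return g(formula[1], n + 1)
    if tag == "not":
        return ("not", g(formula[1], n))
    if tag == "or":
        return ("or", g(formula[1], n), g(formula[2], n))
    if tag == "box":
        return ("box", g(formula[1], n))
    raise ValueError(f"unknown connective: {tag!r}")

p, q, r = ("var", "p"), ("var", "q"), ("var", "r")
# The example from Definition 10.40: next(next box(p or next q) or next not r)
example = ("next", ("or",
                    ("next", ("box", ("or", p, ("next", q)))),
                    ("next", ("not", r))))
result = g(example)
```

Every formula returned by g is built from near-atoms with the Boolean connectives and □ only, so it is simple in the sense of Definition 10.42.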


DEFINITION 10.41 A near-atom is a formula of the form ◯^n p where p ∈ PV.























DEFINITION 10.42 A formula is simple iff it is built up from near-atoms using
the Boolean connectives and □. Simple formulas are the formulas in the range
of g.


CONVENTION 10.43 We will take S4 to be formulated by its standard axioms
and rules, for a language whose formulas are just the simple formulas, treating
the near-atoms as indivisible atomic formulas. We also slightly restate the
definition of topological model, Definition 10.1: A topological model now
becomes an ordered pair, M = ⟨X, V⟩, where X is a topological space and V
assigns a subset of X to each near-atom ◯^n p rather than to each propositional
variable p. Mimicking Definition 10.1, we extend V to all simple formulas as
follows:

















V(◯^n p) = V(◯^n p),
V(A ∨ B) = V(A) ∪ V(B),
V(¬B) = X − V(B), and
V(□B) = Int(V(B)).











As in Definition 10.1, we define standard validity relations:


M ⊨ B iff V(B) = X.
X ⊨ B iff M ⊨ B for every model M = ⟨X, V⟩.
B is valid (⊨ B) iff X ⊨ B for every topological space X.


The McKinsey-Tarski-Kripke Theorem (Theorem 10.4) still holds: Suppose
that X is a dense-in-itself metric space and A is a simple formula. Then the
following are equivalent: (i) A ∈ S4; (ii) ⊨ A; (iii) X ⊨ A; (iv) R ⊨ A;
(v) Y ⊨ A for every finite topological space Y; and (vi) Y ⊨ A for every
Alexandrov space Y.











LEMMA 10.44 B ∈ S4◯ iff g(B) ∈ S4 iff g(B) ∈ S4◯.


Proof By a standard induction on the proof of B in S4◯, we can show that if
B ∈ S4◯ then g(B) ∈ S4. It is obvious that if g(B) ∈ S4 then g(B) ∈ S4◯.
Finally, if g(B) ∈ S4◯ then B ∈ S4◯, since (B ≡ g(B)) ∈ S4◯. QED


LEMMA 10.45 For every formula B, g(B) ∈ S4 iff (0,1) ⊨ g(B) where
(0, 1) is the open unit interval.

Proof This follows from Theorem 10.4 and Lemma 10.44. QED


THEOREM 10.46 If R, H ⊨ A then A ∈ S4◯.


Proof Suppose that A ∉ S4◯. Then, by Lemmas 10.44 and 10.45, for
some topological model M = ⟨(0,1), V⟩, we have M ⊭ g(A). Let M′ be the
dynamic topological model ⟨R, f, V′⟩, where fx = x + 1 and V′(p) = {x ∈ R :
for some natural number m, x − m ∈ V(◯^m p)}. See Fig. 10.5. f is a
homeomorphism. We will be done if we can show that M′ ⊭ A. For this,
it suffices to show that M′ ⊭ g(A), because of Lemma 10.44 and because of
soundness. And for this it suffices to show that for every simple formula B, we
have V(B) = (0,1) ∩ V′(B). We show this by induction on the construction
of B.


Figure 10.5. Definition of V′. (V(p), V(◯p), V(◯◯p), ... are placed on the
intervals (0,1), (1,2), (2,3), ... to form V′(p).)

Base case: B is a near-atom, say ◯^n p. Note the following:

x ∈ (0,1) ∩ V′(B)
⇒ x ∈ (0,1) and x ∈ V′(◯^n p)
⇒ x ∈ (0,1) and x + n ∈ V′(p)
⇒ x ∈ (0,1) and, for some m, x + n − m ∈ V(◯^m p)
⇒ m = n, since x ∈ (0,1) and x + n − m ∈ V(◯^m p) ⊆ (0,1)
⇒ x ∈ (0,1) and x ∈ V(◯^n p)
⇒ x ∈ V(B).


Conversely, x ∈ V(B)
⇒ x ∈ V(◯^n p)
⇒ x ∈ (0,1) and x ∈ V(◯^n p)
⇒ x ∈ (0,1) and, for some m, x + n − m ∈ V(◯^m p)
⇒ x ∈ (0,1) and x + n ∈ V′(p)
⇒ x ∈ (0,1) and x ∈ V′(◯^n p)
⇒ x ∈ (0,1) ∩ V′(B).


Inductive step B = C ∨ D. V(C ∨ D) = V(C) ∪ V(D) = ((0,1) ∩
V′(C)) ∪ ((0,1) ∩ V′(D)) = (0,1) ∩ V′(C ∨ D).

Inductive step B = ¬C. V(¬C) = (0,1) − V(C) = (0,1) − ((0,1) ∩
V′(C)) = (0,1) − (R ∩ V′(C)) = (0,1) ∩ V′(¬C).

Inductive step B = □C. V(□C) = Int(V(C)) = Int((0,1) ∩ V′(C)) =
Int((0,1)) ∩ Int(V′(C)) = (0,1) ∩ V′(□C). QED
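
The valuation V′ glues shifted copies of the sets V(◯^m p) along the real line. A minimal sketch of the membership test follows, assuming the valuation on near-atoms is supplied as a hypothetical predicate v_near(m, y) that says whether a point y of (0, 1) lies in V(◯^m p); the sample valuation is an arbitrary illustration.

```python
import math

def make_v_prime(v_near):
    """Build the membership test for V'(p) on R from a near-atom valuation.
    Since V(next^m p) is a subset of (0, 1), x - m can only lie in it when
    m is the integer part of x, so that is the only m worth checking."""
    def v_prime(x: float) -> bool:
        m = math.floor(x)
        y = x - m
        return m >= 0 and 0.0 < y < 1.0 and v_near(m, y)
    return v_prime

def sample_v_near(m: int, y: float) -> bool:
    """Arbitrary illustrative valuation: lower half for even m, upper half for odd m."""
    return y < 0.5 if m % 2 == 0 else y > 0.5

v_prime = make_v_prime(sample_v_near)

def in_v_prime_near_atom(x: float, n: int) -> bool:
    """x is in V'(next^n p) iff f^n(x) = x + n is in V'(p), with f(x) = x + 1."""
    return v_prime(x + n)
```

For x in (0, 1), in_v_prime_near_atom(x, n) agrees with sample_v_near(n, x), which is the base case of the induction above.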
































THEOREM 10.47 If 𝒜, H ⊨ A then A ∈ S4◯.


Proof Suppose that A ∉ S4◯. Then g(A) ∉ S4. So there is a Kripke model
M = ⟨W, R, V⟩ (where ⟨W, R⟩ is a Kripke frame) such that M ⊭ g(A). Now
define a dynamic topological model M′ = ⟨X, f, V′⟩ as follows:


X = {(w,n) : w ∈ W and n is an integer},

(w,n) R′ (w′,m) iff wRw′ and n = m,

Y ⊆ X is open iff Y is closed under the relation R′,
f(w,n) = (w,n + 1), and

(w,n) ∈ V′(p) iff w ∈ V(◯^n p).


X is a topological space, if we take the topology of open sets as defined
directly above. In fact, X is an Alexandrov space (see Definition 10.3). f is both
continuous and open since (w,n) R′ (w′,m) iff f(w,n) R′ f(w′,m). And f is
clearly one-one and onto. So M′ = ⟨X, f, V′⟩ is a dynamic Alexandrov model,
with f a homeomorphism. We will be done if we can show that M′ ⊭ A. For
this, it suffices to show that M′ ⊭ g(A), because of Lemma 10.44 and because
of soundness. And for this it suffices to show that for every simple formula B
and every w ∈ W we have w ∈ V(B) iff (w,0) ∈ V′(B). We show this by
induction on the construction of B.

Base case: B is a near-atom, say ◯^n p. Then (w,0) ∈ V′(B) iff (w,0) ∈
V′(◯^n p) iff f^n(w,0) ∈ V′(p) iff (w,n) ∈ V′(p) iff w ∈ V(◯^n p) iff
w ∈ V(B).

Inductive step B = C ∨ D. (w,0) ∈ V′(C ∨ D) iff (w,0) ∈ V′(C) or
(w,0) ∈ V′(D) iff w ∈ V(C) or w ∈ V(D) iff w ∈ V(C ∨ D).

Inductive step B = ¬C. (w,0) ∈ V′(¬C) iff (w,0) ∉ V′(C) iff w ∉ V(C)
iff w ∈ V(¬C).

Inductive step B = □C. (w,0) ∈ V′(□C) iff (∀w′)(∀n)(if (w,0) R′ (w′,n)
then (w′,n) ∈ V′(C)) iff (∀w′)(if wRw′ then (w′,0) ∈ V′(C)) iff (∀w′)(if
wRw′ then w′ ∈ V(C)) iff w ∈ V(□C). QED



































THEOREM 10.48 If fin, H ⊨ A then A ∈ S4◯, where fin is the class of
finite topological spaces.


Proof Suppose that A ∉ S4◯. Then, by Lemma 10.44 and the finite model
property for S4 we have M ⊭ g(A), for some topological model M = ⟨X, V⟩
where X is finite. Let n be the maximum number of consecutive occurrences
of ◯ in the formula g(A). For each k = 0, ..., n, define the finite topological
space X_k as follows:

X_k = {(x,k) : x ∈ X},


where a set O ⊆ X_k is open in X_k iff the set {x : (x,k) ∈ O} is open in X.
Define the function f_k : X → X_k as follows: f_k(x) = (x,k). f_k is clearly a
homeomorphism from X onto X_k.

For S ⊆ X, define Int_X(S) as the interior of S in X. Similarly, for
S ⊆ X_k define Int_{X_k}(S) as the interior of S in X_k, and for S ⊆ X′ define
Int_{X′}(S) as the interior of S in X′. Then note that, for any S ⊆ X′, we have
X_k ∩ Int_{X′}(S) = Int_{X_k}(X_k ∩ S). Also note that, for any S ⊆ X, we have
Int_X(S) = f_0^{-1}(Int_{X_0}(f_0(S))).

Next, define the topological space X′ = ∪_k X_k, where a set O is open in X′
iff the following set is open in X_k for each k: O ∩ X_k. Define the function
f : X′ → X′ as follows:


f(x,k) = (x,k + 1), if k < n,
f(x,k) = (x,0),     if k = n.


f is clearly a homeomorphism from X′ onto X′. Finally, define the valuation
function V′ : PV → 𝒫(X′) as follows: V′(p) = {(x,k) : x ∈ V(◯^k p)}.

Let M′ be the dynamic topological model ⟨X′, f, V′⟩. We will be done if we
can show that M′ ⊭ A. For this, it suffices to show that M′ ⊭ g(A), because
of Lemma 10.44 and because of soundness. And for this it suffices to show that
for every simple formula B with n or fewer consecutive occurrences of ◯, we
have


V(B) = {x ∈ X : (x,0) ∈ V′(B)}.


We show this by induction on the construction of B.

Base case: B is a near-atom, say ◯^k p, where k ≤ n. Note: (x,0) ∈
V′(B) ⇔ (x,0) ∈ V′(◯^k p) ⇔ f^k((x,0)) ∈ V′(p) ⇔ (x,k) ∈ V′(p) ⇔
x ∈ V(◯^k p) ⇔ x ∈ V(B).

Inductive step B = C ∨ D. Note: (x,0) ∈ V′(C ∨ D) ⇔ (x,0) ∈
V′(C) or (x,0) ∈ V′(D) ⇔ x ∈ V(C) or x ∈ V(D) ⇔ x ∈ V(C ∨ D).

Inductive step B = ¬C. Note: (x,0) ∈ V′(¬C) ⇔ (x,0) ∉ V′(C) ⇔ x ∉
V(C) ⇔ x ∈ V(¬C).

Inductive step B = □C. Note: (x,0) ∈ V′(□C)

⇔ (x,0) ∈ Int_{X′}(V′(C))
⇔ (x,0) ∈ X_0 ∩ Int_{X′}(V′(C))
⇔ (x,0) ∈ Int_{X_0}(X_0 ∩ V′(C))
⇔ (x,0) ∈ Int_{X_0}({(y,0) : (y,0) ∈ V′(C)})
⇔ (x,0) ∈ Int_{X_0}({(y,0) : y ∈ V(C)}), by the inductive hypothesis
⇔ f_0(x) ∈ Int_{X_0}(f_0(V(C)))
⇔ x ∈ f_0^{-1}(Int_{X_0}(f_0(V(C))))
⇔ x ∈ Int_X(V(C))
⇔ x ∈ V(□C). QED
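
The stacking construction in this proof is easy to carry out mechanically for a small space. The sketch below is an illustration under assumptions: a finite space is given by a list of points and a list of open sets (frozensets), the function name stack is hypothetical, and the top level n is assumed to wrap around to level 0, which is what makes f a bijection of the stacked space.

```python
from itertools import product

def stack(points, opens, n):
    """Disjoint union X' of n + 1 copies X_0, ..., X_n of a finite space.
    Returns (points of X', open sets of X', the map f)."""
    x_prime = [(x, k) for x in points for k in range(n + 1)]
    # A set is open in X' iff its slice at each level k is open in X,
    # so the open sets correspond to choices of one open set per level.
    opens_prime = [
        frozenset((x, k) for k, o in enumerate(choice) for x in o)
        for choice in product(opens, repeat=n + 1)
    ]
    # f shifts each level up by one; level n is assumed to wrap to level 0.
    f = {(x, k): (x, k + 1 if k < n else 0) for (x, k) in x_prime}
    return x_prime, opens_prime, f

points = [0, 1, 2]
opens = [frozenset(), frozenset({2}), frozenset({0, 1, 2})]
x_prime, opens_prime, f = stack(points, opens, n=2)
open_set = set(opens_prime)
f_is_bijection = set(f.values()) == set(x_prime)
f_is_continuous = all(
    frozenset(pt for pt in x_prime if f[pt] in o) in open_set for o in opens_prime
)
f_is_open = all(frozenset(f[pt] for pt in o) in open_set for o in opens_prime)
```

With these three checks passing, f is a homeomorphism of the stacked space onto itself for this example.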














COROLLARY 10.49 S4◯ = DTL^{◯□}_{fin,H}.


7. The logic of continuous functions 


The most basic dynamic topological logic is the logic of continuous functions
on topological spaces, i.e. DTL_0 = {A : ⊨ A}. It is not known whether DTL_0
is axiomatizable. In this section, we will prove that the next-interior fragment of
DTL_0 is axiomatizable: it is axiomatized by the system S4C, defined in Sec. 1,
above. (See Sec. 7.1.) Thus, S4C is the most general next-interior logic of
continuous functions on topological spaces. This is a corollary to our theorem
that S4C is the next-interior logic of all dynamic Kripke frames (Sec. 7.1). We
also prove that S4C satisfies the finite model property (Sec. 7.2), and that S4C
is the next-interior logic of continuous functions on Cantor space (Sec. 7.5).

Together, these results provide a partial analogue to the McKinsey-Tarski-Kripke
theorem (Theorem 10.4), above: For every formula A of L^{◯□}, we have
A ∈ S4C iff ⊨ A iff Y ⊨ A for every Kripke model Y iff Y ⊨ A for every finite
Kripke model Y. To this extent, the situation with S4C is similar to the situation
with S4◯ (see Theorem 10.38, above). But there is an important disanalogy:
though S4◯ is the next-interior logic of homeomorphisms on R, S4C is not
the next-interior logic of continuous functions on R. Presently, we will give an
example of a formula A in the language L^{◯□} such that A ∉ S4C but R ⊨ A.
See Slavnov, 2005 for another example. We note that the axiomatizability of
the next-interior logic of the real line remains an open problem.

Consider the following formula A, where p and q are propositional variables:


(□◯p ⊃ ◯◇□p) ∨ (◯q ⊃ □◯q).

We first show that A ∉ S4C. Let M = ⟨X, f, V⟩, where





















































X = {0, 1, 2};

the open sets are ∅, X, and {2};
f(2) = f(1) = 0 and f(0) = 1; and
V(p) = {0, 1}, and V(q) = {1}.


Note the following: 











V(◯p) = X; so V(□◯p) = X.
V(□p) = ∅; so V(◯◇□p) = ∅.
Thus V(□◯p ⊃ ◯◇□p) = ∅.
Meanwhile, V(◯q) = {0}; so V(□◯q) = ∅.
Thus V(◯q ⊃ □◯q) = {1, 2}.

Thus V(A) = {1, 2} ≠ X.

Thus M ⊭ A.
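
The computation above can be replayed mechanically. The sketch below is a small evaluator for the three-point model just described, checking the formula (□◯p ⊃ ◯◇□p) ∨ (◯q ⊃ □◯q). The tuple encoding of formulas and the helper names (ev, interior, implies, diamond) are hypothetical conveniences, not notation from the text; ⊃ and ◇ are expanded into ¬, ∨ and □.

```python
X = frozenset({0, 1, 2})
OPENS = [frozenset(), frozenset({2}), X]          # the topology of the model
F = {0: 1, 1: 0, 2: 0}                            # f(0) = 1, f(1) = f(2) = 0
VAL = {"p": frozenset({0, 1}), "q": frozenset({1})}

def interior(s):
    """Union of all open sets contained in s."""
    return frozenset().union(*(o for o in OPENS if o <= s))

def ev(formula):
    """Extension of a formula in the dynamic topological model (X, F, VAL)."""
    tag = formula[0]
    if tag == "var":
        return VAL[formula[1]]
    if tag == "not":
        return X - ev(formula[1])
    if tag == "or":
        return ev(formula[1]) | ev(formula[2])
    if tag == "box":                               # interior
        return interior(ev(formula[1]))
    if tag == "next":                              # preimage under f
        target = ev(formula[1])
        return frozenset(x for x in X if F[x] in target)
    raise ValueError(f"unknown connective: {tag!r}")

def implies(a, b):
    return ("or", ("not", a), b)

def diamond(a):                                    # closure = not-interior-not
    return ("not", ("box", ("not", a)))

p, q = ("var", "p"), ("var", "q")
left = implies(("box", ("next", p)), ("next", diamond(("box", p))))
right = implies(("next", q), ("box", ("next", q)))
A = ("or", left, right)
value_of_A = ev(A)
f_is_continuous = all(frozenset(x for x in X if F[x] in o) in OPENS for o in OPENS)
```

Here value_of_A comes out as {1, 2}, so the point 0 refutes A, in agreement with the hand computation.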























We now show that R ⊨ A. Suppose not. Then there is some dynamic
topological model M′ = ⟨R, f′, V′⟩ and some x ∈ R such that x ∉ V′(A).
Thus,


(i) x ∈ V′(□◯p). So there is an open interval I such that x ∈ I ⊆ V′(◯p).
So f′(x) ∈ f′(I) ⊆ V′(p).

(ii) x ∉ V′(◯◇□p). So f′(x) ∉ Cl(Int(V′(p))).

(iii) x ∈ V′(◯q). So f′(x) ∈ V′(q).

(iv) x ∉ V′(□◯q). So there is some y ∈ I such that y ∉ V′(◯q). Thus
f′(y) ∉ V′(q). Thus f′(x) ≠ f′(y). Thus f′(I) is not a singleton set.


Since f′(I) is not a singleton set, since I is an open interval, and since f′ is
continuous, f′(I) is either an open interval, a closed interval, or a semi-closed
interval, i.e. an interval of the form [a, b) or (a, b]. In any case,
f′(I) ⊆ Cl(Int(f′(I))). And since from (i) we have f′(I) ⊆ V′(p), we also have

f′(x) ∈ f′(I) ⊆ Cl(Int(f′(I))) ⊆ Cl(Int(V′(p))).


But this contradicts (ii).






































7.1 Canonical models 


We begin with the completeness of S4C for Kripke models. Recall some
standard notions: A is a theorem iff A ∈ S4C. A is consistent iff ¬A ∉ S4C.
A theory is a set of formulas in the language L^{◯□} containing all the theorems
of S4C and closed under Modus Ponens. A theory T is complete iff for every
formula A either A ∈ T or ¬A ∈ T. A theory T is consistent iff some formula
is not in T. A set S of formulas is consistent iff some theory T ⊇ S is consistent.














THEOREM 10.50 (ARTEMOV ET AL., 1997 AND DAVOREN, 1998) S4C
is sound and complete for the class of all dynamic Kripke models (and hence
for all dynamic Alexandrov models).


Proof Soundness is obvious. For completeness it suffices to construct a
canonical dynamic Kripke model M (see Definition 10.10) such that M ⊨ A iff
A ∈ S4C, for every formula A in the language L^{◯□}. In fact, given soundness, it
will suffice to show that if M ⊨ A then A ∈ S4C.

Define a Kripke frame ⟨X, R⟩ and a function f on X as follows:

















X = {x : x is a complete consistent theory};
xRy iff for every formula A, if □A ∈ x then A ∈ y; and
fx = {A : ◯A ∈ x}.


Note that R is reflexive since (□A ⊃ A) ∈ S4C and transitive since (□A ⊃
□□A) ∈ S4C.

Now we show that f is monotone: xRy ⇒ (fx)R(fy). So suppose that
xRy. To see that (fx)R(fy), suppose □A ∈ fx. Then ◯□A ∈ x. So
□◯A ∈ x, since (◯□A ⊃ □◯A) ∈ S4C. So ◯A ∈ y. So A ∈ fy, as
desired.

Thus ⟨X, R, f⟩ is a dynamic Kripke system. Define V(p) = {x ∈ X : p ∈ x}.
Then M = ⟨X, R, f, V⟩ is a dynamic Kripke model. By a standard induction
on the complexity of the formula A, we have x ∈ V(A) iff A ∈ x, for every
x ∈ X.

To show that if M ⊨ A then A ∈ S4C, suppose that A ∉ S4C. Then ¬A is
consistent. By a standard argument, every consistent formula is a member of
some complete consistent theory. So ¬A ∈ x, for some x ∈ X. So x ∉ V(A).
So M ⊭ A, as desired. QED



















































































COROLLARY 10.51 S4C is sound and complete for the class of all dynamic 
topological systems. 


7.2 The finite model property for S4C 


We can improve on the last corollary as follows: 


THEOREM 10.52 S4C is sound and complete for the class of all finite rooted 
dynamic topological systems. 


In particular, we will show that if A ∉ S4C, then there is some finite rooted
dynamic Kripke model M = ⟨W, R, f, V⟩ such that M ⊭ A.

Let a signed formula be any ordered pair ⟨±, A⟩ where A is a formula of
L^{◯□}. We will write +A for ⟨+, A⟩ and −A for ⟨−, A⟩. A pseudo-atom is a
set of signed formulas. Given a pseudo-atom α, let |α| = {A : ±A ∈ α}.
We identify any nonempty pseudo-atom α with its conjunction as follows: we
identify {+A, −B, −C} with A & ¬B & ¬C. We identify the pseudo-atom
∅ with the formula p ∨ ¬p. We say that a formula A is consistent just in case
¬A ∉ S4C. A pseudo-atom is consistent just in case the formula with which it
is identified is consistent.

We say that a set S of formulas is strongly closed iff S satisfies the following
closure conditions, for any formulas B and C, and for n, m ≥ 0:























(CC1) if ◯^{n+1} B ∈ S then ◯^n B ∈ S,
(CC2) if ◯^n □B ∈ S then ◯^n B ∈ S,
(CC3) if ◯^n ¬B ∈ S then ◯^n B ∈ S,

(CC4) if ◯^n (B & C) ∈ S then ◯^n B ∈ S and ◯^n C ∈ S,
(CC5) if ◯^n (B ∨ C) ∈ S then ◯^n B ∈ S and ◯^n C ∈ S, and
(CC6) if O” O™ Be SthnO"™U0O™ Bes. 














Note that if S satisfies (CC1)-(CC5) then S is closed under subformulas. We
say that a pseudo-atom α is strongly closed just in case the set |α| of formulas
is strongly closed. If S is a strongly closed set of formulas, then an S-atom is
any consistent strongly closed pseudo-atom α with |α| ⊆ S.

Suppose that the formula A ∉ S4C. We will define a finite rooted dynamic
Kripke model M such that M ⊭ A. Let S be the smallest strongly closed set
of formulas containing A. Note that S is finite. Define the dynamic Kripke
model M = ⟨W, R, f, V⟩ as follows:


1 W = the set of S-atoms.

2 αRβ iff, for every formula B, if +□B ∈ α then +□B ∈ β.

3 f(α) = {+B : +◯B ∈ α} ∪ {−B : −◯B ∈ α}.

4 V(p) = {α : +p ∈ α}, for each propositional variable p.


We have to make sure that M is a dynamic Kripke model. It is obvious that R
is reflexive and transitive. We must still show two things:


(1) If α ∈ W then fα ∈ W, i.e. if α is an S-atom then fα is an
S-atom; and
(2) f is monotonic.


We will show these below. Given (1) and (2), M is a dynamic Kripke model.
Also note that M is finite, since S is finite. Also, M is rooted: ∅ is an S-atom;
for every S-atom α we have ∅Rα; and f(∅) = ∅. Below, we will also show
the following, for every formula B:


(3) For every S-atom α, if ±B ∈ α then (α ∈ V(B) iff +B ∈ α).


Given (3), we are just about done. Since A ∉ S4C, there is some S-atom α
with −A ∈ α. Thus +A ∉ α. Thus α ∉ V(A), by (3). Thus M ⊭ A.

All that remains for Theorem 10.52 is to prove (1), (2) and (3).

Proof of (1). Suppose that α is an S-atom. We must show that (1.1) fα
is consistent, (1.2) |fα| ⊆ S and (1.3) fα is strongly closed. If fα is empty,
then (1.1) is satisfied since fα is identified with the consistent formula p ∨ ¬p;
and (1.2) and (1.3) are trivially satisfied. So we assume that fα is nonempty.
Re (1.1): fα is inconsistent ⇒ ◯fα is inconsistent ⇒ α is inconsistent since
(α ⊃ ◯fα) ∈ S4C. Re (1.2): Note that |fα| ⊆ |α|, since |α| is closed under
subformulas; thus |fα| ⊆ S since |α| ⊆ S. Re (1.3), we must show that |fα|
satisfies the closure conditions (CC1)-(CC6) above. Re (CC1): Suppose that
◯^{n+1}B ∈ |fα|. Then ◯^{n+2}B ∈ |α|. So ◯^{n+1}B ∈ |α|, since |α| satisfies
(CC1). So ◯^n B ∈ |fα|, by the definition of f. Re (CC2): Suppose that
◯^n □B ∈ |fα|. Then ◯^{n+1}□B ∈ |α|. So ◯^{n+1}B ∈ |α|, since |α| satisfies
(CC2). So ◯^n B ∈ |fα|, by the definition of f. Similarly for (CC3)-(CC5).
Re (CC6): Suppose that O” O+! B € |fa|. ThenO”*! Ot! B € Jal. So 
O"o0™! Be |a], since |a| satisfies (CC6). SoO"0 O™t! Be | fal. 

Proof of (2). Suppose that αRβ: we want to show that fα R fβ. So suppose
that +□A ∈ fα. Then +◯□A ∈ α. Since α is strongly closed, |α| satisfies
(CC6), above; thus either +□◯□A ∈ α or −□◯□A ∈ α. Note that
(◯□A ⊃ □◯□A) ∈ S4C. So +□◯□A ∈ α, by the consistency of α. Thus
+□◯□A ∈ β, since αRβ. Thus +◯□A ∈ β, by the strong closure and the
consistency of β. Thus +□A ∈ fβ, as desired.

Proof of (3). We proceed by induction.

Base case, B is a propositional variable. Cf. the definition of V.

Inductive step, B = C & D. Suppose that ±B ∈ α. Note that α ∈ V(B)
iff α ∈ V(C & D) iff α ∈ V(C) and α ∈ V(D) iff +C ∈ α and +D ∈ α iff
+B ∈ α, by the consistency of α.

Inductive step, B = C ∨ D. Suppose that ±B ∈ α. Note that α ∈ V(B) iff
α ∈ V(C ∨ D) iff α ∈ V(C) or α ∈ V(D) iff +C ∈ α or +D ∈ α iff +B ∈ α,
by the consistency of α.

Inductive step, B = ¬C. Suppose that ±B ∈ α. Note that α ∈ V(B)
iff α ∉ V(C) iff +C ∉ α iff −C ∈ α iff +B ∈ α, by the consistency of α.

Inductive step, B = ◯C. Suppose that ±B ∈ α. Note that α ∈ V(B) iff
fα ∈ V(C) iff +C ∈ fα iff +◯C ∈ α iff +B ∈ α.

Inductive step, B = □C. Suppose that ±B ∈ α. We consider the directions
of the biconditional separately.

(⇒): Suppose that +B ∉ α. Let γ = {+□D : +□D ∈ α} and let δ =
γ ∪ {−C}. First note that δ is a consistent pseudo-atom: δ is inconsistent ⇒
(γ ⊃ C) ∈ S4C ⇒ (γ ⊃ □C) ∈ S4C (since γ is a conjunction of formulas
of the form □D) ⇒ α is inconsistent. Since δ is consistent and |δ| ⊆ S,
there is an S-atom β such that δ ⊆ β. Since −C ∈ β, we have +C ∉ β.
Thus, by the inductive hypothesis, β ∉ V(C). Also note that αRβ. Thus
α ∉ V(□C) = V(B).

(⇐): Suppose that +B ∈ α. To show that α ∈ V(B), consider any S-atom
β with αRβ. Note that +□C = +B ∈ β by the definition of R. So +C ∈ β,
by the consistency of β. So, by the inductive hypothesis, β ∈ V(C). Thus
α ∈ V(□C) = V(B), as desired.


7.3 Bisimulation of Dynamic Topological Systems


DEFINITION 10.53 Let M_1 = ⟨X_1, T_1⟩, M_2 = ⟨X_2, T_2⟩ be two dynamic
topological spaces. We say a map W : M_1 → M_2 is commuting if


1 W is a continuous and open map from X_1 onto X_2, and
2 W(T_1(x)) = T_2(W(x)).
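
For finite spaces, the two conditions of Definition 10.53 can be checked by brute force. The sketch below assumes a finite space is given by a list of points and a list of open sets (frozensets), and that maps are Python dictionaries; the function name is_commuting and the two small examples are hypothetical illustrations.

```python
def is_commuting(w, points1, opens1, t1, points2, opens2, t2):
    """Check that w is continuous, open, onto, and satisfies w(t1(x)) = t2(w(x))."""
    open1, open2 = set(opens1), set(opens2)
    onto = {w[x] for x in points1} == set(points2)
    # Continuous: the preimage of every open set of the target is open.
    continuous = all(
        frozenset(x for x in points1 if w[x] in o) in open1 for o in open2
    )
    # Open: the image of every open set of the source is open.
    is_open = all(frozenset(w[x] for x in o) in open2 for o in open1)
    commutes = all(w[t1[x]] == t2[w[x]] for x in points1)
    return onto and continuous and is_open and commutes

# Two points with the indiscrete topology, swapped by t1, collapsed onto one point.
pts1 = [0, 1]
opens1 = [frozenset(), frozenset({0, 1})]
swap = {0: 1, 1: 0}
pts2 = ["a"]
opens2 = [frozenset(), frozenset({"a"})]
ident2 = {"a": "a"}
collapse = {0: "a", 1: "a"}
good = is_commuting(collapse, pts1, opens1, swap, pts2, opens2, ident2)

# The identity map fails to commute when the dynamics on the two sides differ.
discrete = [frozenset(), frozenset({0}), frozenset({1}), frozenset({0, 1})]
bad = is_commuting({0: 0, 1: 1}, pts1, discrete, swap, pts1, discrete, {0: 0, 1: 1})
```

The first example is commuting; the second fails only the condition W(T_1(x)) = T_2(W(x)).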


LEMMA 10.54 Let M_1 = ⟨X_1, T_1, V_1⟩, M_2 = ⟨X_2, T_2, V_2⟩ be two dynamic
topological models. Suppose that W : M_1 → M_2 is a commuting map and for
each propositional variable p,


V_1(p) = W^{-1}(V_2(p)).









































Then
V_1(A) = W^{-1}(V_2(A))
for any formula A of L^{◯□}.

Proof By induction on A. The base case and induction steps for connectives
∨, ∧, ¬ are straightforward. Now consider the remaining two cases: A = □B
and A = ◯B.

Case A = □B. We have

V_1(A) = V_1(□B)
       = Int(V_1(B))              by the definition of V_1
       = Int(W^{-1}(V_2(B)))      by the induction hypothesis
       = W^{-1}(Int(V_2(B)))      by the continuity and openness of W
       = W^{-1}(V_2(□B))          by the definition of V_2
       = W^{-1}(V_2(A)).


Case A = ◯B. We need to show that V_1(◯B) = W^{-1}(V_2(◯B)). Let
x ∈ X_1. We have

x ∈ V_1(◯B) ⇔ T_1(x) ∈ V_1(B)            by the definition of V_1
            ⇔ T_1(x) ∈ W^{-1}(V_2(B))    by the induction hypothesis
            ⇔ W(T_1(x)) ∈ V_2(B)
            ⇔ T_2(W(x)) ∈ V_2(B)         since W is a functor
            ⇔ W(x) ∈ V_2(◯B)             by the definition of V_2
            ⇔ x ∈ W^{-1}(V_2(◯B)).


QED


LEMMA 10.55 Let M_1 = ⟨X_1, T_1, V_1⟩, M_2 = ⟨X_2, T_2, V_2⟩ be two dynamic
topological models. Suppose that W : M_1 → M_2 is a commuting map and for
each propositional variable p,


V_1(p) = W^{-1}(V_2(p)).

Then for any formula A of L^{◯□},


M_1 ⊨ A iff M_2 ⊨ A.


Proof Suppose that M_2 ⊨ A, that is, V_2(A) = X_2. By Lemma 10.54
V_1(A) = W^{-1}(V_2(A)), and so V_1(A) = X_1 as required. On the other hand
suppose that M_1 ⊨ A, but M_2 ⊭ A, i.e., V_2(A) ≠ X_2. Since W is onto
and V_1(A) = W^{-1}(V_2(A)), we have V_1(A) ≠ X_1, that is, M_1 ⊭ A, a
contradiction. QED


COROLLARY 10.56 Let C_1 and C_2 be two classes of dynamic Kripke models
such that for every model M_2 ∈ C_2 there is an M_1 ∈ C_1 and a functor W :
M_1 → M_2. Then, if C_2 is complete for S4C, then C_1 is also complete for S4C.


Proof If M_2 ⊭ A, then M_1 ⊭ A by Lemma 10.55. QED


7.4 Stratified frames and limits 


This section uses definitions and results of Slavnov, 2005 as reformulated 
in Fernandez, 2006. 


DEFINITION 10.57 Given a finite Kripke frame W = ⟨W, R⟩ we say that
limits are chosen in W if for any R-monotone sequence {w_n} a particular
element w = w_i is fixed such that Rw_n w for all n. We write w = lim_n{w_n}. We
assume that if {w_n} stabilizes, that is w_n = w for n > n_0, then w = lim_n{w_n}.


In Sec. 5 (and earlier in Mints, 1999) we chose limits in an arbitrary way. Now,
given a dynamic system ⟨X, g⟩, we would like the function g to commute with
the limits.


THEOREM 10.58 S4C is complete for the class of finite dynamic Kripke
models ⟨W, R, g, V⟩ where limits are chosen so that


g(lim{w_n}) = lim{g(w_n)}.


Proof Consider a formula α refuted in a world v_0 of a finite dynamic Kripke
model M = ⟨W, R, g, V⟩ where limits are chosen in some way. Let’s “stratify”
M. Let d be the ◯-depth of α, that is the maximal nesting of ◯ in α. Define


W̃ = {(w, g(w), ..., g^j(w)) : w ∈ W, 0 ≤ j ≤ d}

R̃(w, g(w), ..., g^j(w))(v, g(v), ..., g^k(v)) ⇔ R g^j(w) g^k(v) & j = k

(w, g(w), ..., g^j(w)) ∈ Ṽ(p) ⇔ g^j(w) ∈ V(p)

g̃((w, g(w), ..., g^j(w))) = (w, g(w), ..., g^j(w), g^{j+1}(w)) for j < d;

g̃((w, g(w), ..., g^d(w))) = (w, g(w), ..., g^d(w))

lim{(w_n, g(w_n), ..., g^j(w_n))} = (lim{w_n}, g(lim{w_n}), ..., g^j(lim{w_n})).


M̃ = ⟨W̃, R̃, g̃, Ṽ⟩ is a dynamic topological model.

Consider an example. Let W = ⟨{0,1,2}, R, g, V⟩ with R00, R01, R02,
R11, R12, R21 and R22; with g(0) = g(2) = 1 and g(1) = 2; and with any
valuation function V. Note that 0 is the root and that {1,2} is a cluster. The
original model M is shown at the left of Fig. 10.6. For M = M_0 and depth
d = 3 we have M̃ = M_0 ∪ M_1 ∪ M_2 ∪ M_3, as shown in Fig. 10.6. Each of the
M_i, i > 0, is a cluster, and the function g̃ moves M_i to M_{i+1} “horizontally”.


Figure 10.6. M̃ = M_0 ∪ M_1 ∪ M_2 ∪ M_3.


In the general case, induction on formulas shows that


(w, g(w), ..., g^j(w)) ∈ Ṽ(φ) ⇔ g^j(w) ∈ V(φ)


for all j ≤ d and all φ of ◯-depth ≤ d − j. In particular (w) ∈ Ṽ(φ) iff
w ∈ V(φ), for every such φ and every w ∈ W, so the given formula α is refuted
at (v_0). Note that g̃ commutes with limits by the definition. QED
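
The stratification can be computed directly for the three-world example. The sketch below is an illustration under assumptions: worlds of the stratified model are encoded as Python tuples (w, g(w), ..., g^j(w)), the relation as a set of pairs, and the function name stratify is hypothetical.

```python
def stratify(worlds, rel, g, depth):
    """Build the stratified frame: points are tuples (w, g(w), ..., g^j(w)), j <= depth."""
    points = []
    for w in worlds:
        seq = (w,)
        points.append(seq)
        for _ in range(depth):
            seq = seq + (g[seq[-1]],)
            points.append(seq)
    # Two tuples are related iff they have the same length and their
    # last components are related in the original frame.
    rel_s = {(a, b) for a in points for b in points
             if len(a) == len(b) and (a[-1], b[-1]) in rel}
    # The new map extends a tuple by one step, except on the top layer, which it fixes.
    g_s = {a: (a + (g[a[-1]],) if len(a) <= depth else a) for a in points}
    return points, rel_s, g_s

worlds = [0, 1, 2]
rel = {(0, 0), (0, 1), (0, 2), (1, 1), (1, 2), (2, 1), (2, 2)}
g = {0: 1, 1: 2, 2: 1}
points, rel_s, g_s = stratify(worlds, rel, g, depth=3)

def layer(j):
    """Tuples of the form (w, g(w), ..., g^j(w))."""
    return [a for a in points if len(a) == j + 1]

upper_layers_are_clusters = all(
    (a, b) in rel_s for j in (1, 2, 3) for a in layer(j) for b in layer(j)
)
g_s_is_monotone = all((g_s[a], g_s[b]) in rel_s for (a, b) in rel_s)
```

For this example the stratified model has 12 worlds, each layer above the bottom one is a cluster, and the new map is monotone.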


7.5 Completeness of S4C for Cantor Space 


7.5.1 Setup. Here we present a streamlined version from Kremer,
2004 of a proof from Mints and Zhang, 2005b that S4C is sound and complete
for Cantor Space. To state our main theorem, we define both Cantor Space and
a number of related spaces based on trees.

Let *ω be the set of finite sequences of natural numbers, including the empty
sequence Λ. We use bold x, y, z, v to range over *ω. For x, y ∈ *ω, we write
xy or x⌢y for x concatenated with y. For x, y ∈ *ω we say that x ≤ y iff x
is an initial segment of y, and x < y iff x ≤ y and x ≠ y. For x ∈ *ω and
k ∈ ω, we write xk or x⌢k for x concatenated with k. For x ∈ *ω, length(x)
is the length of x; and for m < length(x), x_m is the m-th member of x. So
x = x_0, x_1, ..., x_{length(x)−1}.

For our purposes, a tree is any T ⊆ *ω satisfying the following:


Λ ∈ T;

∀x ∈ T, ∃k ∈ ω, xk ∈ T;

and

∀x ∈ *ω, ∀k ∈ ω, if xk ∈ T then x ∈ T.


We can think of the members of T as nodes in T. T is nontrivially branching
iff ∀x ∈ T, ∃k, j ∈ ω, k ≠ j and xk, xj ∈ T. T is finitely branching iff
∀x ∈ T, the set {k ∈ ω : xk ∈ T} is finite.

We will be interested in paths through trees: these can be represented by
members of ^ω ω, i.e. by ω-long sequences of natural numbers. We use
bold-italic x, y, z, v to range over ^ω ω. For x ∈ *ω and y ∈ ^ω ω, we write xy or
x⌢y for x concatenated with y. For x ∈ ^ω ω and m ∈ ω, x_m is the m-th member
of x. So x = x_0, x_1, ..., x_m, .... And for x ∈ ^ω ω and m ∈ ω, x↾m is the finite
sequence consisting of the first m members of x. So x↾m = x_0, x_1, ..., x_{m−1}.
If T is a tree, a T-path is an x ∈ ^ω ω such that x↾m ∈ T for every m ∈ ω.

For every tree T we define


path(T) = {x : x is a T-path}.


A T-path through the node x ∈ T is any T-path of the form x⌢y. We impose
a topology on path(T) as follows. For x ∈ T, let


B_x^T = {x⌢y : y ∈ ^ω ω and x⌢y ∈ path(T)}

be the set of T-paths through the node x.


Generalized T-Cantor Space is path(T) with the topology determined by the
basis sets B_x^T, where x ∈ T. And Cantor Space is C = path(*2), where *2 is
the set of finite sequences of 0 and 1.

Our main theorem is as follows. 


THEOREM 10.59 (SOUNDNESS AND COMPLETENESS IN CANTOR SPACE)
C ⊨ A iff A ∈ S4C.


Since soundness (i.e. the ⇐ direction of the “iff”) is routine, we concentrate
on completeness: if C ⊨ A then A ∈ S4C. Or, as we will prove it, if A ∉
S4C then C ⊭ A. First, we reduce the problem of showing completeness in C
to the problem of showing completeness in path(T), where T is any finitely
branching tree.


LEMMA 10.60 Suppose that T is a finitely branching tree. Then, for every
formula A, we have C ⊨ A iff path(T) ⊨ A.


Proof First assume T is nontrivially branching and construct a homeomorphism
of path(T) onto C, that is a one-one onto continuous open function, φ : path(T) →
C. For this we construct a one-one function ψ : T → *2. Let


ψ(Λ) = Λ.


Suppose that ψ(x) is defined for x ∈ T, and let k_0, ..., k_{m−1} be an exhaustive
strictly increasing list of natural numbers such that xk_n ∈ T for each n < m.
Note that m is finite since T is finitely branching, and that m > 1 since T
is nontrivially branching. If m = 5 then we have the following picture of a
portion of the tree T:


                  x
xk_0    xk_1    xk_2    xk_3    xk_4


We define ψ(xk_n) as follows, for each n < m:

ψ(xk_n) = ψ(x)⌢1^n⌢0   if n < m − 1,
ψ(xk_n) = ψ(x)⌢1^n     if n = m − 1.


In this definition, 1^n is the finite sequence consisting of n occurrences of 1.
The portion of T pictured above would get mapped by ψ into a portion of *2 as
follows:


ψ(x)⌢0 = ψ(xk_0)
ψ(x)⌢10 = ψ(xk_1)
ψ(x)⌢110 = ψ(xk_2)
ψ(x)⌢1110 = ψ(xk_3)
ψ(x)⌢1111 = ψ(xk_4)



The function ψ : T → *2 induces a function φ : path(T) → C as follows: φ(x)
is the unique *2-path through the nodes ψ(x↾0), ψ(x↾1), ψ(x↾2), ψ(x↾3), ....
Note that φ is indeed an isomorphism, hence path(T) and C verify the same
formulas.

Now consider the case when some points x ∈ T have only one successor xk.
Construct a new tree T′ by duplicating every such successor and the whole
branch it begins.


T_1 := {x_1⌢k′_1⌢...⌢x_l⌢k′_l : x_1⌢k_1⌢...⌢x_l⌢k_l ∈ T and C}


where C means: each of k_1, ..., k_l is the only successor in T, k′_i ∈ {k_i, k_i + 1}
and at least one of k′_i is k_i + 1.


T′ = T ∪ T_1


T′ is a nontrivially branching tree. Define a map W : T′ → T sending each
new node into the old node of the same length from which it was generated:


W(x_1⌢k′_1⌢...⌢x_l⌢k′_l) := x_1⌢k_1⌢...⌢x_l⌢k_l ∈ T for x_1⌢k′_1⌢...⌢x_l⌢k′_l ∈ T_1


W(x) := x for x ∈ T. W is continuous, open and onto T. The inclusion map
I : T → T′ is “inverse” to W. If f : T → T is a continuous function, it
induces a continuous function g : T′ → T′:


g(x) = f(W(x))


such that W commutes with f, g. Indeed,

W(g(x)) = W(f(W(x))) = f(W(x))


since the values of f belong to T. Hence T and T′ verify the same formulas
by Lemma 10.55. QED


Given Lemma 10.60 and Theorem 10.52, proving Theorem 10.59 reduces
to proving the following:


THEOREM 10.61 If the formula φ is refuted by some rooted finite dynamic
Kripke model, then path(T) ⊭ φ for some finitely branching tree T.


Proof So suppose that the formula φ is refuted by some finite dynamic Kripke
model M′ = ⟨W′, R′, f′, V′⟩. Consider the finite dynamic Kripke model


M := M̃′ = ⟨W, R, f, V⟩


constructed from M′ as in the proof of Theorem 10.58. The limits lim_n are
chosen so that f commutes with limits. Let


W = {0, 1, ..., m − 1}, m > 2.

Consider the tree T of all R-monotone sequences of worlds w ∈ W (that is of
numbers < m). In other words, one-element sequences in T are


(0), (1), ..., (m − 1)


and sons (immediate successors) of an x = (x_0, ..., x_n) ∈ T are x⌢w_1, ...,
x⌢w_l, where w_1, ..., w_l are all R-successors of x_n in W. This shows that for
every x ∈ path(T) and all n, l ≥ 0


(10.18) R x_n x_{n+l}.

Define for a path x ∈ path(T)

g(x) := λn. f(x_n), that is (g(x))_n = f(x_n)

and similarly for a finite sequence x ∈ T

g(x) = λn. f(x_n).


In both cases g(x) is an R-monotone sequence by (10.18) and R-monotonicity
of f, that is g(x) ∈ path(T) for a path x and g(x) ∈ T for a finite sequence x.
Moreover, g is continuous on path(T) since g(x)↾n is determined by the finite
sequence x↾n, so that for the corresponding basic open sets we have:


g^{-1}(B_{g(x↾n)}) ⊇ B_{x↾n}.

Define for x ∈ path(T):

W(x) = lim_n{x_n}.

To finish, we prove that W : path(T) → W is commuting (Definition 10.53).
W is continuous. Indeed, W(x) = w implies Rwx_n, Rx_n w for all n ≥ n_0.
Hence for every y with y↾n_0 = x↾n_0 and n ≥ n_0 we have Rwy_n. This implies
Rw lim_n{y_n}, that is RwW(y). This proves W(y) ∈ O_w, that is W^{-1}(O_w) ⊇
B_{x↾n_0}.

W is open, since w ∈ W(B_y) implies O_w ⊆ W(B_y). Indeed, we have
W(x) = w for some path x extending y, implying Ry_n w for n = length(y) − 1
(the last component of y). For an arbitrary w′ ∈ O_w we have Ry_n w′. Define


z_i := y_i   if i ≤ n,
z_i := w′    if i > n.


Then z ∈ B_y and W(z) = lim_i{z_i} = w′ as required. Finally W commutes
with f, g, that is Wg = fW:


W(g(x)) = lim{g(x)_n} = lim{f(x_n)} = f(lim{x_n}) = f(W(x)).

QED


REMARK 10.62 We should note that, by an argument similar to the arguments
in Sec. 6, we can show that


C, H ⊨ A iff A ∈ S4◯,


where C is Cantor Space and H is the class of homeomorphisms. Thus S4◯
is the ◯□ logic of homeomorphisms on C just as S4C is the ◯□ logic of
continuous functions on C.


























8. Conclusion 


Let’s outline the general picture and possible direction for future work. There 
are complete axiomatizations of general topology (S4) and of temporal logic 
(LTL). For the logic of dynamic systems, the “one-step” case in the language
L^{□○} is also axiomatizable, and even decidable, both when the action ○ is
only continuous and when it is a homeomorphism. Adding the trajectory
connective ∗p, corresponding to ⋀_n ○ⁿp, results in an undecidable logic (Konev
et al., 2006b). However, significant fragments are (claimed to be) axiomatizable
(Konev et al., 2006b) for continuous actions. On the other hand, the general
case of the measure-preserving action, the case that provided the initial impetus
for our investigation, turns out to be non-axiomatizable (Konev et al., 2006a).
This does not prevent the possibility of a fruitful investigation by means of 
logic, similar to investigations in first order arithmetic or second order logic. 

At this moment the most urgent task seems to be a propositional axiomatiza- 
tion allowing sufficiently many of the “routine” derivations in general topolog- 
ical dynamics, for example from (Aiken, 1993). Since the latter often involve 
a treatment of action as a relation, not just a function, a use of branching time 
temporal logic might be needed. Some of the problems left open by the current 
investigation may be of interest too, although it is difficult to predict which of 
these may be useful for mainstream mathematics. Let us mention the complete
axiomatization of all continuous functions in the full trimodal language L (i.e.
with modalities □, ○, and ∗) and an axiomatization in the language L^{□○} of
continuous functions on the real segment [0, 1].
1. Introduction


Our goal is to make relativity theory accessible and transparent for any reader 
with logical background. The reader does not have to “believe” anything. The 
emphasis is on the logic-based approach to relativity theory. The purpose is
to give insights as opposed to mere recipes for calculations. Therefore proofs
will be visual geometric ones, and efforts will be made to replace computational
proofs with suggestive drawings.

Relativity theory comes in (at least) two versions, special relativity (Einstein
1905) and general relativity (Einstein, Hilbert 1915). They differ in scope: the
scope of general relativity is broader. Special relativity is a theory of motion and
light propagation in vacuum far away from any gravitational object. I.e. special
relativity does not deal with gravity. Also, special relativity is a “prelude”
for general relativity, it provides a foundation or starting point for the general 
theory. General relativity unifies special relativity and the theory of gravitation. 
In some sense, general relativity is an “extension” of special relativity also 
putting gravity into the picture. General relativity can be used as a foundation 
for cosmology, e.g. it is a suitable framework for discussing the (evolution, 
properties of the) whole universe (expanding or otherwise). Special relativity, 
on the other hand, is not rich enough for this purpose. General relativity also 
provides the theory of black holes, wormholes, timewarps etc. Special relativity 
shows us that there is no such thing as space in itself; instead, a unified
space-time exists. This inseparability of space and time becomes more dramatic in
general relativity. Namely, general relativity shows us that gravity is nothing
but the curvature of space-time. It is extremely difficult, if not impossible, to
explain gravity without invoking the curvature (i.e. geometry) of space-time.
The crucial point is that curvature of space is not enough (by far); it is space-time
whose curvature explains gravity.¹ From a different angle: general relativity
is a “geometrization” of much of what we know about the world surrounding 
us. E.g. it provides a full geometrization of our understanding of gravity and 
related phenomena like motion and light signals. 

In Sec. 2 we study special relativity, in Sec. 3 we do the same for general 
relativity, in Sec. 4 we apply the so obtained tools to black holes, wormholes, 
timewarps. The emphasis is on the space-time aspects. Sec. 5 is about connec- 
tions with the literature. 


2. Special relativity 


In this section, among others, we give a first-order logic (FOL) axiom sys- 
tem for special relativity such that we use only a handful of simple, streamlined 
axioms. In our approach, axiomatization is not the end of the story, but rather 
the beginning. Namely: axiomatizations of relativity are not ends in them- 
selves (goals), instead, they are only tools. Our goals are to obtain simple, 
transparent, easy-to-communicate insights into the nature of relativity, to get a 
deeper understanding of relativity, to simplify it, to provide a foundation for it. 
Another aim is to make relativity theory accessible for many people (as fully 
as possible). Further, we intend to analyze the logical structure of the theory: 
which assumptions are responsible for which predictions; what happens if we 
weaken/fine-tune the assumptions, what we could have done differently. We 
seek insights, a deeper understanding. We could call this approach “reverse 
relativity” in analogy with “reverse mathematics”. 


2.1 Motivation for special relativistic kinematics in 
place of Newtonian kinematics 


Why should we replace Newtonian Kinematics with such an exotic or coun- 
ter-common-sense theory as special relativity? The Newtonian theory proved 
very successful for 200 years. By now, the Newtonian picture of motion has 
become the same as the current common-sense picture of motion. Hence the 





¹If we took into account the curvature of space only, then apples would no longer fall down from trees.
Gravitational attraction as such would disappear. On the other hand, if we keep the temporal aspects of 
curvature but ignore curvature of pure space, then gravity would not disappear, instead, this would cause 
only minor discrepancies in predicting trajectories of very fast moving bodies (relative to the source of 
gravity, e.g. the Earth or a black hole). 
question is why we have to throw away our common-sense understanding of
motion.²

The answer is that there are several independently good reasons for replac- 
ing the Newtonian worldview with relativity. These reasons are really good 
and decisive ones. They are so compelling, that any one of them would be 
sufficient for justifying and motivating our replacing the Newtonian worldview 
with relativity. We will mention a few of these reasons, but for simplicity of 
presentation, we will base this work on a fixed one of these reasons, namely on 
the outcome of the Michelson-Morley experiment. We will call this outcome 
of the Michelson-Morley experiment the Light Axiom. There are deeper, more 
philosophical reasons for replacing the Newtonian worldview with relativity 
theory, which might convince readers who are not experimentally minded, i.e. 
who are not easily convinced by mere facts about how results of certain
experiments turned out. These philosophical reasons (under the name “principles
of relativity”) are intimately intertwined with issues which were significantly
present through the last 2500 years of the history of our culture; see p. 663 
and Barbour, 1989. 

We now turn to the Light Axiom which will play a central role in this chapter. 
The first test of the Light Axiom was the Michelson-Morley experiment in 1887 
and it has been tested extremely many times and in many radically different ways 
ever since. As a consequence, the Light Axiom has been generally accepted. 
An informal, intuitive formulation of the axiom is the following. (Later we 
will present this axiom in more formal, more precise terms, too, see AXPh in 
Sec. 2.3.) 

Light Axiom: The speed of light is finite and direction independent, in the 
worldview of any inertial observer. 

In other words, the Light Axiom means the following. Imagine a (huge) space- 
ship drifting through outer space in inertial motion. (Inertial here means that 
the rockets of the spaceship are switched off, and that the spaceship is not spin- 
ning.) Assume a scientist in this inertial spaceship is making experiments. The 
claim is that if the scientist measures the speed of light, he will find that this 
speed is the same in all directions and that it is finite. It is essential here that 
this is claimed to hold for all possible inertial spaceships irrespective of their 
velocities relative to the Earth or the Sun or the center of our galaxy or whatever 
reference system would be chosen. The point is that no matter which inertial 
spaceship we choose, the speed of light in that spaceship is independent of the 
direction in which it was measured, i.e. it is “isotropic”. 

In the technical language what we called “inertial spaceship” above is called
an inertial reference frame, and the scientist in the spaceship making the
experiments is called an “observer”. Later “observer” and reference frame tend to
be identified.³





²A second, equally justified question would ask why exactly those postulates/axioms are assumed in relativity
which we will assume. We will deal with both questions.

Let us notice that the Light Axiom is surprising, it is in sharp contrast with 
common-sense. Namely, common-sense says that if we send out a light signal 
from Earth, and a spaceship is racing with this light signal moving with almost 
the speed of the signal in the same direction as the signal does, then the velocity 
of the signal relative to the spaceship should be very small. Hence, one would 
think that the astronaut in the spaceship will observe the motion of the light 
signal as very slow. With the same kind of reasoning, the astronaut should 
observe light signals moving in the opposite direction very fast. But the Light 
Axiom states that light moves with the same speed in all directions for the 
astronaut in the spaceship, too. Hence, the Light Axiom flies in the face of 
common-sense. This gives us a hint/promise that very interesting, surprising 
things might be in the making. See also Fig. 11.18 on p. 642. 
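
The common-sense expectation described above can be made concrete with a small numerical sketch. It assumes units in which the speed of light is 1, motion along a single line, and Newtonian (Galilean) subtraction of velocities; the function name and the value 0.9 for the spaceship are illustrative choices only.

```python
from fractions import Fraction

C = Fraction(1)  # speed of light, in units where c = 1 (an assumed convention)


def galilean_relative_velocity(signal_velocity, ship_velocity):
    """Velocity of a signal relative to the ship under Newtonian velocity subtraction."""
    return signal_velocity - ship_velocity


ship = Fraction(9, 10) * C  # the spaceship races the signal at 0.9 light-speed

# Signal moving in the same direction as the ship, and in the opposite direction.
same_direction = galilean_relative_velocity(C, ship)
opposite_direction = galilean_relative_velocity(-C, ship)

print(abs(same_direction))      # common sense: the signal looks very slow
print(abs(opposite_direction))  # common sense: the signal looks very fast
print(abs(same_direction) == C)  # the Light Axiom would require this to be True
```

Under these assumptions the two common-sense speeds differ from each other and from the speed of light, whereas the Light Axiom says the astronaut measures the same speed in both directions.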

In fact, if we add the Light Axiom to Newtonian Kinematics, then we obtain 
a logical contradiction. I.e. (Newtonian Kinematics + Light Axiom) is an 
inconsistent theory in the usual sense of logic as we will outline soon (cf. 
Proposition 11.1). Seeing this contradiction, Einstein did the natural thing. He 
weakened Newtonian Kinematics (NK for short) to a weaker theory NK⁻ such
that NK⁻ became consistent with the Light Axiom. Then the theory (NK⁻
+ Light Axiom) became known as Special Relativistic Kinematics (SRK for
short). We will study this theory under the name Specrel, to be introduced 
in a logical language soon. We represent the above outlined process by the 
following diagram: 


(NK + Light Axiom) leads to Contradiction (!)

        ↓

NK gets replaced with the weaker NK⁻

        ↓

(NK⁻ + Light Axiom) receives the name Special Relativity (SRK).


SRK is consistent (this will be proved in Corollary 11.12, p. 644). 


To see the above process more clearly, let us invoke a possible axiomatization 
of NK, still on the intuitive level. 

Preparation for NK: If we want to represent motion of “particles” or “bod- 
ies” or “mass-points”, it is natural to use a 4-dimensional Cartesian coordinate 
system R × R × R × R (where R is the set of real numbers), with one time
dimension t and three space dimensions x, y, z. A three-dimensional part of





³However, it is good to keep in mind that some thought-experiments are carried out by a team of observers
(and if the members of this team do not move relative to each other then they are called, for simplicity, a 
single observer). 
this is depicted in Fig. 11.1. The time-axis t is drawn vertically.


Figure 11.1. A space-time diagram. Worldlines of bodies b1, b2, b3 represent motion.
(Coordinate z is not indicated in the figure.) b3 is motionless and b1 moves faster than b2.


Representing the motion of a body, say b, in a 4-coordinate system can be done by
specifying a function f which to each time instant t ∈ R tells us the space coordinates
x, y, z where the body b is found at time t. Hence a function f : Time → Space
specifies motion of a particle in this sense. The function f representing motion
of b is called the worldline or lifeline of b. Fig. 11.1 represents motion of bodies
in this spirit. Besides the coordinate axes, we have represented the worldlines
of inertial bodies b1, b2 and b3 in Fig. 11.1. The straight line labeled by b1 is
the worldline of b1. The slope of the worldline of b1 (its tilt away from the
time-axis) is greater than that of b2, which means that b1 moves faster than b2 does.
The worldline of the third body b3 is parallel with the time-axis; this means that
b3 is motionless. Bodies b1 and b2 meet at space-time point p = (t, x, y, z). Such a
meeting (of two or more bodies) is called an event. We will extensively refer to such
4-dimensional coordinate systems and such worldlines of bodies and events.
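
The description of worldlines as functions from Time to Space can be sketched in a few lines of code. This is only an illustration under stated assumptions: the starting positions and velocities of b1, b2, b3 below are invented values chosen so that b1 is faster than b2 and b3 is at rest, and the helper names are hypothetical.

```python
from fractions import Fraction


def make_worldline(start, velocity):
    """Worldline of an inertial body: maps a time t to space coordinates (x, y, z)."""
    def worldline(t):
        return tuple(s + v * t for s, v in zip(start, velocity))
    return worldline


def speed_squared(velocity):
    """Squared speed; enough for comparing speeds without taking square roots."""
    return sum(v * v for v in velocity)


# Illustrative velocities (assumed values, not taken from the figure).
v1 = (Fraction(1, 2), Fraction(0), Fraction(0))
v2 = (Fraction(1, 4), Fraction(0), Fraction(0))
v3 = (Fraction(0), Fraction(0), Fraction(0))

b1 = make_worldline((Fraction(0), Fraction(0), Fraction(0)), v1)
b2 = make_worldline((Fraction(1), Fraction(0), Fraction(0)), v2)
b3 = make_worldline((Fraction(3), Fraction(0), Fraction(0)), v3)

# b1 and b2 occupy the same place at time 4: this meeting is an event p = (t, x, y, z).
t_meet = Fraction(4)
event_p = (t_meet,) + b1(t_meet)

print(b1(t_meet) == b2(t_meet))                  # the two bodies meet
print(speed_squared(v1) > speed_squared(v2))     # b1 moves faster than b2
print(b3(Fraction(0)) == b3(Fraction(10)))       # b3 is motionless
```

Exact rational arithmetic is used so that the meeting of two worldlines can be tested with equality rather than with a floating-point tolerance.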


The axioms of NK are summarized as (i)–(v) below.


(i) Each observer “lives” in a 4-coordinate system as described above. The
observer in his own coordinate system is motionless in the origin, i.e. his
worldline is the time-axis.⁴

(ii) Inertial motion is straight: Let o be an arbitrary inertial observer and let 
b be an inertial body. Then in o’s 4-coordinate system the worldline of b is a 





⁴It is sufficient to assume that his worldline is parallel with the time-axis.
straight line. I.e. in an inertial observer’s worldview or 4-coordinate system all
worldlines of inertial bodies appear as straight lines. 

As we said, an observer in his metaphorical “spaceship” is inertial if his 
rockets are turned off and the spaceship is not spinning. In special relativity, we 
discuss only inertial motion, hence in our axiomatization the adjective “inertial” 
could be omitted. (Of course, then we need a general claim that only inertial 
things/objects will be studied.) 

(iii) Motion is permitted: In the worldview or 4-coordinate system of any 
inertial observer it is possible to move through any point p in any direction with 
any finite speed. 

(iv) Any two observers “observe” the same events. I.e. if according to o1
bodies b1 and b2 have met, then the same is true in the 4-coordinate system of
any o2. We postulate the same for triple meetings, e.g. of b1, b2, b3.

(v) Absolute time: Any two observers agree about the amount of time elapsed 
between two events. (Hence temporal relationships are absolute.) 

So, now, NK is defined as the theory axiomatized by (i)-(v) above. 

It is easy to see that (NK + Light Axiom) is inconsistent. Einstein’s idea
was to check which ones of (i)–(v) are responsible for contradicting the Light
Axiom and to throw away or weaken the “guilty” axioms of NK. We will see
that (v) is guilty and that part of (iii) is suspicious. Hence we throw away (v)
and weaken (iii) to a safer form (iii⁻), where (iii⁻) is the following.

(iii⁻) Slower-than-light motion is possible: in the worldview of any inertial
observer, through any point in any direction it is possible to move with any 
speed slower than that of light (here, light-speed is understood as measured at 
that place and in that direction where we want to move). 

In the formal part we will carefully study whether all of these modifications 
are really needed and to what extent (cf. Theorems 11.4, 11.7). We define NK⁻
as the remaining theory:


NK⁻ := {(i), (ii), (iii⁻), (iv)}

and Special Relativistic Kinematics is defined as

SRK := (NK⁻ + Light Axiom).


The formalized version of this SRK will appear later as the theory Specrel₀.
We will prove that Specrel₀ is consistent (i.e. contradiction-free) and will study
its properties. Therefore SRK is also consistent, since, as we said, Specrel₀
is a formalized version of SRK. Actually, the whole process of arriving from 
NK and the Light Axiom (or some alternative for the latter) to SRK will be 
subjected to logic-based conceptual analysis in Sec. 2.5. 

Before turning to formalizing (and studying) Special Relativity SRK in logic, 
let us prove (informally only) on the present level of precision why absolute 
time (i.e. axiom (v)) is excluded by the Light Axiom, or more precisely, it is
excluded if we want to keep a fragment of our intuitive picture of the world,
i.e. if we want to keep (i), (ii), (iv) of NK. We will prove:


(NK⁻ + Light Axiom) ⊢ Negation of (v),


where we use turnstile “⊢” as the symbol of logical provability or derivability.
I.e. A ⊢ B means that from statement A one can prove, rigorously, statement B.

Actually, we will prove something stronger and stranger from the Light 
Axiom (and a fragment of NK~). We will prove that the time elapsed between 
two events may be different for different observers even in the special case when 
this elapsed time is zero for one of the observers. I.e. the very question whether 
two events happened at the same time or not will depend on the observer: two 
events A and B may happen at the same time for me, while event A happened 
much later than event B for the Martian in his spaceship. We will refer to 
this phenomenon by saying that “simultaneity is not absolute”. Moreover, we
will see later (Corollary 11.5) that the temporal order of some events may be 
switched: event A may precede event B for me, while for the Martian in his 
spaceship, event B precedes event A. 

We say that events e and e′ are simultaneous for observer O if in O’s
coordinate system the two events e, e′ happen at the same time.


PROPOSITION 11.1 (SIMULTANEITY IS NOT ABSOLUTE) Assume SRK. 
Moving clocks get out of synchronism, i.e.: Assume that a spaceship S is in 
uniform motion relative to another one, say E, and assume that two events e, e' 
happen simultaneously at the rear and at the nose of the spaceship S according 
to the spaceship S. Then e and e' take place at different times in E’s coordinate 
system. 

I.e., the time elapsed between e and e' is zero as “seen” from the spaceship
S, but the time elapsed between e and e' is nonzero as “seen” from E. See 
Fig. 11.3. 


Intuitive proof Assume that we are in spaceship E, and let us call E “Earth”. 
Assume that spaceship S (let us call it “Spaceship”) moves away from us in a
uniform motion with, say, 0.9 light-speed. The captain of Spaceship positions 
his brothers called Rear, Middle, and Nose at the rear, middle and nose of the 
spaceship, respectively, and asks Rear and Nose to switch on their flashlights 
towards Middle exactly at the same time. Then the light signals (photons⁵) Ph1
and Ph2 from the two flashlights arrive to Middle at the same time, because 





⁵We use the word “photon” as a synonym for light signal. It tacitly refers to the corpuscular conception of
light. In this work we do not need the quantum-mechanical definition of photons. (That will be needed only 
in the final, as yet nonexistent, generalization of general relativity called quantum gravity.) 
Middle is exactly in the middle of the spaceship, and because the speed of Ph1
sent by Rear is the same as the speed of Ph2 sent by Nose (by the Light Axiom). 
See Fig. 11.2. 













Figure 11.2. Seen from Spaceship, the two light-signals (i.e. photons) Ph1 and Ph2 are sent
out at the same time, and meet in the middle. This is indicated by the clocks at the rear and at
the nose of the spaceship. Notice that time in this figure is running upwards! I.e., this figure is
similar to drawings in cartoons in that a sequence of scenes is represented in it. However, here 
the temporal order of the scenes is switched: the scene at the bottom took place earliest. The 
reason for this convention is our seeking compatibility with the usual space-time diagrams like 
Fig. 11.1. 


How do we see all this from the Earth? We see that Rear and Nose send light 
signals (or photons) Ph1 and Ph2 towards Middle, and we also see that Ph1, Ph2 
arrive to Middle at the same time (because this is a 3-meeting of bodies/entities 
and axiom (iv) in NK⁻). (Spaceship’s hull is missing, we can imagine it having
only a grid of metal rods for keeping it together or something to this effect.) 
However, by the Light Axiom, the speeds of Ph1 and Ph2 are the same for us 
on the Earth, too. Since Spaceship moves away from us (with 0.9 light-speed), 
we see Ph1 crawl very slowly along the hull of Spaceship because the ship is 
“running away” from us (and from Ph1, too). On the other hand, the other 
photon, Ph2, flashes along the hull of the spaceship towards us with enormous 
relative speed (relative to the hull of the spaceship). Because of this difference
of their speeds relative to Spaceship, according to Earth, Ph1 and Ph2 either 
meet close to the rear of the spaceship, or if they meet in the middle, then Nose 
had to switch on his flashlight much later than Rear did. See Fig. 11.3. 


Figure 11.3. Seen from the Earth, the photon Ph2 had to be sent out later in order that it arrive 
in the middle at the same time as Ph1 does. But seen from Spaceship, they were sent out at the 
same time. Hence the clocks at the nose and the rear are out of synchronism, as seen from the 
Earth. 


So far we proved that at least one of two things cannot be absolute. These are 
(a) being in the middle of the spaceship, and (b) simultaneity. Here, (a) means 
that Spaceship observes Middle in the middle of the ship, while Earth observes 
that Middle is not in the middle of the ship; and (b) means that emissions of 
photons Ph1 and Ph2 are simultaneous for Spaceship but not for Earth. 

The first possibility is that Middle stands closer to the rear of Spaceship as 
seen from the Earth, i.e. that he is not in the middle of the ship according to Earth 
observers, while he is in the middle according to the ship observers. Here is a 
thought-experiment which shows that this is not possible. Let us ask the captain 
to give mirrors to Rear and Nose, and order Middle to send photons Ph3, Ph4 
at the same time to these two mirrors. Since Middle is exactly in the middle of 
the ship, the bounced-back photons arrive to him at the same time, by the Light 
Axiom. By (iv) in NK⁻, we on the Earth also see that the two photons Ph3, Ph4
meet again at Middle after bouncing back, so they traveled their round-trips 
in the same amount of time. We will show that, as seen from the Earth, the 
time needed for the round-trip is proportional to the covered distance: if, say, 
Nose is twice as far from Middle as Rear is, then the time needed for Ph4 for 
the round-trip Middle-Nose-Middle is twice as much as the time needed for 
Ph3 for the round-trip Middle-Rear-Middle, even in a fast-moving spaceship. 
From the Earth we see that the round-trip took the same time for Ph3 and for 
Ph4, therefore we have to infer that Middle is really in the middle of the ship. 
See Fig. 11.4. 

We now prove that the time needed for the round-trip is proportional to the 
covered distance. Indeed, assume that the distance Middle-Nose is twice as 
much as the distance Middle-Rear. We will show that the round-trip Middle- 
Nose-Middle takes twice as much time for a photon Ph4 as the round-trip 
Middle-Rear-Middle for a photon Ph3. Let us watch from the Earth how the 
two photons Ph3 and Ph4 move relative to the spaceship (as in Fig. 11.4, but 
now Middle standing closer to Rear). We will see that Ph3 covers the seg- 
ment Middle-Rear fast, traveling towards us, and then covers the segment 
Rear-Middle slowly, moving away from us. The same way, Ph4 covers the 
segment Middle-Nose slowly, moving away from us, while Ph4 covers the seg- 
ment Nose-Middle fast, moving towards us. The relative speed of Ph4 in the 
“towards-us” segment Nose-Middle is the same as the relative speed of Ph3 in 
the “towards-us” segment Middle-Rear; hence this part of the trip takes twice as 
much time for Ph4 as for Ph3 because we assumed that the distance Nose-Middle 
is twice as much as the distance Middle-Rear. The situation is completely anal- 
ogous for the “away-from-us” segments, so the trip Middle-Nose takes twice as 
much time for Ph4 as the trip Rear-Middle for Ph3. Summarizing the segments, 
the round-trip takes twice as much time for Ph4 as for Ph3. 
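
The proportionality argument can also be written as a short calculation. This is a sketch under the following assumptions: everything is measured in Earth's coordinates, the ship moves with speed v < c, light moves with speed c in both directions (Light Axiom), and d is the Earth-measured length of the segment in question. The symbols v, c, d and T are introduced here only for illustration.

```latex
\begin{align*}
t_{\text{towards}} &= \frac{d}{c+v}, \qquad t_{\text{away}} = \frac{d}{c-v},\\[2pt]
T(d) &= \frac{d}{c+v} + \frac{d}{c-v} = \frac{2cd}{c^{2}-v^{2}},\\[2pt]
T(2d) &= 2\,T(d).
\end{align*}
```

Since T(d) is a constant multiple of d, two round-trips that take the same time must cover equal distances, which is the step used to conclude that Middle stands in the middle of the ship.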

As we said earlier, we observe from the Earth that Ph3, Ph4 and Middle 
meet in a single event. Therefore, since we observe that Ph3 arrives to Middle 
exactly when Ph4 arrives to Middle after their round-trips, we have to infer, on 
the Earth, that Middle really stands exactly in the middle of Spaceship. There 
remains only the possibility that Nose sent out his photon Ph2, which we see 
as fast-moving along the hull of the space ship, much later than Rear sent Ph1 
which we see as slowly-moving along the hull of the spaceship. Thus, as seen 
from the Earth, the clocks at the nose and at the rear of the spaceship show 
different times (at the same Earth-moment). This is what we mean when we 
say that clocks of the spaceship get out of synchronism. 

Summing up: Let e and e′ be the events when Rear sends his photon Ph1
towards Middle, and when Nose sends his photon Ph2 towards Middle,
respectively. Then these two events took place at the same time as seen from
Spaceship, while as seen from the Earth, e′ took place later than e did. This
finishes the proof of Proposition 11.1. QED


Figure 11.4. The round-trip for Ph3 takes the same time as for Ph4, seen both from Spaceship
and from the Earth. Hence Earth infers that Middle is indeed in the middle of the ship.
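
The size of the desynchronization seen from the Earth can be estimated with a small computation. This is a sketch under stated assumptions: units with light-speed 1, a ship of Earth-measured length 1 moving at 0.9 light-speed, and Middle at the midpoint of the ship as established in the proof. The function name and its parameters are hypothetical.

```python
from fractions import Fraction


def emission_delay(ship_length, ship_speed, light_speed=Fraction(1)):
    """Earth-frame time by which Nose must emit later than Rear
    so that both photons reach Middle in the same event."""
    half = ship_length / 2
    # Ph1 (from Rear) chases Middle, which runs away from it.
    time_from_rear = half / (light_speed - ship_speed)
    # Ph2 (from Nose) and Middle move towards each other.
    time_from_nose = half / (light_speed + ship_speed)
    return time_from_rear - time_from_nose


delay = emission_delay(Fraction(1), Fraction(9, 10))
print(delay)         # 90/19
print(float(delay))  # roughly 4.74 time units

# For a ship at rest relative to the Earth there is no delay at all.
print(emission_delay(Fraction(1), Fraction(0)))  # 0
```

The delay vanishes exactly when the ship does not move relative to the Earth, which matches the claim that it is moving clocks that get out of synchronism.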


Let us notice that Proposition 11.1 above is a far reaching claim. It implies 
that one of the most basic words of natural language refers to an illusion only 
and carries no real meaning. The word in question is the word “now”. 

In order to be able to carry out the proof of Proposition 11.1 and other 
similar chains of thought in the “safe”, precise setting of mathematical logic, in 
the next subsection we introduce a first-order language in which we formalize 
our axioms, statements, and proofs. 


2.2 Language 


Motivation for language. We want to talk about space-time as relativity the- 
ory conceives it. We will talk about space-time as experienced through motion. 
Though we discuss here kinematics (theory of motion) only, one can derive (log- 
ically) dynamic predictions of relativity, too (i.e. phenomena involving forces, 
energy) using the same approach/axioms.⁶ We will use first-order logic, and
the most important decision is to choose the language (vocabulary), i.e. what 
objects and what relations between them will belong to the language. We will 
specify our first-order language by specifying its models. 

We want to axiomatize motion. What moves? Bodies. Hence our model 
has a universe B for bodies. What does it mean to move? To move means 
changing location in time. We will have coordinate systems, or reference frames
in other words, for marking locations and time, and we will use quantities in 
setting up these coordinate systems. So our model has another universe Q for 
quantities. We will think of quantities as real numbers, so Q together with the 
operations +, *, < will form a linearly ordered field. We will think of coordinate 
systems as belonging to special bodies called observers. Hence Ob is a one- 
place relation on B (picking out a subset of B). We will have another kind 
of special bodies, photons, too. Hence Ph is another one-place relation on B. 
The heart of our model is the so-called worldview-relation W. This is a 6-place 
relation connecting bodies and quantities. We think of W (o, b, t, x, y, z) as the 
statement that the body b is in location xyz at time t in observer o’s coordinate 
system. We will simply pronounce this as 


o sees the body b at txyz


though this has no connection with optical seeing, instead, it is an act of co- 
ordinatizing only. With this intuition in mind we now fix the language of our 
theories of special relativity. 


The language. We fix a natural number n > 1; it will be the number of
space-time dimensions. In most works n = 4, i.e. one has 3 space-dimensions and one
time-dimension. Recent generalizations of general relativity in the literature 
indicate that it might be useful to leave n as a variable (e.g. string-theory uses 
11 dimensions). 

We declare two sorts of objects. One sort is for “quantities”, it will be 
denoted by Q. (This is the same as “real numbers” in other treatments.) We 
have two-place (i.e. binary) operation symbols +,* and a 2-place predicate 
symbol < of sort “quantities”. To avoid misunderstandings, we emphasize that, 
in this chapter, Q is not the set of rational numbers. (The letter Q abbreviates 
“quantities”. It is a coincidence that the same letter is used in the literature to 
denote the rationals. We do not follow that convention.) 

The other sort, B, is for entities which do the “moving”. We will call these 
“bodies”. (We call the moving entities “bodies” whatever they may be, in 
reality they can be e.g. coordinate systems or electromagnetic waves, or centers 





⁶For the spirit of this we refer to the relativity textbook Rindler, 2001, Sec. 6 “Relativistic particle mechanics”.
Rindler, 2001, Sec. 6.2 is particularly relevant here. 
of mass.) We have two kinds of special bodies, observers and photons. Thus
Ob and Ph are one-place predicate symbols of sort B. 

We have a relation which connects these two sorts, the (n + 2)-place relation
W which is of sort B × B × Q × ··· × Q. The sentence “observer o observes
body b at space-time location p1, ..., pn” is denoted as W(o, b, p1, ..., pn), or
as W(o, b, p) in short.


Summing up, a model 𝔐 of our language is of form

(Q, +, *, <; B, Ob, Ph; W)


where (Q, +, *, <) is a structure similar to ordered fields, Ob, Ph are subsets
of B, and W ⊆ B × B × Q × ··· × Q.
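
The shape of such a model can be mirrored by a small data structure. This is a minimal sketch, not the chapter's formalism: Python's Fraction (the rationals) stands in for the quantity sort Q purely for convenience, only finitely many W-facts are stored, and the class and method names are hypothetical.

```python
from dataclasses import dataclass
from fractions import Fraction


@dataclass(frozen=True)
class Model:
    bodies: frozenset      # the universe B
    observers: frozenset   # Ob, a subset of B
    photons: frozenset     # Ph, a subset of B
    worldview: frozenset   # W, stored as triples (o, b, p) with p an n-tuple
    n: int = 4             # number of space-time dimensions

    def is_well_formed(self):
        """Check Ob, Ph are subsets of B and W is a subset of B x B x Q^n."""
        if not (self.observers <= self.bodies and self.photons <= self.bodies):
            return False
        return all(
            o in self.bodies
            and b in self.bodies
            and len(p) == self.n
            and all(isinstance(q, Fraction) for q in p)
            for (o, b, p) in self.worldview
        )

    def W(self, o, b, p):
        """True iff observer o coordinatizes body b at location p."""
        return (o, b, tuple(p)) in self.worldview


zero = Fraction(0)
one = Fraction(1)
toy = Model(
    bodies=frozenset({"o", "ph"}),
    observers=frozenset({"o"}),
    photons=frozenset({"ph"}),
    worldview=frozenset({
        ("o", "o", (zero, zero, zero, zero)),
        ("o", "ph", (one, one, zero, zero)),
    }),
)

print(toy.is_well_formed())                     # True
print(toy.W("o", "ph", (one, one, zero, zero)))  # True
```

Storing W as a set of tuples keeps the two sorts visibly separate: the first two components range over bodies, the remaining n over quantities.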


2.3 Axiomatization Specrel of special relativity in 
first-order logic 


In this subsection we formalize the axioms we talked about on an intuitive 
level in Sec. 2.1, by using the first-order language introduced in the previous 
subsection. The formalized version of (NK⁻ + Light Axiom) will be called
Specrel₀.


Axiom 1 (AxField). The quantities behave like real numbers do in the sense 
that (Q,+,*, <) is a linearly ordered field in which every positive member 
has a square root. Such fields are called “quadratic”. 


For an axiom system for linearly ordered fields we refer to e.g. Chang and
Keisler, 1973, p. 41, below item 18. We will often simply say “field” or
“ofield” or “ordered field” instead of “linearly ordered field”. We recall that
if (Q, +, *, <) is a field, then 0 and 1 denote the neutral elements of + and *
respectively, and an element x ∈ Q is called positive iff x > 0. Further, y is
called a square root of x iff y * y = x and y ≥ 0; we denote this by writing
y = √x. With this notation, the absolute value |y| of y is |y| := √(y²) (as usual,
y² denotes y * y). The inverses of the operations + and * will be denoted by −
and /, respectively. Thus a linearly ordered field is quadratic (or Euclidean) iff
(∀x > 0)(∃y) x = y * y is true in it. According to the usual practice, we will
often omit * from an expression, e.g. we write td in place of t * d.
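
To see that the square-root condition is a genuine restriction, one can check it on the ordered field of rational numbers, which is not quadratic. The sketch below is only an illustration; the helper name is hypothetical, and it relies on the fact that a rational in lowest terms is a square exactly when its numerator and denominator are both perfect squares.

```python
from fractions import Fraction
from math import isqrt


def rational_sqrt(x):
    """Return the nonnegative rational square root of x, or None if there is none."""
    if x < 0:
        return None
    # Fraction keeps numerator and denominator in lowest terms.
    num, den = x.numerator, x.denominator
    root_num, root_den = isqrt(num), isqrt(den)
    if root_num * root_num == num and root_den * root_den == den:
        return Fraction(root_num, root_den)
    return None


print(rational_sqrt(Fraction(9, 4)))  # 3/2
print(rational_sqrt(Fraction(2)))     # None: 2 is positive but has no rational square root
```

So the rationals satisfy the ordered-field axioms but fail the quadratic condition, while the real numbers satisfy both.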


On AxField: In most physics books, the set of quantities is taken to be the 
set of real numbers (with +, * as addition, multiplication of the real numbers). 
Some fancier books also use imaginary numbers for quantities. We know from
mathematics that much complexity is tied to the real numbers. Hence in our 
axiomatic approach, we single out those properties of the quantities that we 
rely on in the investigation in question. In special relativity only the quadratic 
ordered field-structure of the quantities is presupposed, but we could do much 
even with assuming only the ring-structure. In particular, the ordering and 
the existence of square roots are used mostly in order to be able to formulate
results in a simpler way. (E.g. we use square roots in expressing the distance
between two coordinate points. Without the use of square roots we always 
would have to talk about the square of distance. This would cause only an 
inconvenience but not an impossibility.) As a pay-off of this explicit way of 
handling the quantities, we can build models of special relativity with a finite 
field as structure of quantities, or we can use fields with infinitesimally small 
numbers. In general relativity, in addition to AxField it will suffice to use an 
axiom-schema called CONT, see Sec. 3.6. 


By a coordinate point, or space-time location, we understand an $n$-tuple (i.e.
a sequence of length $n$) $p = (p_1, \ldots, p_n)$ of elements of $Q$; the set of all these
$n$-tuples is denoted by $Q^n$. If $p \in Q^n$, then $p_1, \ldots, p_n$ are its components, i.e.
$p = (p_1, \ldots, p_n)$. We call $\bar{0} := (0, \ldots, 0) \in Q^n$ the ($n$-dimensional) origin,
and we call $\bar{t} := \{(x, 0, \ldots, 0) \in Q^n : x \in Q\}$ the ($n$-dimensional) time-axis.
By the worldline (or lifeline, or history) of a body $b$ as observed by the observer
$m$ we mean the set of space-time locations where $m$ observes $b$ to be present,

$$\mathrm{wline}_m(b) := \{p \in Q^n : W(m, b, p)\}.$$


Axiom 2 (AxSelf) An observer $m$ in his own coordinate system is motionless
in the origin (of space), i.e. his worldline is the time-axis: $\mathrm{wline}_m(m) = \bar{t}$.
As a formula of the FOL language this axiom is

$$(\forall m \in \mathrm{Ob})(\forall p \in Q^n)\,[\,W(m, m, p) \leftrightarrow p_2 = \cdots = p_n = 0\,].$$


Having a field in our language makes it possible to talk about straight lines.
We recall that the straight line going through $p, q \in Q^n$, $p \ne q$, is the set
$\{p + x \cdot (p - q) : x \in Q\}$. In the latter formula, $+$, $-$ and $\cdot$ denote operations
of $Q^n$ as a vector-space. We will often say just “line” for “straight line”.





Axiom 3 (AxLine) The motion of an observer as observed by any observer is
uniform, i.e. such that both the “spatial direction” and the “pace” of the motion
are constant (and “longest possible” with this property). In geometrical terms
this means that in each observer’s coordinate system, the worldline of an
observer is a straight line, i.e. $\mathrm{wline}_m(k)$ is a straight line for all $m, k \in \mathrm{Ob}$.
Formally,

$$(\forall m, k \in \mathrm{Ob})(\exists p, q \in Q^n)\big(W(m, k, p) \land W(m, k, q) \land p \ne q \land (\forall r \in Q^n)[\,W(m, k, r) \leftrightarrow (\exists x \in Q)\ r = p + x \cdot (p - q)\,]\big).$$








On AxLine: AxLine is a formalized version of postulate (ii) in Sec. 2.1. Later
we will consider non-uniform motions, too. We will call those motions “accelerated”
ones. Newton’s First Law of Motion states that “an object moves with
constant, uniform motion until acted on by a force”. A body is called “inertial”
if no force acts on it. Hence AxLine indicates that Ob denotes the set of inertial
observers when using AxLine.

We introduce the speed of a uniform motion. In geometric terms, this is the
“slant” or “slope” of the straight line (representing the motion). For $p, q \in Q^n$,
let $\mathrm{space}(p, q)$ and $\mathrm{time}(p, q)$ denote the spatial distance and the time-distance
between $p$ and $q$, respectively:





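$$\mathrm{space}(p, q) := \sqrt{(p_2 - q_2)^2 + \cdots + (p_n - q_n)^2}, \quad\text{and}$$

$$\mathrm{time}(p, q) := |p_1 - q_1| = \sqrt{(p_1 - q_1)^2}.$$

Now $\mathrm{speed}(p, q)$ denotes the speed necessary to reach $q$ from $p$ (or $p$ from $q$):

$$\mathrm{speed}(p, q) := \mathrm{space}(p, q)/\mathrm{time}(p, q) \quad\text{when } \mathrm{time}(p, q) \ne 0.$$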
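The definitions of space, time and speed can be tried out on concrete coordinate points. The following is a minimal sketch, assuming rational coordinates; since the rationals are not a quadratic field, it works with the squares of the three quantities (as the remark on AxField suggests is always possible). The function names `space_sq`, `time_sq` and `speed_sq` are hypothetical, chosen only for this illustration.

```python
from fractions import Fraction

def space_sq(p, q):
    """Square of space(p, q): sum over the spatial coordinates 2..n."""
    return sum((Fraction(a) - Fraction(b)) ** 2 for a, b in zip(p[1:], q[1:]))

def time_sq(p, q):
    """Square of time(p, q): only the first (time) coordinate contributes."""
    return (Fraction(p[0]) - Fraction(q[0])) ** 2

def speed_sq(p, q):
    """Square of speed(p, q); undefined, as in the text, when time(p, q) = 0."""
    t2 = time_sq(p, q)
    if t2 == 0:
        raise ValueError("speed(p, q) is undefined when time(p, q) = 0")
    return space_sq(p, q) / t2

origin = (0, 0, 0, 0)
print(speed_sq(origin, (5, 3, 4, 0)))  # 1: the two points lie on a line of slope 1
print(speed_sq(origin, (2, 1, 0, 0)))  # 1/4: speed 1/2, slower than light
```

A squared speed of 1 corresponds to the photon-like lines of slope 1, and a squared speed below 1 to lines of slant smaller than 1.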


Axiom 4 (AxPh) For every observer, the speed of light is 1, and moreover,
photons move uniformly along straight lines and in each location in each
direction it is possible to send out a photon. In geometrical terms this means
that the worldlines of photons are exactly the straight lines of slope 1. Formally
this is:

$$(\forall m \in \mathrm{Ob})(\forall ph \in \mathrm{Ph})\big[\,(\mathrm{wline}_m(ph) \text{ is a straight line}) \land (\forall p, q \in Q^n)\big(p \ne q \to (\mathrm{speed}(p, q) = 1 \text{ iff } (\exists ph \in \mathrm{Ph})[\,W(m, ph, p) \land W(m, ph, q)\,])\big)\big].$$


On AxPh: This is the formal version of the Light Axiom used in Sec. 2.1. It 
expresses that the speed of light is finite (nonzero) and isotropic, i.e. direction- 
independent. We formulated the Light Axiom in a seemingly stronger form, 
namely such that we require the speed of light to be 1. This way we are freed 
from having to deal with always adjusting everything to the actual speed of light. 
Instead, we adjust the units of measurement to the speed of light: we measure 
distances with “light-years” if we measure time in “years”. We emphasize that 
assuming that the speed of light is 1 instead of some finite direction independent 
number (which might depend on the observer) is not a “physical” assumption 
but instead a merely “linguistic” one. It would be sufficient (for our results) to 
use a more literal formalization of the Light Axiom in Sec. 2.1. That such a
weaker version of AxPh is sufficient for our results is shown in Andréka et al.,
2002 and Madarász, 2002, p. 121.

In Sec. 2.1 we talked about photons bounced back from a mirror. When 
using AxPh, we will simulate this “bouncing back” by treating the out-going 
and the bounced-back photons as two different photons that have met at the 
mirror (see e.g. Fig. 11.8). 





The next axiom states that each observer can make thought-experiments in
which he assumes the existence of “slowly moving” observers. This is the
formalized version of postulate (iii)$^-$ in Sec. 2.1.

Axiom 5 (AxThEx) For each observer $m \in \mathrm{Ob}$, in each space-time location,
in each direction, with any speed smaller than that of the light it is possible to
“send out” an observer:

$$(\forall m \in \mathrm{Ob})(\forall p, q \in Q^n)\big[\,\mathrm{space}(p, q) < \mathrm{time}(p, q) \to (\exists k \in \mathrm{Ob})(W(m, k, p) \land W(m, k, q))\,\big].$$





In geometric terms this means that each line in the coordinate system with slant 
smaller than 1 is the worldline of a (potential) observer, in m’s worldview. 


The next axiom is the formalized version of postulate (iv) in Sec. 2.1. 


Axiom 6 (AxEvent) If an observer observes three bodies at the same space-time
location, then all other observers observe that these three bodies meet:

$$(\forall m, k \in \mathrm{Ob})(\forall b, b', b'' \in B)(\forall p \in Q^n)(\exists p' \in Q^n)\big[\,W(m, b, p) \land W(m, b', p) \land W(m, b'', p) \to (W(k, b, p') \land W(k, b', p') \land W(k, b'', p'))\,\big].$$


On AxEvent: In Sec. 2.1 we talked about “events”. E.g. “Rear sent light signal 
Ph1” was called an event, another event was that “Nose sent light signal Ph2”, 
and a third event was that “Middle, Ph1, and Ph2 meet”. In the axiom AxEvent 
above, we talk about “3-meetings”. We will reserve the word “event” for the set
of all bodies present at a space-time location. Let us call AxEvent* the axiom 
we obtain from AxEvent by replacing “3-meetings” with “events” in it, in this 
latter sense. We will see at the end of this subsection (cf. Theorem 11.2) that, 
in our approach, AxEvent is equivalent with this seemingly stronger axiom. 


$$\mathrm{Specrel}_0 := \{\text{AxField, AxSelf, AxLine, AxThEx, AxEvent, AxPh}\}.$$

$\mathrm{Specrel}_0$ is the formalized version of SRK = $\mathrm{NK}^-$ + Light Axiom introduced
in Sec. 2.1. Most of the interesting predictions of special relativity can be proved
(in the rigorous manner of first-order logic) from $\mathrm{Specrel}_0$. However, some
of the predictions have slightly more complicated forms because different
observers may use different “units of measurement”. The last axiom brings the
units of measurement of two observers to a common “platform”.

For an observer $m$ and space-time location $p \in Q^n$, $\mathrm{ev}_m(p)$ denotes the “full
event” happening in $m$’s coordinate system at $p$,

$$\mathrm{ev}_m(p) := \{b \in B : W(m, b, p)\}.$$
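The worldline and the full event are two “slices” of the same relation $W$: one fixes the observer and the body, the other fixes the observer and the location. The toy sketch below illustrates this on a small finite fragment of a world-view relation. All names and data in it (`W`, `wline`, `ev`, the bodies "m", "ph", "b") are hypothetical, and the fragment is of course not a model of the axioms, where worldlines are infinite sets.

```python
# A finite fragment of W, stored as (observer, body, location) triples.
W = {
    ("m", "m", (0, 0)),
    ("m", "m", (1, 0)),
    ("m", "ph", (0, 0)),
    ("m", "ph", (1, 1)),
    ("m", "b", (1, 1)),
}

def wline(W, m, b):
    """Locations where observer m observes body b (the worldline of b for m)."""
    return {p for (obs, body, p) in W if obs == m and body == b}

def ev(W, m, p):
    """Bodies that observer m observes at location p (the full event at p)."""
    return {body for (obs, body, loc) in W if obs == m and loc == p}

print(sorted(wline(W, "m", "ph")))  # [(0, 0), (1, 1)]
print(sorted(ev(W, "m", (1, 1))))   # ['b', 'ph']
```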


We call the next axiom the Axiom of Simultaneous Distance. 





Axiom 7 (AxSim) Any two observers agree on the spatial distance between
two events, if these two events are simultaneous for both of them:

$$(\forall m, k \in \mathrm{Ob})(\forall p, q, p', q' \in Q^n)\big[\,\mathrm{ev}_m(p) = \mathrm{ev}_k(p') \land \mathrm{ev}_m(q) = \mathrm{ev}_k(q') \land \mathrm{time}(p, q) = \mathrm{time}(p', q') = 0 \to \mathrm{space}(p, q) = \mathrm{space}(p', q')\,\big].$$

$$\mathrm{Specrel} := \mathrm{Specrel}_0 \cup \{\text{AxSim}\} = \{\text{AxField, AxSelf, AxLine, AxThEx, AxEvent, AxPh, AxSim}\}.$$


In Sec. 2.5 we will prove that Specrel is consistent, and hence the weaker
$\mathrm{Specrel}_0$ is also consistent. This will show that we have succeeded in eliminating
the contradiction from (NK + Light Axiom): there is no statement $A$ such
that from the new theory ($\mathrm{NK}^-$ + Light Axiom) we can derive both $A$ and its
negation $\neg A$. In the next subsection we will prove that $\mathrm{Specrel}_0$ implies (in
the rigorous manner of first-order logic) the negations of (v) and (iii), i.e. the
negations of “absolute time” and “all motion is possible”. In the next subsection
we will also begin to investigate what the world looks like assuming SRK, in
which ways it is different from our common-sense Newtonian world. Before
doing this, we show two simple properties of $\mathrm{Specrel}_0$.

An important theme will be to establish which things all the observers perceive
(“see”) the same way, and which things they perceive differently. The
things that they see the same way will be called “absolute”, the things that they
see differently will be called “relative”. Whence the name “relativity theory”.
First we show that all observers see the same “events” occur, and not only
the same 3-meetings.

Let AxEvent* denote the statement that if an observer observes an event,
then all other observers observe this event:

$$(\forall m, k \in \mathrm{Ob})(\forall p \in Q^n)(\exists p' \in Q^n)(\forall b \in B)\,[\,W(m, b, p) \leftrightarrow W(k, b, p')\,].$$





The symbol $\models$ denotes the semantic consequence relation of FOL. Before
discussing the details, we note that in the case of FOL, $\models$ coincides with
FOL-provability $\vdash$. If $\mathfrak{M}$ is a possible model and $\varphi$ is a FOL formula, then $\mathfrak{M} \models \varphi$
abbreviates the statement “formula $\varphi$ is valid in model $\mathfrak{M}$”. For a set Ax of
formulas, $\mathrm{Ax} \models \varphi$ means that for every possible model $\mathfrak{M}$, if $\mathfrak{M} \models \mathrm{Ax}$, then
$\mathfrak{M} \models \varphi$.





THEOREM 11.2 $\{\text{AxEvent, AxPh, AxField}\} \models \text{AxEvent*}$.


Proof Assume $\mathfrak{M} = \langle Q, \ldots, W \rangle \models \{\text{AxEvent, AxPh, AxField}\}$ and let
$m, k \in \mathrm{Ob}$, $p \in Q^n$. There are (at least) two distinct lines $\ell_1, \ell_2$ of slope 1
going through $p$, e.g. $\ell_1 = \{p + x \cdot (1, -1, 0, \ldots, 0) : x \in Q\}$ and
$\ell_2 = \{p + x \cdot (1, 1, 0, \ldots, 0) : x \in Q\}$ are such. (We used AxField here.) There are photons
$ph_1, ph_2$ “living on $\ell_1, \ell_2$” respectively, by AxPh. (I.e. $\ell_i = \mathrm{wline}_m(ph_i)$ for
$i = 1, 2$.) Let us consider the worldlines of these photons in $k$’s worldview.
By AxPh, these are straight lines of slope 1. We are going to show that they
intersect in a unique point.

We have that $\mathrm{wline}_m(ph_1) \ne \mathrm{wline}_m(ph_2)$. Let $q \in \ell_1$, $q \notin \ell_2$ and let $\ell_3$ be
the straight line of slope 1 going through $q$ and parallel with $\ell_2$. (I.e.
$\ell_3 = \{q + x \cdot (1, 1, 0, \ldots, 0) : x \in Q\}$.) Let $ph_3 \in \mathrm{Ph}$ be such that $\mathrm{wline}_m(ph_3) = \ell_3$.
Now, $m$ “sees” 3-meetings of $\{ph_1, ph_2, ph_2\}$, $\{ph_1, ph_3, ph_3\}$ but $m$ does
not see a 3-meeting of $\{ph_1, ph_2, ph_3\}$. By AxEvent, the same must hold for
$k$. Thus $\mathrm{wline}_k(ph_1)$ and $\mathrm{wline}_k(ph_2)$ must meet but must not coincide and
hence they intersect in a unique point.

Let $p'$ be their intersection point, i.e. $\{p'\} = \mathrm{wline}_k(ph_1) \cap \mathrm{wline}_k(ph_2)$.
Now, it is easy to show by using AxEvent again that $\mathrm{ev}_m(p) = \mathrm{ev}_k(p')$.
(Indeed, let $b \in B$ be arbitrary. Then $m$ sees a 3-meeting of $ph_1, ph_2, b$ iff
$b \in \mathrm{ev}_m(p)$, and the same for $m, p$ replaced with $k, p'$.) QED


THEOREM 11.3 No observer observes the same event at two different space-time
locations in models of {AxPh, AxField}.


Proof Let $p, q \in Q^n$, $p \ne q$. There is a straight line $\ell$ of slope 1 through $p$
which avoids $q$ (because through each point $p$ there are at least 2 distinct lines of
slope 1). By AxPh, $\ell$ is the worldline of a photon $ph \in \mathrm{Ph}$ (in $m$’s worldview).
Then $ph \in \mathrm{ev}_m(p)$ while $ph \notin \mathrm{ev}_m(q)$, showing that $\mathrm{ev}_m(p) \ne \mathrm{ev}_m(q)$. QED


Theorems 11.2 and 11.3 imply that in each observer’s worldview, the space- 
time locations and the events observed by any observer are in one-one corre- 
spondence. Thus, in a given observer’s worldview, we can speak of events as if 
they were space-time locations. E.g. we can quantify over events, meaning that 
we have in fact quantified over space-time locations. By the same token, we 
can apply any function defined on space-time locations to events. Specifically, 
let $\mathrm{loc}_m(e)$ denote the location of event $e$ in $m$’s worldview, then

$$\mathrm{loc}_m(e) = p \quad\text{iff}\quad \mathrm{ev}_m(p) = e.$$

By the time-distance between two events as seen by an observer we will mean the
time-distance between the space-time locations where the observer sees the two
events, and similarly for spatial distance. Formally, with $p = \mathrm{loc}_m(e)$,
$p' = \mathrm{loc}_m(e')$ we have

$$\mathrm{time}_m(e, e') := \mathrm{time}(p, p'), \qquad \mathrm{space}_m(e, e') := \mathrm{space}(p, p'),$$

$\mathrm{time}_m(e) := p_1$ denotes the time where $m$ sees event $e$ happen, and
$\mathrm{space}_m(e) := (0, p_2, \ldots, p_n)$ denotes the space-location where $m$ sees event $e$ happen.


2.4 Characteristic differences between Newtonian and 
special relativistic kinematics 


The most frequently quoted predictions of special relativity are the following
three paradigmatic effects: (1) moving clocks slow down, (2) moving meter-rods
shrink, and (3) moving pairs of clocks get out of synchronism. These three
effects are easily formulated in the first-order language introduced so far.

Figure 11.5. Moving clocks slow down and moving spaceships shrink.








Figure 11.6. Moving clocks get out of synchronism. 


Let $m$, $k$ be observers in a model of our language. By the direction of spatial
separation of two events $e$, $e'$ in $m$’s worldview we mean the natural thing, i.e.
we mean the straight line connecting the “spatial projections” $(0, p_2, \ldots, p_n)$
and $(0, q_2, \ldots, q_n)$ if $p$ and $q$ are the space-time locations $m$ sees $e$ and $e'$ at,
respectively (or the point $(0, p_2, \ldots, p_n)$ if these two points are the same). The
spatial direction of motion of a body $b$ in $m$’s worldview is the direction of spatial
separation of two distinct events in $\mathrm{wline}_m(b)$, whenever the latter is a straight
line. (In order to deal with the “degenerate” situations in the next theorem, we
say that a point is both parallel and orthogonal to a line or to another point.) We
say that $e$, $e'$ are simultaneous in $m$’s worldview iff $\mathrm{time}_m(e, e') = 0$. Let $v_m(k)$
denote the speed of $k$ as seen by $m$, i.e. $v_m(k)$ is the slope of the worldline of
$k$ in $m$’s worldview. We note that $\mathrm{Specrel}_0 \not\models (\forall m, k \in \mathrm{Ob})\ v_m(k) = v_k(m)$
while $\mathrm{Specrel} \models (\forall m, k \in \mathrm{Ob})\ v_m(k) = v_k(m)$ (see Corollary 11.13).


Theorem 11.4 below implies that Absolute Time (i.e. (v) of NK) is inconsistent
with SRK (i.e. with $\mathrm{NK}^-$ + Light Axiom), hence it was necessary to omit it
from NK. Theorem 11.4 says that simultaneity of events is not absolute. Actually,
it implies something more surprising, more exotic: the question of what
happened earlier and what later is not absolute either (see Corollary 11.5 after the
theorem). Fig. 11.7 illustrates the statements in Theorem 11.4. Theorem 11.4
is a more detailed version of Proposition 11.1.

Figure 11.7. Illustration for Theorem 11.4. Simultaneity of events is not absolute. Events
$e$, $e'$, $e''$ are simultaneous for $k$, but $e$, $e'$ are not simultaneous for $m$.


THEOREM 11.4 (SIMULTANEITY OF EVENTS IS NOT ABSOLUTE) Assume
$\mathrm{Specrel}_0$ and let $m$, $k$ be observers. Statements (i) and (ii) below hold.


(i) Assume that in $m$’s worldview the spatial separation of events $e$, $e'$ is parallel
with the direction of motion of $k$. Then

$e$, $e'$ are simultaneous in $k$’s worldview
iff
$\mathrm{time}_m(e, e') = v_m(k) \cdot \mathrm{space}_m(e, e')$.


(ii) Assume that $e$, $e''$ are simultaneous both in $k$’s worldview and in $m$’s worldview.
Then in $m$’s worldview the spatial separation of $e$, $e''$ is orthogonal
to the direction of motion of $k$.


Proof The proof of Theorem 11.4 follows the structure and ideas of the intuitive
proof of Proposition 11.1.
Proof of (i). Let $e$, $e'$ be simultaneous events in $k$’s worldview. Let
$p = \mathrm{loc}_k(e)$, $p' = \mathrm{loc}_k(e')$ and let $q = (1/2) \cdot (p + p')$, their “middle-point”.
Let $\ell_1, \ell_2, \ell_3$ be straight lines parallel with the time-axis and going through
$p, p', q$ respectively and let $m_1, m_2, m_3$ be observers with $\ell_i = \mathrm{wline}_k(m_i)$
($i = 1, 2, 3$). See Fig. 11.8.

Let $\delta := |p - q| = \sqrt{(p_1 - q_1)^2 + \cdots + (p_n - q_n)^2}$. This exists since the
field $\langle Q, +, \cdot, < \rangle$ is quadratic by AxField. By using $\delta$ now we can construct
straight lines of slope 1 connecting $\ell_1, \ell_2, \ell_3$ as follows. Let $u = (1, 0, \ldots, 0)$
(the “time unit-vector”), $p^1 = q + \delta \cdot u$, $p^2 = p + 2\delta \cdot u$, $p^3 = p' + 2\delta \cdot u$,
$p^4 = p^1 + 2\delta \cdot u$. The straight lines connecting the points $p\,p^1$, $p'\,p^1$, $p^2\,p^4$,
and $p^3\,p^4$ all have slope 1 by construction, hence by AxPh there are photons
$ph_1, \ldots, ph_4$ whose worldlines these are, respectively. Let $e_i = \mathrm{ev}_k(p^i)$ for
$i = 1, \ldots, 4$. See Fig. 11.8.

Figure 11.8. Illustration for the proof of Theorem 11.4(i).

We will think of the pattern constructed so far as representing the two thought-experiments
in Proposition 11.1. We will think of $k, m_1, m_2, m_3$ as the Spaceship,
Rear, Nose, and Middle respectively; $e$ is the event when Rear sent $ph_1$
towards Middle, $e'$ is the event when Nose sent $ph_2$ towards Middle, and $e_1$
is the event when these two reached Middle. The upper part of the arrangement
(events $e_1, \ldots, e_4$ and photons $ph_1, \ldots, ph_4$) represents the experiment
of Middle by which he tested that he indeed was standing in the middle (the
two photons $ph_3, ph_4$ sent towards Rear and Nose arrived back, after bouncing
back at the mirrors, at the same time in event $e_4$).

Switch now to the worldview of $m$! See Fig. 11.9. The worldlines of
$m_1, m_2, m_3, ph_1, \ldots, ph_4$, respectively, are all straight lines, the last four of
slope 1, by AxLine, AxPh. The meeting points of these lines are exactly as
those of the corresponding worldlines in $k$’s worldview, by AxEvent. The
worldlines of $m_1, m_2, m_3, k$ are parallel in $m$’s worldview because they are so
in $k$’s worldview. (In more detail, e.g. for $m_1, m_2$: their worldlines do not meet,
by AxEvent. Their worldlines are not skew, because one can construct photons
$ph_1, ph_2$ with meeting points $e, e', e_1, e_2, e_3$ as in Fig. 11.8, and this ensures that
they are in one plane (i.e. in the plane determined by $e, e', e_1$).) Now assume that
the spatial separation of $e, e'$ is parallel with the spatial direction of movement
of $k$, in $m$’s worldview. This means that there is a plane $P$ containing the whole
configuration (the worldlines of $m_1, \ldots, ph_4$), and it is vertical, i.e. it contains
a line parallel with the time-axis. The worldlines of $ph_1, ph_2$ are parallel with
those of $ph_3, ph_4$, respectively, in $m$’s worldview, because they are so in $k$’s
worldview (and because they are all in one plane). Thus the distance of events
$e_2$ and $e_1$ is the same as the distance of events $e_4$ and $e_3$ according to $m$, i.e. with
the notation $r_i = \mathrm{loc}_m(e_i)$ we have $|r_2 - r_1| = |r_4 - r_3|$. Similarly, since the
lines connecting $e_4, e_1$ and $e_3, e'$ are also parallel, we get $|r_1 - r'| = |r_3 - r_4|$
(where $r' = \mathrm{loc}_m(e')$). Thus $|r_2 - r_1| = |r_1 - r'|$. For this reason, the distance
between $e$ and $C$ in Fig. 11.9 is the same as the distance between $C$ and $E$,
i.e. $C$ is the middle-point of $e$ and $E$. Thus $m$ also sees that $m_3$ is positioned
exactly in the middle of $m_1$ and $m_2$.

Figure 11.9. Illustration for the proof of Theorem 11.4(i).

We now recall the intuitive chain of thought in the proof of Proposition 11.1
taking into account the quantitative aspects, too. We think of $m$ as Earth. Let
us measure time in “seconds”, and let us assume that the length of the ship as
Earth sees it is $2d$. Now, Earth sees that inside the spaceship the photon $ph_1$
covered distance $d$ with velocity $1 - v$, thus it took $t_1 = d/(1 - v)$ seconds
for $ph_1$ to reach the middle of the ship. Similarly $ph_2$ covered distance $d$
with velocity $1 + v$, so it took $t_2 = d/(1 + v)$ seconds for it to reach the
middle of the ship. Hence Nose had to send $ph_2$ exactly
$T = t_1 - t_2 = [d/(1 - v)] - [d/(1 + v)] = 2dv/(1 - v^2)$ seconds later in order that they meet
in the middle. Since the photon was sent $T$ seconds later, the ship covered
$T \cdot v$ distance during this time, thus the spatial distance of event $e$ (which is
sending out photon $ph_1$) and event $e'$ (which is sending out photon $ph_2$) is
$D = 2d + T \cdot v = ([2d(1 - v^2)] + [2dv^2])/(1 - v^2) = 2d/(1 - v^2)$.
Hence $T = v \cdot D$, as was to be shown. This computation can be faithfully
reconstructed in the settings of the present Theorem 11.4, see Fig. 11.9.
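The arithmetic of this paragraph can be checked with exact rational numbers. The sketch below is only an illustration under assumed sample values (half-length $d = 1$ and speed $v = 3/5$); the function name `emission_gap` is hypothetical.

```python
from fractions import Fraction

def emission_gap(d, v):
    """Return (T, D) for a ship of length 2d moving with speed v (0 <= v < 1).

    T is the delay between the two emission events as Earth sees them,
    D is the spatial distance between the two emission events.
    """
    t_rear = d / (1 - v)   # photon from the rear chases the middle
    t_nose = d / (1 + v)   # photon from the nose meets the approaching middle
    T = t_rear - t_nose
    D = 2 * d + T * v      # the ship moves T*v while the nose waits
    return T, D

d, v = Fraction(1), Fraction(3, 5)
T, D = emission_gap(d, v)
print(T, D)        # 15/8 25/8
print(T == v * D)  # True
```

With these values $T = 2dv/(1 - v^2) = 15/8$ and $D = 2d/(1 - v^2) = 25/8$, so $T = v \cdot D$ holds exactly.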

This proves the “only if” part in (i). The “if” part in (i) can be proved by
taking an event $e''$ in $k$’s worldview which is simultaneous with $e$ and which
takes place at the same place as $e'$, i.e. $\mathrm{space}_k(e'', e') = 0$; now we can use the
previously proven part for $e, e''$ and then use $\mathrm{time}_m(e'', e') \ne 0$.

The proof of (ii) is similar to that of (i), we include Fig. 11.10 for illustration. 

QED 


Remark. The first-order logic axiomatization of relativity theories we are
describing here is a very good place for applying Tarski’s first-order logic
axiomatization of Euclidean geometry. We try to illustrate this claim. In the
proof of Theorem 11.4 we used several geometrical properties of the Euclidean
geometry $G$ built on the field $\langle Q, +, \cdot \rangle$ in place of the reals. E.g. we used that
“for any two distinct points there is a unique (straight) line connecting them”,
or “through any point there is a unique line $\ell'$ parallel with $\ell$”; we used the
notions of planes, being parallel etc. The properties of $G$ we used in the proof
are easy to check directly by using the axioms of a quadratic ordered field and
the (analytic) definitions of a straight line etc. There is another way, though.
Instead of directly checking in the geometry $G$ the validity of each statement which
arises in the proof, we could use the axioms in a first-order logic axiomatization
of (synthetic) Euclidean geometry and derive everything from those axioms (or
just rely on the existing theorems and definitions of this area of research). We
recall that Hilbert, 1977 axiomatized Euclidean geometry over the reals by using
second-order logic axioms, and Tarski, 1959 gave a first-order logic axiom system
for this geometry. This also made it possible to replace the field of reals with
arbitrary fields and investigate what algebraic properties of the field correspond
to what geometrical properties. This subject, which is highly relevant in the
approach presented in this paper, is quite rich, see e.g. Tarski, 1959, Goldblatt,
1987, Schwabhäuser et al., 1983, Szczerba, 1970, Szmielew, 1974 and Aiello
and van Benthem, 2002. Ax, 1978, Goldblatt, 1987 and Mundy, 1986 make
use of Tarski’s axiom system for Euclidean geometry in their axiomatizations
of Special Relativity Theory.

Figure 11.10. Illustration for the proof of Theorem 11.4(ii). If $ph_1, ph_2, m_3$ meet, then the
spatial separation $s$ of $e, e'$ must be orthogonal to the spatial direction $d$ of movement. This
figure shows how $m$ “sees” the thought-experiment illustrated in Fig. 11.8, but conducted in a
spatial direction not necessarily parallel with the direction $d$ of motion of $k$.

Actually, Fig. 11.11 shows that by using the methods of axiomatic Euclidean 
geometry, the proof of Theorem 11.4 could be made simpler and more trans- 
parent. QED 


COROLLARY 11.5 (THE TEMPORAL ORDER OF EVENTS IS NOT
ABSOLUTE) Assume $\mathrm{Specrel}_0$. For all observers $m$, $k$ not at rest relative to
each other there are events $e$, $e'$ such that $e$ happens earlier than $e'$ according
to $m$ while $e$ happens later than $e'$ according to $k$. Formally:

$$\mathrm{Specrel}_0 \models (\forall m, k \in \mathrm{Ob})\big[\,v_m(k) \ne 0 \to (\exists e, e')[\,\mathrm{time}_m(e) < \mathrm{time}_m(e') \land \mathrm{time}_k(e) > \mathrm{time}_k(e')\,]\,\big].$$








Proof Assume that $v_m(k) \ne 0$. Let $e$, $e'$ be distinct events in the life of $k$
(i.e. $k \in e \cap e'$, $e \ne e'$) and assume $\mathrm{time}_m(e) < \mathrm{time}_m(e')$. If $\mathrm{time}_k(e) >
\mathrm{time}_k(e')$ then we are done. So assume $\mathrm{time}_k(e) < \mathrm{time}_k(e')$. Let $P$ be
the “plane of movement of $k$”, i.e. let $P$ be a plane parallel to the time-axis
and which contains $\mathrm{wline}_m(k)$. By Theorem 11.4(i), the events on $P$ which
are simultaneous according to $k$ with $e$ form a straight line $\ell$ which is not
“horizontal”, see Fig. 11.12. Therefore there is an event $e''$ on $\ell$ such that
$\mathrm{time}_m(e'') > \mathrm{time}_m(e')$. Now $\mathrm{time}_k(e'') = \mathrm{time}_k(e) < \mathrm{time}_k(e')$, and we
are done. QED

Figure 11.11. Simpler proof for Theorem 11.4(i) by using synthetic geometry: By using the
parallelograms $e_2 - e_1 - e_3 - e_4$ and $e_1 - e' - e_3 - e_4$ we get that the distance between $e_2$
and $e_1$ is the same as the distance between $e_1$ and $e'$. The triangles $e - e_2 - e_1$ and $e - e' - e_1$
are congruent since the two photon-lines are orthogonal to each other. Since the slope of the
light-line $e - e_1$ is 1, the angle between the time-axis $\bar{t}$ and $e - e_2$ is therefore the same as the
angle between the “space axis” and $e - e'$, yielding the desired result.


In the next theorem we formalize the three paradigmatic effects of SRK and 
prove them from Specrel. Figs. 11.13, 11.14 illustrate the statement of Theo- 
rem 11.6 in cartoon and in space-time diagram respectively, while Fig. 11.17 
at the end of the proof summarizes in one picture how two observers “see” 
each other’s coordinate systems. Fig. 11.18 gives a geometric illustration and 
explanation for the three paradigmatic effects. 


THEOREM 11.6 (THE THREE PARADIGMATIC EFFECTS) Assume Specrel
and $n \ge 3$, and let $m, k, k'$ be observers with $v := v_m(k)$, and $v_k(k') = 0$.
Assume that $k$’s spaceship, the rear and nose of which are marked by observers
$k, k'$, moves forwards (i.e. $\mathrm{wline}_m(k)$, $\mathrm{wline}_m(k')$ are contained in a
plane parallel with the time-axis and for some $e''$, $e^*$ with $k \in e''$, $k' \in e^*$
and $\mathrm{space}_m(e'', e^*) = 0$ we have $\mathrm{time}_m(e^*) < \mathrm{time}_m(e'')$), time flows
forwards for $k$ as seen by $m$ (i.e. $\mathrm{time}_m(\mathrm{ev}_k(\bar{0})) < \mathrm{time}_m(\mathrm{ev}_k(1_t))$). Assume
further that, according to $k$, the clocks in the ship are synchronized (i.e.
$\mathrm{time}_k(\mathrm{ev}_{k'}(\bar{0})) = 0$) and the length of the ship is $D$ (i.e. $|\mathrm{space}_k(e')| = D$,
see Fig. 11.14). Then (1)-(3) below hold.

Figure 11.12. Illustration for the proof of Corollary 11.5. In $m$’s worldview, $e$ happens earlier
than $e'$ and $e''$ happens later than $e'$. However, in $k$’s worldview $e$ is simultaneous with $e''$.


(1) (moving pairs of clocks get out of synchronism) According to $m$, the
clock-readings at the nose of $k$’s spaceship are $v \cdot D$ less than the simultaneous
readings at the rear of the ship. (The clocks in the nose are late relative
to those in the rear. See Fig. 11.13.) Formally:

$$(\forall e, e')\big[\,k \in e \land k' \in e' \land \mathrm{time}_m(e, e') = 0 \to \mathrm{time}_k(e') = \mathrm{time}_k(e) - (v \cdot D)\,\big].$$


(2) (moving clocks slow down (called “time-dilation”)) Any process that lasts $t$
seconds in the ship, lasts for $t/\sqrt{1 - v^2}$ seconds as seen by $m$. Formally:

$$(\forall e, e')\big[\,k \in e \cap e' \to \mathrm{time}_m(e, e') = \mathrm{time}_k(e, e')/\sqrt{1 - v^2}\,\big].$$


(3) (moving ships get shorter (called “length-contraction”)) According to $m$,
the length of $k$’s ship is only $D \cdot \sqrt{1 - v^2}$ (and not $D$ as $k$ states).
Formally:

$$(\forall e, e')\big[\,k \in e \land k' \in e' \land \mathrm{time}_m(e, e') = 0 \to \mathrm{space}_m(e, e') = D \cdot \sqrt{1 - v^2}\,\big].$$


Hence, moving spaceships become “squat”: They get shorter but their
width and height do not change by AxSim. So they get distorted. This
distortion effect remains true in weaker fragments of Specrel, e.g. in
$\mathrm{Specrel}_0$, and for arbitrary $n$, as shown in Andréka et al., 2002, Sec. 4.8,
esp. p. 653.

Figure 11.13. Illustration for Theorem 11.6. According to $m$, the length of the spaceship is $d$
km, it is 1 km wide and tall, and the clocks in the nose show $dv/\sqrt{1 - v^2}$ less time than those
in the rear. According to $k$, the length of the ship is $D = d/\sqrt{1 - v^2}$, it is 1 km wide and
tall, and the clocks in the nose and the ones in the rear all show the same time. In the picture
we chose $d = \sqrt{1 - v^2}$ and $D = 1$. Compare this picture with Fig. 11.3. As $v$ increases, the
spaceship becomes squat: it becomes shorter while its width and height remain the same. Cf.
also Figs. 11.5, 11.6.
[Labels in the figure: 1 km = 1 light-second; “my spaceship (in this picture) is 1 km long”; “it’s only $\sqrt{1 - v^2}$ km long”; panels “now ($m$)” and “1 second later ($m$)”.]

Figure 11.14. Illustration for Theorem 11.6, which states that $t = T/\sqrt{1 - v^2}$ and
$d = D \cdot \sqrt{1 - v^2}$. [Panels: $m$’s worldview; $k$’s worldview.]


Proof To prove Theorem 11.6, we will use two new thought-experiments 
analogous to the one in Proposition 11.1. Indication for the proof in geometrical 
flavor is in the caption of Fig. 11.18. Let m, k, k’, v be as in the hypothesis part 
of the theorem. 

The thought-experiment for proving time-dilation uses Einstein’s light-clock,
cf. Fig. 11.15. This light-clock consists of two mirrors and a photon which
bounces back and forth between the two mirrors. The two mirrors, $M_1$ and
$M_2$, are positioned at the rear of $k$’s spaceship so that their spatial separation is
orthogonal to the movement of the ship (as seen by $m$) and their spatial distance
is 1 light-second. Thus the trip of the photon $ph$ from one mirror $M_1$ to the other $M_2$
lasts for 1 second; one tick of the clock lasts 1 second as seen from the ship
$k$. Let $e$, $e'$ be the events when $ph$ leaves mirror $M_1$ and reaches mirror $M_2$,
respectively. According to $m$, the spatial distance between $e$ and $e'$ is not 1
(as seen from $k$’s ship) but $\sqrt{1 + x^2}$ where $x$ is the distance the second mirror
$M_2$ covers while the photon reaches it. If $t$ is the time elapsed between $e$ and
$e'$ as $m$ sees it, then $x = t \cdot v$, and thus $\mathrm{space}_m(e, e') = \sqrt{1 + x^2}$, hence
$t = \sqrt{1 + x^2}$ because the speed of $ph$ is 1 in $m$’s worldview, too. Now from
$t^2 = 1 + t^2 v^2$ we get $t = 1/\sqrt{1 - v^2}$ (which is greater than 1). We obtained
the desired rate of time-dilation.
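As a quick sanity check of the last step, one can solve $t^2 = 1 + t^2 v^2$ for $t^2$ and test it on a sample value. The sketch below assumes $v = 3/5$, chosen only because it makes $\sqrt{1 - v^2} = 4/5$ rational; the function name `tick_sq` is hypothetical.

```python
from fractions import Fraction

def tick_sq(v):
    """t^2 solved from t^2 = 1 + (t*v)^2, valid for |v| < 1."""
    return 1 / (1 - v * v)

v = Fraction(3, 5)
t = Fraction(5, 4)                 # candidate value 1/sqrt(1 - v^2)
assert t * t == tick_sq(v)         # t^2 = 25/16
assert t * t == 1 + (t * v) ** 2   # the light-clock equation itself
print(t)  # 5/4
```

So at $v = 3/5$ one tick of the moving light-clock lasts $5/4$ seconds for $m$.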

Figure 11.15. Illustration for the proof of Theorem 11.6(2) (time-dilation). Einstein’s light-clock
consists of two mirrors and a photon $ph$ bouncing between them. One tick lasts
$t = 1/\sqrt{1 - v^2}$ seconds in $m$’s worldview, if one tick lasts 1 second in the ship.

The thought-experiment for proving length-contraction (Theorem 11.6(3))
uses a so-called “two-dimensional light-clock”, which is the following. See
Fig. 11.16. There are two pairs of mirrors and two photons bouncing between
them. The first pair of mirrors $M_1$, $M_2$ and the photon $ph$ bouncing between
them is as in Einstein’s light-clock. The second pair of mirrors $M_3$, $M_4$ and
$ph'$ are like $M_1$, $M_2$, $ph$ with the difference that $M_3$, $M_4$ are separated in the
direction of movement of the ship. Thus if $ph$, $ph'$ leave mirrors $M_1$, $M_3$ in
the same event $e$ (we may assume that $M_3$ is positioned where $M_1$ is), then
after bouncing they will be back in the same event $e'$ again. The whole scene
in $m$’s worldview is as follows. Photon $ph$ behaves exactly as in Einstein’s
light-clock, so $t = \mathrm{time}_m(e, e') = 2/\sqrt{1 - v^2}$, as before. Let us see what
the “tick” made by $ph'$ looks like in $m$’s worldview. By the arguments in the
proof of Theorem 11.4(i), if $d = \mathrm{space}_m(e, e')$, then
$t = d/(1 - v) + d/(1 + v) = 2d/(1 - v^2)$. Hence $d = \sqrt{1 - v^2}$, the distance between mirrors
$M_3$, $M_4$ is $\sqrt{1 - v^2}$ (which is smaller than 1) as seen by $m$ and not 1 as seen
by $k$. We obtained the desired rate of length-contraction.

We get Theorem 11.6(1) by combining Theorem 11.4(i) and Theorem 11.6(2)
(cf. e.g. Fig. 11.17). QED
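The length-contraction step of the proof can be checked on a sample value in exact arithmetic. The sketch below assumes $v = 3/5$ (so that $\sqrt{1 - v^2} = 4/5$ is rational); the function name `mirror_distance` is hypothetical and only solves the round-trip equation $t = 2d/(1 - v^2)$ for $d$.

```python
from fractions import Fraction

def mirror_distance(v, t):
    """d solved from t = d/(1 - v) + d/(1 + v) = 2d/(1 - v^2), for |v| < 1."""
    return t * (1 - v * v) / 2

v = Fraction(3, 5)
t = 2 * Fraction(5, 4)                     # round trip: 2/sqrt(1 - v^2)
d = mirror_distance(v, t)
assert d / (1 - v) + d / (1 + v) == t      # the two legs add up to t
assert d * d == 1 - v * v                  # hence d = sqrt(1 - v^2)
print(d)  # 4/5
```

At $v = 3/5$ the mirrors that are 1 light-second apart for $k$ are only $4/5$ light-second apart for $m$.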


Theorem 11.7 below implies that the statement “Motion with every finite
speed is possible” (i.e. (iii) of NK) is inconsistent with Specrel₀. This justifies
the step of weakening (iii) to (iii)⁻ in NK⁻. Theorem 11.7 below also shows
that we do not have to postulate as an axiom that no observer can move faster
than light; as an axiom this would be difficult to motivate. Luckily, “no
faster-than-light observer” turns out to be a corollary of the well-motivated axioms in


Figure 11.16. Illustration for the proof of Theorem 11.6(3) (length-contraction). The
“two-dimensional” light-clock consists of two pairs of mirrors and two photons (ph, ph’) bouncing
between them. The two photons’ bouncing-time is the same in k’s worldview, thus it has to be
the same in m’s worldview, too.


Specrel₀. Putting it more succinctly: in our approach “no faster-than-light observer”
(No FTL for short) is a theorem and not an axiom.


THEOREM 11.7 (NO FASTER-THAN-LIGHT MOTION) Assume Specrel₀.


(i) No observer can move with the speed of light, i.e. $v_m(k) \neq 1$ for all
observers m, k.


(ii) Assume $n \geq 3$. Then $v_m(k) < 1$ for all observers m, k, i.e. if $n > 2$ then
no observer can move faster than light. If $n = 2$, then $v_m(k) > 1$ for
some observers m, k is possible.


For proof see e.g. Andréka et al., 1999, Proposition 1, Theorem 3, Madarász,
2002, 2.3.5, 2.8.25, 3.2.13, Madarász et al., 2004, Theorem 3, Theorem 5. A
proof can also be reconstructed from the proof of Theorem 11.11, p. 641. QED


Figure 11.17. Illustration for Theorem 11.6. The three photon-worldlines illustrate, in some
sense, the three thought-experiments for proving time-dilation, length-contraction, and getting
out of synchronism, respectively. This figure also illustrates Lorentz transformations (Def. 11.9).
For a conceptual analysis of the above No FTL theorem we refer to Sec. 2.7.
There we will address the question “why No FTL?”, e.g. which parts of Specrel₀
are responsible for No FTL, etc.

In Newtonian Kinematics, NK, spatial distance of events is not absolute (e.g. 
two events that took place in the dining car of a moving train at different times, 
took place at different places for someone not on the train), but the time elapsed 
between two events is the same for any two observers, moving relative to each 
other or not. Theorem 11.6(ii) says that in SRK the time elapsed between 
two events is not absolute, either. In this respect, SRK is a more symmetric 
theory than NK. But is there anything left that the observers see the same way? 
Curiously, a “mix” of time and space does remain absolute (as opposed to being 
relative like time and space are). 


THEOREM 11.8 (RELATIVISTIC DISTANCE) Assume Specrel and $n \geq 3$,
and let m, k be observers, $e, e'$ be events. Then


$\mathrm{time}_m(e,e')^2 - \mathrm{space}_m(e,e')^2 = \mathrm{time}_k(e,e')^2 - \mathrm{space}_k(e,e')^2$. QED


The above theorem is the starting point for building Minkowski geometry, 
which is the “geometrization” of SRK. It also indicates that time and space are 
intertwined in SRK. 

Let us denote the quantity that is the same for all observers as stated in 
Theorem 11.8 above by 


$\mu(p,q) := \mathrm{time}(p,q)^2 - \mathrm{space}(p,q)^2$.


The letter $\mu$ refers to Minkowski distance (also called relativistic distance). We
will see that every coordinate property that the observers observe the same way
about events (in a model of Specrel) can be defined from this relativistic distance
$\mu$ (Corollary 11.17). More importantly, the whole structure $\mathfrak{M}$ can be retrieved
from relativistic distance $\mu$ (provided we disregard irrelevant properties of
observers and photons like e.g. “there are several distinct photons on ph’s
worldline”). This means that we can re-define photons, observers, and even
the field-operations on quantities Q from knowing only relativistic distance $\mu$
(Theorem 11.16). This indicates that there is an “observer-independent reality”
which is behind the different worldviews of the observers. This
observer-independent reality is often called “objective” or absolute (as opposed to being
“subjective” or relative like relative motion). We will explore these ideas in
Sec. 2.6.
2.5 Explicit description of all models of Specrel, basic
logical investigations


An advantage of having axiomatic theories is that we can use the benefits of
the well-developed syntax-semantics duality of first-order logic. Namely, if we
want to check whether a formula $\varphi$ follows from Specrel, instead of making a
rigorous syntactic derivation, we can check whether in all models of Specrel the
formula $\varphi$ holds or not. Specifically, we can prove that $\varphi$ does not follow from
Specrel by exhibiting a model of Specrel in which $\varphi$ fails. In this subsection
we give an explicit description of all models of Specrel and Specrel₀. Based
on this, we will then give a sample of logical investigations such as consistency,
completeness, categoricity, decidability, and independence of axioms.

Theorems 11.4–11.7 in the previous subsection provide all the important
ingredients for describing the models of our theories Specrel₀ and Specrel.
The “heart” of this description is the description of the so-called worldview
transformations. Let m, k be observers. The worldview transformation $w_{mk}$
relates the worldview of m with that of k: it relates those space-time locations
where m and k observe the same events. I.e.


$w_{mk} := \{(p,q) \in Q^n \times Q^n : ev_m(p) = ev_k(q)\}$.


The worldview transformation is defined to be a binary relation on space-time
locations, but under very mild assumptions it is indeed a transformation of $Q^n$
and $ev_k = ev_m \circ w_{km}$, hence the name “worldview transformation”. (Here,
and later, $f \circ g$ denotes the composition of functions $f$ and $g$, i.e. $(f \circ g)(x) =
f(g(x))$.) In fact, the worldview transformation $w_{mk} : Q^n \to Q^n$ is the natural
coordinate-transformation between the coordinate systems of m and k. It shows
how the worldview of one observer m is distorted in the eye of another observer
k.
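The definition of the worldview transformation as a relation can be made concrete on a finite toy example. The sketch below is only an illustration under stated assumptions: events are represented by string labels, each observer’s event function is a Python dictionary on a small grid of coordinate points, and observer k’s coordinates are assumed to be those of m shifted by one unit in space. None of these names or data come from the text.

```python
def worldview_transformation(ev_m, ev_k):
    """Return w_mk = {(p, q) : ev_m(p) = ev_k(q)} as a set of pairs."""
    return {(p, q)
            for p, event_m in ev_m.items()
            for q, event_k in ev_k.items()
            if event_m == event_k}


def as_function(relation):
    """Turn a single-valued relation into a dict; raise if it is not a function."""
    result = {}
    for p, q in relation:
        if p in result and result[p] != q:
            raise ValueError("relation is not a function")
        result[p] = q
    return result


# Hypothetical data: m labels the event at (t, x) as "e<t><x>";
# k sees the same events at coordinates shifted by +1 in x.
ev_m = {(t, x): f"e{t}{x}" for t in range(3) for x in range(3)}
ev_k = {(t, x + 1): label for (t, x), label in ev_m.items()}

w_mk = as_function(worldview_transformation(ev_m, ev_k))
w_km = as_function(worldview_transformation(ev_k, ev_m))

# ev_k = ev_m o w_km, with (f o g)(x) = f(g(x)).
composition_holds = all(ev_k[q] == ev_m[w_km[q]] for q in ev_k)
```

In this toy setting `w_mk` sends `(0, 0)` to `(0, 1)`, and `composition_holds` confirms the identity $ev_k = ev_m \circ w_{km}$ on the grid.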

Theorems 11.4, 11.6 give quite a lot of information on the worldview
transformations in models of Specrel. They imply that $w_{mk}$ is a Lorentz transformation
as defined below, up to a suitable choice of coordinate directions.

It will be convenient to use the so-called unit-vectors. Let $1 \leq i \leq n$. The
i-th unit-vector is


$1_i := (0, \dots, 0, 1, 0, \dots, 0)$ where the 1 stands in the i-th place.


We will also use the names $1_t, 1_x, 1_y, 1_z$ for the first four unit-vectors. From
now on we fix a quadratic ordered field $\mathfrak{Q} = \langle Q, +, \cdot, < \rangle$.


DEFINITION 11.9 (LORENTZ TRANSFORMATION) Let $-1 < v < 1$, $v \in Q$.
By the Lorentz transformation (or boost) of velocity $v$ and over Q we
understand a linear mapping $f : Q^n \to Q^n$ for which


$f(1_t) = (1/\sqrt{1 - v^2},\ v/\sqrt{1 - v^2},\ 0, \dots, 0)$,
$f(1_x) = (v/\sqrt{1 - v^2},\ 1/\sqrt{1 - v^2},\ 0, \dots, 0)$, and
$f(1_i) = 1_i$ for all $3 \leq i \leq n$.


Fig. 11.17 illustrates Lorentz transformations. A Lorentz transformation as a
coordinate transformation usually is written as


$t' = (t - vx)/\sqrt{1 - v^2}$, $x' = (x - vt)/\sqrt{1 - v^2}$, $y' = y$, $z' = z$.


The usual Newtonian (or Galilean) coordinate transformation is $t' = t$, $x' =
x - vt$, $y' = y$, $z' = z$. Comparing the two transformations reveals that in SRK
time and space are treated in a symmetric way, while in NK they are treated
differently. In the formula for Lorentz transformations, the divisors $\sqrt{1 - v^2}$
represent time-dilation and length-contraction, while “$t - vx$” in place of “$t$”
in the first part represents “getting out of synchronism”.
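As a short worked check (added here for illustration), the coordinate form $t' = (t - vx)/\sqrt{1 - v^2}$, $x' = (x - vt)/\sqrt{1 - v^2}$ of the Lorentz transformation leaves the quantity $t^2 - x^2$ unchanged, which is the two-dimensional core of Theorem 11.8:

```latex
\begin{align*}
t'^2 - x'^2
  &= \frac{(t - vx)^2 - (x - vt)^2}{1 - v^2} \\
  &= \frac{t^2 - 2vtx + v^2x^2 - x^2 + 2vtx - v^2t^2}{1 - v^2} \\
  &= \frac{(1 - v^2)\,t^2 - (1 - v^2)\,x^2}{1 - v^2}
   = t^2 - x^2 .
\end{align*}
```

The same computation goes through with $-v$ in place of $v$. For the Galilean transformation, by contrast, $t'^2 - x'^2 = t^2 - (x - vt)^2$, which differs from $t^2 - x^2$ in general.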

By a space-isometry (over Q) we understand a Euclidean isometry (i.e. a
mapping that preserves Euclidean distance between space-time locations) which
takes the time-axis to a line parallel to the time-axis. These are affine mappings,
i.e. linear mappings composed with translations.


THEOREM 11.10 (DESCRIPTION OF WORLDVIEW TRANSFORMATIONS
OF Specrel) Assume $n \geq 3$, let $\mathfrak{Q} = \langle Q, +, \cdot, < \rangle$ be a quadratic ordered
field and let $f : Q^n \to Q^n$. The following are equivalent.


(i) $f$ is a worldview transformation in a model of Specrel with field-reduct $\mathfrak{Q}$.


(ii) $f = \sigma \circ \lambda \circ \sigma'$ for some Lorentz transformation $\lambda$ and space-isometries
$\sigma, \sigma'$ (over Q).


(iii) $f$ is a bijection and preserves relativistic distance, i.e.


$(\forall p, q \in Q^n)\ \mu(p,q) = \mu(f(p), f(q))$.


On the proof (i)⇒(ii): By Theorem 11.4(i) we know that $w_{mk}$ is like a Lorentz
transformation on the “plane of motion”, i.e. on the vertical plane containing
$\mathrm{wline}_m(k)$, and it takes the subspace of $Q^n$ orthogonal to this plane to itself,
by Theorem 11.4(ii). AxSim then states that $w_{mk}$ is a Euclidean isometry on
this orthogonal subspace. The proof of (i)⇒(ii) from here on is not difficult.
(ii)⇒(iii): A possibility for proving this is that one checks by a computation
that both space-isometries and Lorentz transformations preserve relativistic
distance. However, we would like to provide more insight here concerning
(ii)⇒(iii). Namely, showing that Lorentz transformations preserve lines of slope
1 (i.e. that they preserve $\mu(p,q) = 0$) is the most important step in proving that
Specrel is consistent. Fig. 11.18 illustrates a non-computational, geometric
proof for this crucial part of the proof. (iii)⇒(i): If $f$ preserves $\mu$, then $f$
preserves lines of slope 1, preserves lines of slope < 1, and also “it satisfies
AxSim”. The rest follows from the construction we give after Theorem 11.11.
QED
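The computational route to (ii)⇒(iii) mentioned in the proof can be spot-checked on a concrete instance. The sketch below is an illustration under assumptions, not a proof: it works over the rationals with exact `Fraction` arithmetic, takes $n = 4$, picks the velocity $v = 3/5$ (so that $\sqrt{1 - v^2} = 4/5$ is rational), and uses one particular space-isometry (a rotation in the x-y plane followed by a translation). All function names are hypothetical.

```python
from fractions import Fraction as F
from itertools import product


def mu(p, q):
    """Squared Minkowski distance time^2 - space^2 (first coordinate is time)."""
    d = [a - b for a, b in zip(p, q)]
    return d[0] ** 2 - sum(c ** 2 for c in d[1:])


def boost(p, v=F(3, 5), root=F(4, 5)):
    """Lorentz boost of Definition 11.9; `root` must equal sqrt(1 - v^2)."""
    assert root * root == 1 - v * v
    t, x, *rest = p
    return ((t + v * x) / root, (v * t + x) / root, *rest)


def space_isometry(p, c=F(3, 5), s=F(4, 5), shift=(F(1), F(2), F(0), F(-1))):
    """Rotation in the x-y plane, then a translation; keeps the time direction."""
    assert c * c + s * s == 1
    t, x, y, *rest = p
    rotated = (t, c * x - s * y, s * x + c * y, *rest)
    return tuple(a + b for a, b in zip(rotated, shift))


def f(p):
    """A map of the form sigma o lambda o sigma' (here with sigma = sigma')."""
    return space_isometry(boost(space_isometry(p)))


# Sample grid of 81 rational points in Q^4.
points = [tuple(F(c) for c in pt) for pt in product((0, 1, 2), repeat=4)]
preserved = all(mu(p, q) == mu(f(p), f(q)) for p in points for q in points)
```

On this sample `preserved` evaluates to `True`; in particular pairs with $\mu(p,q) = 0$ (lines of slope 1) are mapped to pairs with $\mu = 0$.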


By a space-dilation (over Q) we understand a Euclidean dilation (i.e. a
mapping that “dilates” Euclidean distances between space-time locations with
a given ratio $r \in Q$) which takes the time-axis to a line parallel to the time-axis.
These are affine mappings. By a field-automorphism-induced mapping over Q
we understand the natural extension of an automorphism of $\mathfrak{Q}$ to $Q^n$. These
are not necessarily affine mappings, but they are collineations, i.e. they take
straight lines to straight lines.


THEOREM 11.11 (DESCRIPTION OF WORLDVIEW TRANSFORMATIONS
OF Specrel₀) Assume $n \geq 3$, let $\langle Q, +, \cdot, < \rangle$ be a quadratic ordered field
and let $f : Q^n \to Q^n$. The following are equivalent.


(i) $f$ is a worldview transformation in a model of Specrel₀.


(ii) $f = \delta \circ \lambda \circ \delta' \circ \alpha$ for some Lorentz transformation $\lambda$, space-dilations $\delta, \delta'$,
and field-automorphism-induced mapping $\alpha$.


(iii) $f$ is a bijection and preserves relativistic distance 0, i.e.


$(\forall p, q \in Q^n)[\mu(p,q) = 0 \text{ iff } \mu(f(p), f(q)) = 0]$.


On the proof (ii)⇔(iii) is a variant of the Alexandrov-Zeeman theorem, see
e.g. Goldblatt, 1987, App. 2. (i)⇒(iii) follows from AxPh, and (ii)⇒(i) follows
from the construction we give soon, because (ii) implies that $f$ preserves lines
of slope 1 and lines of slope < 1. QED


Remark: We could have proved Theorems 11.4—11.7 by first deriving the 
properties of the worldview transformations as in Theorems 11.10, 11.11, and 
then deriving the paradigmatic effects (i)—(iii) from their properties. We think 
that deriving the predictions of relativity theory directly from the axioms is 
more illuminating. For the student, Lorentz transformations appear as non- 
observation oriented theoretical constructions not explaining why we are doing 
what we are doing. In our approach, stating the axioms in Specrel and then 
deriving the three paradigmatic effects motivate the introduction of Lorentz 
transformations. In some sense, Specrel can be considered as an “implicit 
definition” of the Lorentz transformations. 


We now turn to an exhaustive description of all models of Specrel₀ and
Specrel.
Figure 11.18. The Light Axiom states that if we switch on a light source for a moment, we will
observe a light-sphere expanding away from us with the speed of light, and that we are all the
time in the center of this light-sphere. Assume that observers m, k are present in the event of
switching on the light source and that they are moving relative to each other. Then both observers
m and k have to observe that they are in the center of the photon-sphere! How is this possible?
This figure illustrates how. Let $x$ be in the direction of movement of k in m’s worldview,
and let $y$ be any spatial direction orthogonal to $x$. The expanding light-sphere in space-time
when concentrating on the 3-dimensional subspace determined by $t, x, y$ is a cone. k’s plane of
simultaneity is tilted just so that k is in the center of the ellipse that is the intersection of this
plane with the light-cone. For this, the “long axis” of the ellipse is tilted just the amount that it is
symmetric to the worldline of k (w.r.t. a photon-line as in Fig. 11.17), and the “small axis” of the
ellipse is parallel with $y$. This implies “k’s clocks getting out of synchronism” (Theorem 11.4).
The time-unit $1_t$ of k is on k’s worldline exactly so “high” that the length of the “short axis” of
the ellipse is 1, when AxSim is true (and arbitrary otherwise). This implies that k’s time flows
slowly as seen by m when AxSim is true. (Paradigmatic effects in Theorem 11.6!) The other
space-unit $1_x$ of k is chosen so that the length of the “long axis” of the ellipse counts as 1, too.
With this choice of the units of measurement, k sees the ellipse as a circle, hence k thinks that
he is in the center of the light-cone. There is enough room for everyone in the center of the
expanding light-sphere!
Given an arbitrary quadratic ordered field $\mathfrak{Q}$, let PLines and TLines
denote the set of all straight lines in $Q^n$ of slope 1 and of slope < 1,
respectively. Let WT denote the set of all transformations of $Q^n$ which preserve both
PLines and TLines. Then WT forms a group, and Theorem 11.11 describes the
members of WT.

Consider $\mathfrak{M} = \langle \mathfrak{Q}; B, Ob, Ph; W \rangle \models$ Specrel₀ in which $Ob \neq \emptyset$. Then
Ph and Ob are disjoint, by Theorem 11.7(i). We let


$B_1 := B \setminus (Ph \cup Ob)$.


In the discussion below we will see that (after having chosen $\mathfrak{Q}$), the “heart”
of a Specrel₀ model is a subgroup $WT_0 \subseteq WT$ such that $\{f^{-1}[\bar{t}\,] : f \in
WT_0\} = TLines$. Here $f[H] := \{f(a) : a \in H\}$ for any function $f$ and subset
$H$ of the domain Dom($f$) of $f$, as usual. Having chosen the heart $WT_0$ of our
model, we still have to decorate it with “observer names” (Ob), “photon names”
(Ph), and with extra bodies $B_1$ not necessarily in $Ob \cup Ph$. This decorating
or labelling gives rise to an extra plurality of (nonisomorphic) possible models
for Specrel₀ in addition to the possible choices of $WT_0$. (These choices will be
formally specified in items (i)-(v) below.) Below we present the details giving
precise meanings to what we understand by the above, e.g. by “labelling”,
“heart” etc. The reader not interested in the details might skim over them just
to have an impression and continue serious reading with Corollary 11.12.

Let us return to our $\mathfrak{M} = \langle \mathfrak{Q}; B, Ob, Ph; W \rangle \models$ Specrel₀. For any
observer $m \in Ob$ let us define the “worldview of m” as a structure


$\mathfrak{W}_m := \langle Q^n, \mathrm{wline}_m(b) : b \in B \rangle$


where $Q^n$ is the carrier set and $\mathrm{wline}_m(b)$ is a one-place relation (or unary
predicate) with relation symbol $b$ (denoting this subset of $Q^n$) for each $b \in
B$ (recall that $\mathrm{wline}_m(b) \subseteq Q^n$, and cf. Fig. 11.1 on p.611). The set of all
worldviews contains exactly the information content of $W \cap (Ob \times B \times Q^n)$.
Let $W_1 := W \setminus (Ob \times B \times Q^n)$. When we want to define a model of Specrel₀,
we have to define the 6-place relation $W$. Instead of defining $W$ directly, often
it is easier to define the set of worldviews along with $W_1$.

All these worldviews are isomorphic with each other; in fact, the worldview
transformations are isomorphisms between the worldviews, i.e. $w_{mk} : \mathfrak{W}_m \to
\mathfrak{W}_k$ is an isomorphism for any $m, k \in Ob$, by the definition of $w_{mk}$. What do
the worldviews $\mathfrak{W}_m$ look like? The carrier set is $Q^n$; the photons are distributed
surjectively on the PLines by AxPh; the observers are distributed surjectively
on the TLines by AxLine, AxThEx, Theorem 11.7; $\mathrm{wline}_m(m) = \bar{t}$ by AxSelf.
Instead of specifying all the worldviews one-by-one, we can specify one
“Platonic” (or generic) worldview


$\mathfrak{P} := \langle Q^n, \pi(b) : b \in B \rangle$, where

$\pi \restriction Ph : Ph \to PLines$ is surjective,

$\pi \restriction Ob : Ob \to TLines$ is surjective, and

$\pi \restriction B_1 : B_1 \to \{Y : Y \subseteq Q^n\}$,

and then for each observer $m \in Ob$ we can specify an element of WT which
describes how m realizes this Platonic worldview, i.e. we specify a function

$w : Ob \to WT$ such that $w(m)[\pi(m)] = \bar{t}$;

and then we define $\mathfrak{W}_m$ as the image of $\mathfrak{P}$ by the function $w(m)$. The information
content of $\pi \restriction Ob : Ob \to TLines$ can be recovered from this last function $w$
(by $\pi(m) = w(m)^{-1}[\bar{t}\,]$), so we may skip specifying $\pi \restriction Ob$.

With the above intuition in mind, we can construct a model of Specrel₀ by
specifying a quintuple $\langle \mathfrak{Q}, w, \pi, \beta, W_1 \rangle$ with the following properties:

(i) $\mathfrak{Q}$ is a quadratic ordered field,

(ii) $w, \pi, \beta$ are functions with disjoint domains $Ob$, $Ph$, $B_1$ respectively,

(iii) $w : Ob \to WT$ is such that $\{w(m)^{-1}[\bar{t}\,] : m \in Ob\} = TLines$,

(iv) $\pi : Ph \to PLines$ is surjective,

(v) $\beta : B_1 \to \{Y : Y \subseteq Q^n\}$, and $W_1 \subseteq (B_1 \cup Ph) \times (B_1 \cup Ph \cup Ob) \times Q^n$.


Let us call a quintuple satisfying the above conditions a pre-model. From any
pre-model $\langle \mathfrak{Q}, w, \pi, \beta, W_1 \rangle$ we can construct a model of Specrel₀ by defining


$\mathfrak{M}(\mathfrak{Q}, w, \pi, \beta, W_1) := \langle \mathfrak{Q}; B, Ob, Ph; W \rangle$ where


$Ob := \mathrm{Dom}(w)$, $Ph := \mathrm{Dom}(\pi)$, $B := \mathrm{Dom}(\beta) \cup Ob \cup Ph$; and $W$
is defined the natural way


$W := \{(m, b, p) : p \in w(m)[\beta(b)],\ m \in Ob\} \cup \{(m, ph, p) : p \in w(m)[\pi(ph)],\
m \in Ob\} \cup \{(m, k, p) : p \in w(m)[w(k)^{-1}[\bar{t}\,]],\ m \in Ob\} \cup W_1$.


It can be checked that all models constructed from pre-models are models of
Specrel₀; and conversely, all models of Specrel₀ with nonempty observer-part
arise this way from pre-models.

We described the models of Specrel₀ with $Ob \neq \emptyset$. The description when
$Ob = \emptyset$ is easy. The description of the models of Specrel is exactly like above
with the only change that in place of WT we use its subset WT* characterized
in Theorem 11.10.


COROLLARY 11.12 (CONSISTENCY) Specrel is a consistent theory, i.e. for
no formula $\varphi$ can both $\varphi$ and its negation $\neg\varphi$ be derived from Specrel.
Moreover, Specrel + ($Ob \neq \emptyset$) is also consistent.
Proof To construct a model of Specrel + ($Ob \neq \emptyset$) we have to show that
there exists at least one pre-model $\langle \mathfrak{Q}, w, \pi, \beta, W_1 \rangle$ with $w : Ob \to$ WT*. Of
the conditions (i)-(v) in the definition of a pre-model, only condition (iii) is not
trivial to satisfy. However, Theorem 11.10(ii) shows that for all $\ell \in TLines$
there is $w \in$ WT* which takes $\ell$ to $\bar{t}$, and we are done. QED


Corollary 11.12 above completes the justification of the move NK ↦ NK⁻. It
shows that by this move, we indeed got rid of all contradictions between NK
and the Light Axiom.

Having a description of all models of Specrel and Specrel₀ at hand makes
it easy to see which statements follow from Specrel and Specrel₀ and which
do not. As an example we include the following.


COROLLARY 11.13 Let $n \geq 3$. Specrel $\models (\forall m, k \in Ob)\ v_m(k) = v_k(m)$
while Specrel₀ $\not\models (\forall m, k \in Ob)\ v_m(k) = v_k(m)$.


Hint for proof Field-automorphism-induced mappings $\alpha$ can occur in
worldview-transformations in models of Specrel₀, but not in models of Specrel.
QED


THEOREM 11.14 (INDEPENDENCE OF THE AXIOMS) Assume $n \geq 3$.
(i) (Specrel₀ − {AxLine, AxThEx}) $\models$ AxLine.


(ii) Specrel − {AxLine} is an independent axiom system, i.e. (Specrel −
{Ax}) $\not\models$ Ax for any element Ax in Specrel different from AxLine.


(iii) Every model of Specrel₀ − {AxThEx} can be extended to a model of
Specrel₀. Hence if formula $\eta$ is universally quantified in the sort B and
Specrel₀ $\models \eta$, then (Specrel₀ − {AxLine, AxThEx}) $\models \eta$. The same
holds for Specrel in place of Specrel₀. QED


By Theorem 11.14(ii) above, all our paradigmatic effects can be proved
in the more economical fragment Specrel − {AxLine, AxThEx} of Specrel.
(This is so because the paradigmatic effects can be reformulated as sentences
universally quantified in sort B, by using Theorem 11.2.) On the other hand,
Theorems 11.4, 11.6 do not hold if we omit any one of the axioms of Specrel₀ −
{AxLine, AxThEx}, and Theorem 11.6 does not hold if we omit AxSim. Such
investigations of economy, asking which axioms are needed for proving which
theorem, are called “reverse relativity theory”, motivated by the highly successful
branch of mathematics called “reverse mathematics”, and are pursued in Andréka
et al., 2002. We will return to this important subject in Sec. 2.7.

Specrel has many non-elementarily equivalent models over any quadratic
ordered field. We show that Specrel can be extended to a theory Specrel ∪ Comp
which is categorical over any quadratic ordered field and which can be extended
further to a complete and decidable theory; Specrel can also be extended to a
hereditarily undecidable theory. Both extensions are natural. (Cf. Theorem 11.15
below.)

Recall that from any pre-model $\langle \mathfrak{Q}, w, \pi, \beta, W_1 \rangle$ we can construct a model
of Specrel₀. The most natural pre-models for Specrel are $\langle \mathfrak{Q}, \mathrm{Id} \restriction$ WT*$, \mathrm{Id} \restriction
PLines, \emptyset, \emptyset \rangle$. We will call the models constructed from these standard models
for Specrel. Thus, in the standard models we include all the possible kinds of
observers, but otherwise we are as “economic” as possible. There is exactly
one standard model of dimension n over any quadratic ordered field $\mathfrak{Q}$. We are
going to give a complete axiom system for these standard models.


AxCoord $(\forall m \in Ob)(\forall$ space-isometry $S$ of $Q^n)(\exists k \in Ob)\ w_{mk} = S$.
AxExt^Ob $(\forall m, k \in Ob)(w_{mk} = \mathrm{Id} \to m = k)$.
AxExt^Ph $(\forall ph, ph' \in Ph)(\forall m \in Ob)(\mathrm{wline}_m(ph) = \mathrm{wline}_m(ph') \to ph = ph')$.
AxNobody $B = Ob \cup Ph$ and $W \subseteq Ob \times B \times Q^n$.
Comp := {AxCoord, AxExt^Ob, AxExt^Ph, AxNobody}.





The above are natural axioms which hold in all standard models of Specrel. 
AxCoord expresses that each observer can “re-coordinatize” his worldview 
with a space-isometry. There is a quantifier ranging over space-isometries in 
this formula. Nevertheless, this axiom can be expressed with a first-order logic 
formula because space-isometries are affine mappings and hence can be “coded” 
with the images of the n unit-vectors 1;. The next two axioms in Comp say, 
intuitively, that of each kind of observers and photons we have only one copy 
(or, in other words, according to Leibniz’s principle, if we cannot distinguish 
two observers or photons with some observable properties expressible in our 
language, then we treat them as equal). Hence we call them extensionality
principles, whence the abbreviation AxExt. The example of AxExt^Ph reveals
that here we consider only space-time-oriented properties of photons, hence
two photons of different “color” but same worldline are not distinguished in
the theory Comp. AxExt^Ob also expresses that we really identify observers
with coordinate systems. These axioms are natural to assume; we did not
include them in Specrel because these “simplifying axioms” are not needed
for proving the theorems. The last axiom in Comp says that every body is
an inertial observer or photon. This is a real restriction that we usually do
not want to make when we use AxLine. The main reason is that we can treat
accelerated observers in Specrel if we do not make this restriction AxNobody,
see Sec. 3.1. Treating accelerated observers in Specrel is important for the
transition from special relativity theory to general relativity theory, as we shall
see. AxNobody excludes accelerated bodies. So we do “pay a physical price”
for assuming AxNobody.
By a theory we understand an arbitrary set of first-order logic formulas (i.e.
we will not assume that a theory contains all its semantical consequences).
We call a theory Th decidable (or undecidable respectively) if the set of all
first-order logic semantical consequences of Th is decidable (or undecidable
respectively). We call Th complete if it implies either $\varphi$ or $\neg\varphi$ for each
first-order logic formula $\varphi$ without free variables (of its language). It is known that
the theory of quadratic ordered fields is undecidable. A quadratic ordered field
is called real-closed if every polynomial of odd degree has zero as a value. This
last requirement can be expressed with the infinite set $RC := \{\varphi_{2n+1} : n \in \omega\}$
of first-order logic formulas, where $\varphi_n$ denotes the following sentence








$\forall x_0 \dots \forall x_n \exists y\,(x_n \neq 0 \to x_0 + x_1 \cdot y + \dots + x_n \cdot y^n = 0)$.


Tarski proved that the theory of real-closed fields is complete and decidable (cf., 
e.g. Hodges, 1993, Theorem 2.7.2, p. 67, p. 92). The above suggests that if we 
want to obtain interesting and relevant decidability-theoretic results, then we 
have to concentrate on real-closed fields; or at least include a decidable theory 
of field-axioms into our theories. 
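The content of the sentences in RC can be illustrated over the real numbers, which form a real-closed field. The sketch below is only a floating-point illustration under stated assumptions, not a decision procedure: the function name `odd_degree_root` is hypothetical, the coefficient list plays the role of $x_0, \dots, x_n$, and bisection inside a standard Cauchy root bound is used to locate a value $y$ at which the polynomial is (approximately) zero.

```python
def odd_degree_root(coeffs, iterations=200):
    """Approximate a real zero of x_0 + x_1*y + ... + x_n*y^n for odd n.

    coeffs[i] is the coefficient x_i; the leading coefficient must be nonzero.
    """
    n = len(coeffs) - 1
    if n % 2 == 0 or coeffs[-1] == 0:
        raise ValueError("need odd degree and a nonzero leading coefficient")

    def poly(y):
        return sum(c * y ** i for i, c in enumerate(coeffs))

    # Cauchy bound: every real zero lies strictly inside (-bound, bound),
    # so the polynomial has opposite signs at the two end points.
    bound = 1 + max(abs(c) for c in coeffs[:-1]) / abs(coeffs[-1])
    lo, hi = -bound, bound
    if poly(lo) > 0:
        lo, hi = hi, lo  # arrange poly(lo) <= 0 < poly(hi)
    for _ in range(iterations):
        mid = (lo + hi) / 2
        if poly(mid) <= 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2
```

For example, `odd_degree_root([-2, 0, 0, 1])` approximates the cube root of 2, a witness for $y$ in $\varphi_3$ with $x_0 = -2$, $x_1 = x_2 = 0$, $x_3 = 1$.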


THEOREM 11.15 Let $n \geq 3$.


(i) The models of Specrel ∪ Comp are exactly the models isomorphic to
standard ones.


(ii) Specrel ∪ Comp ∪ TF is complete and decidable, for any complete,
decidable theory TF of quadratic ordered fields.


(iii) Specrel ∪ (Comp − {Ax}) ∪ RC can be extended to a hereditarily
undecidable theory Th for any Ax ∈ Comp, in the sense that no consistent
extension of Th is decidable.


For proof of Theorem 11.15 and for related results we refer to Andréka et al., 
1999, Sec. 7, Andréka et al., 2004, Sec. 7. 


In the standard models of Specrel there is no orientation for time: “reversing
time” is an automorphism of these models. In relativity theory, both special
and general, we sometimes use the fact that “time has a direction”, i.e. that Ax↑
below is assumed.


Ax↑ $(\forall m, k \in Ob)(\forall p, q \in \bar{t}\,)[(v_m(k) < 1 \wedge p_t < q_t) \to w_{km}(p)_t <
w_{km}(q)_t]$.


Axiom Ax↑ expresses that every observer sees the time of another slowly moving
observer “flow forwards”, i.e. m sees k’s clocks ticking “forwards” and not
“backwards”. All our theorems so far are true for Specrel + Ax↑ in place of
Specrel with minor modifications.
2.6 Observer-independent geometries in relativity theory;
duality and definability theory of logic


According to the approach taken so far, each observer observes the world
through the “looking-glass” or “spectacles” of his own coordinate system. The
question comes up: Is there an observer-independent, “absolute” reality which
the individual observers observe through their respective coordinate systems,
or is the set of worldviews of the different observers just an ad-hoc collection
of subjective personal views? (The philosophy of subjective idealism contra
the assumption of the existence of an objective external world.) We will see
that relativistic space-time (or relativistic geometry) provides such an
observer-independent reality. Namely, in the present subsection we show that the
observers (and their coordinate systems) can be defined from relativistic distance
$\mu$ as defined at the end of Sec. 2.5, in models of Catrel := Specrel ∪ Comp.
If we regard the worldviews of the various observers as “subjective” (in some
sense), then $\mu$ is “objective” in the sense that $\mu$ is the same for all observers, in
Specrel. Thus relativistic distance $\mu$ provides such an observer-independent,
absolute reality. This statement will be made more tangible in the definition
of the observer-independent geometry $Mg(\mathfrak{M})$ associated to Specrel models
$\mathfrak{M}$ below.

We already saw that in models of Specrel₀, all observers observe the same
events. Let Events denote the set of events observed by some (or equivalently
by each) observer,


$Events := \{ev_m(p) : m \in Ob,\ p \in Q^n\}$.


In models of Specrel, we can define relativistic distance $\mu$ of events, by
Theorem 11.8, as


$\mu(e, e') := \mu(\mathrm{loc}_m(e), \mathrm{loc}_m(e'))$, for any observer $m \in Ob$.


Relativistic distance of events is a function $\mu : Events \times Events \to Q$. Given
$\mathfrak{M} \models$ Specrel we define its metric-geometry (or Minkowski geometry) $Mg(\mathfrak{M})$
as a two-sorted structure as follows:


$Mg(\mathfrak{M}) := \langle Events, \mu; Q, 1 \rangle$.


$Mg(\mathfrak{M})$ is also referred to as the space-time of $\mathfrak{M}$. The two sorts of $Mg(\mathfrak{M})$ are
Events and Q; $\mu$ is a function of sort $Events \times Events \to Q$ and 1 is a constant
of sort Q. We want to state a strong equivalence between $Mg(\mathfrak{M})$ and $\mathfrak{M}$. These
two structures have different vocabularies (or signatures, or languages).

The part of logic that connects structures and theories on different vocabu- 
laries is called definability theory. The strongest kind of connection between 
two theories is definitional equivalence of theories. When two theories are def- 
initionally equivalent, we say that they are lexicographical variants of the same 
theory, the only difference being that they use different concepts of the theory
as basic ones. We will need definitional equivalence of first-order logic theories 
on vocabularies that have different sorts (or universes), hence the definition- 
ally equivalent models will have different kinds of universes. This amounts 
to defining new “entities” in a model, not only new relations or functions on 
already existing entities as in “standard” one-sorted definability theory of first- 
order logic. Definability of new sorts is important in the kind of definability that 
arises in relativity theory; therefore we worked out such a definability theory 
in Andréka et al., 2002, Sec. 6.3 and Madarász, 2002, Sec. 4.3. Below we recall 
the elements of definability theory that we are going to use. 

Let L be a vocabulary, possibly many-sorted, and let L’ be an expansion of
L, i.e. L’ may contain new sorts, and new relation and function symbols. The
L-reduct of a structure $\mathfrak{M}$ on vocabulary L’ is the obvious thing (we “forget” the
interpretations of symbols not in L). Let Th and Th’ be theories on vocabularies
L and L’, respectively. We say that Th’ is a definitional expansion of Th iff
the following (i)-(ii) hold:


(i) The models of Th are exactly the L-reducts of models of Th’.


(ii) For any two models $\mathfrak{M}_1$ and $\mathfrak{M}_2$ of Th’ with the same L-reduct there is
a unique isomorphism between $\mathfrak{M}_1$ and $\mathfrak{M}_2$ that is the identity on this
common reduct.


Let Th₁ and Th₂ be arbitrary theories (possibly on completely different
vocabularies). We say that Th₁ and Th₂ are definitionally equivalent when
they have a joint definitional expansion Th₃. (More precisely, definitional
equivalence is the transitive closure of the notion just defined. In this subsection
we will not need to take transitive closure.)

Intuitive explanation for definitional equivalence of theories: By “Th’ is a 
definitional expansion of Th” we mean that each sort, relation and function in 
the vocabulary of Th’ that is not present in the vocabulary of Th is actually 
defined in Th’ in the following sense. Properties (i)-(ii) above express that 
we consider a new relation (i.e. one in L’ but not in L) on an existing sort as 
defined (in Th’) if it can be “put” on every model of Th in a unique way so that 
it satisfies Th’, and we consider a new sort (and relations on it) as defined if it 
can be added to each model of Th in a unique way, up to a unique isomorphism.

In the definability theory of logic, there are a semantical and a syntactical
approach to definability, and of course the interesting thing is to state their
equivalence (this is Beth’s theorem in the usual definability theory of first-order logic).
We presented here the notions of the semantic approach; when Th’ is a
definitional expansion of Th we can say that Th’ is an “implicit, or semantical
definition” of the symbols not occurring in Th. In Andréka et al., 2002, Sec. 6.3
and Madarász, 2002, Sec. 4.3 we worked out the “syntactical” counterpart of
this definability, i.e. we gave concrete prescriptions for what an “explicit,
syntactical definition” of a new element of the vocabulary can look like. Then
we proved the analogue of Beth’s theorem (stating that a new element of the
vocabulary has an implicit definition exactly when it has an explicit definition).
When two theories are definitionally equivalent, there is a computable
meaning-preserving translation function between their languages (see Madarász, 2002,
4.3.27 and 4.3.29). For more on definability theory we refer to e.g. Makkai,
1993.

We now proceed to state definitional equivalence between theories occurring 
in relativity theory. From now on, in the present subsection, we assume $n \ge 3$.

The formula $\mu(p, q) = (p_1 - q_1)^2 - (p_2 - q_2)^2 - \dots - (p_n - q_n)^2$ is referred
to as the (squared) Minkowski metric and the structure $\langle Q^n, \mu; Q, 1\rangle$ is the
(metric) Minkowski geometry over $Q$. For a class K of structures, $\mathbf{I}$K denotes
the class of all structures isomorphic to elements of K.





THEOREM 11.16 (DEFINITIONAL EQUIVALENCE BETWEEN METRIC-GEOMETRIES
AND Catrel MODELS) Assume $n \ge 3$.


(i) Catrel is definitionally equivalent to the first-order logic theory of its
metric-geometries, i.e. Catrel and $Th_m$ in (ii) below are definitionally
equivalent.


(ii) $\mathbf{I}\{\langle Events, \mu; Q, 1\rangle : \mathfrak{M} \models \text{Catrel}\} =: MG$ is axiomatizable by finitely
many formulas, i.e. there is a finite axiom system $Th_m$ such that MG is
the class of all models of $Th_m$.


(iii) $\mathbf{I}\{\langle Q^n, \mu; Q, 1\rangle : Q \text{ is a quadratic ordered field}\} = MG$. I.e., the class of
Minkowski geometries (over quadratic ordered fields) coincides with the
class of metric-geometric models of Catrel.


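Outline of proof (i): We define a joint definitional expansion $Th_3$. The
vocabulary of $Th_3$ contains all the symbols occurring either in Specrel or in
$Mg(\mathfrak{M})$, plus one new $(n+2)$-place relation symbol $J$ of type $B \times Events^{n+1}$.
Let $\mathfrak{M} = \langle Q, +, \times, <; B, Ob, Ph; W\rangle \models$ Specrel$_0$ be given, and define

$$J_{\mathfrak{M}} := \{\langle m, ev_m(0), ev_m(1_t), \dots, ev_m(1_n)\rangle : m \in Ob\} \;\cup\; \{\langle ph, e_1, \dots, e_{n+1}\rangle : ph \in Ph,\ e_1, \dots, e_{n+1} \in Events,\ ph \in e_1, \dots, ph \in e_{n+1}\},$$

and the expansion of $\mathfrak{M}$

$$F(\mathfrak{M}) := \langle Q, +, \times, <, 0, 1; B, Ob, Ph; Events, \mu; W, J_{\mathfrak{M}}\rangle.$$

$Th_3$ is the set of all formulas valid in $\{F(\mathfrak{M}) : \mathfrak{M} \models \text{Catrel}\}$.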
Clearly, $J_{\mathfrak{M}}$ gives an “interpretation” of observers and photons in Events, and
so makes a connection between the two “alien” sorts $B$ and Events. The following
are the main ideas in showing that $Th_3$ is a definitional expansion of
$Th_m$ (the theory of metric-geometries of Catrel). We have Events, $\mu$, $Q$, 1 at
our disposal and we have to “define” (or recover) $+, \times, <, 0, B, Ob, Ph, W, J$.
First we define $0 := \mu(e, e)$, then we define “lightlike collinearity” on Events
by using $\mu$ as follows: $e_1, e_2, e_3$ are lightlike collinear iff $\mu(e_i, e_j) = 0$ for
all $i, j = 1, 2, 3$. From lightlike collinearity we then define usual collinearity
as in the proof of the Alexandrov-Zeeman theorem in Goldblatt, 1987,
App. 2, or in Andréka et al., 1999, Andréka et al., 2004. For the idea of
this part of the proof see Fig. 11.19. A proof for (a generalization of) the
Alexandrov-Zeeman theorem using a different, elegant idea is in Horvath, 2005.
A definability-theoretic analysis of the Alexandrov-Zeeman theorem in an
axiomatic setting can be found in Pambuccian, 2006. From collinearity and $0, 1, \mu$
we define the field-operations $+, \times$ by using Hilbert’s coordinatization
technique (see e.g. Goldblatt, 1987, pp. 23-27 or Andréka et al., 2002, Sec. 6.5.2).
Since the original field was quadratic and ordered, we can recover the
ordering $<$, too. From collinearity and $\mu$ we can define the so-called relativistic (or
Minkowski) orthogonality relation (see Fig. 11.22), and from $\mu$ again we
can then define the $(n+1)$-tuples of events $\langle e_0, e_1, \dots, e_n\rangle$ that correspond exactly
to $\langle ev_m(0), ev_m(1_t), \dots, ev_m(1_n)\rangle$ for some observer $m$ by requiring that
$\mu(e_0, e_i) = 1$ and $e_0, e_i$ is orthogonal to $e_0, e_j$ for all $i, j = 1, \dots, n$, $i \ne j$.
We can use these to define $Ob, Ph, B, J$ and $W$. In the above we made use of
the fact that all the constructions can be tracked with first-order logic formulas.
Showing that $Th_3$ is a definitional expansion of Catrel is the easier direction;
for a proof see Madarász, 2002, p. 241.

(ii): In the above construction, we defined the operations of Catrel by using $\mu, 1$,
thus we can express the finitely many axioms defining Catrel by using $\mu, 1$ and
we are done.

(iii): The functions $ev_m$ and $loc_m$ define isomorphisms between the structures
$\langle Q^n, \mu; Q, 1\rangle$ and $\langle Events, \mu; Q, 1\rangle$. QED


From the proof of Theorem 11.16 we actually can construct a finite theory
$Th_m$ axiomatizing the metric-geometries MG. This $Th_m$, however, is
complicated and not really illuminating. It would be nice to find a streamlined,
finite axiom system Th axiomatizing MG which contains few and
easy-to-understand, illuminating axioms about $\mu, 1$.

We called a property absolute if every observer “sees” it the same way. This
“absolute” means also “observer-independent” or “coordinate-independent”.
Theorem 11.8 states that relativistic distance $\mu$ is such an absolute property.
Clearly, every formula expressible by the use of $\mu, 1$ is absolute, too. The
corollary below says that these are all the absolute coordinate properties of
events, in Specrel. By a coordinate property of events we understand a property
expressible in terms of the coordinates $loc_m(e)$ of events $e$; more concretely,





Figure 11.19. Collinearity can be defined from lightlike collinearity, as follows. Assume $n = 3$.
Given a lightline $\ell$, the plane $P$ tangent to the light-cone and containing $\ell$ is the set of those
points $p$ through which no lightline intersecting $\ell$ goes. The reason for this is illustrated in
the two parts of the figure. Then we get all spacelike lines as intersections of tangent planes.
Then each timelike plane can be defined by a pair of intersecting lightlines and the spacelike
lines connecting them; timelike lines then are the “new” intersections of timelike planes. In the
above, spacelike lines, timelike lines, and lightlines are straight lines that lie outside, inside, and on
the light-cone, respectively. A plane is timelike if it contains a timelike line. The case $n > 3$ is
similar.





by a coordinate property of events $e_1, \dots, e_k$ we understand a formula either
in the form $(\forall m \in Ob)\,\psi(loc_m(e_1), \dots, loc_m(e_k))$ or in the form
$(\exists m \in Ob)\,\psi(loc_m(e_1), \dots, loc_m(e_k))$, where $\psi$ is a formula in the vocabulary of the
field-reduct $\langle Q, +, \times, <\rangle$.


COROLLARY 11.17 Every relation definable (in FOL) on Events in a model
of Catrel can be defined from $\mu$ and 1. Every coordinate-property of events in
a model of Specrel can be defined from $\mu$ and 1.


Theorem 11.16 can be interpreted as saying that metric-geometries are the 
observer-independent, “absolute realities” corresponding to models of Specrel. 
If we abstract from the concrete values of the metric-properties in the metric- 
geometries, we get the so-called causal-geometries, to be defined below. These 
correspond to the “absolute realities” of models of Specrel$_0$. We are going to
elaborate these ideas. 

Let us call events $e, e'$ causally separated, in symbols $e \sim_c e'$, iff there is
either an observer or a photon that participates both in $e$ and in $e'$, i.e. iff
$e \cap e' \cap (Ob \cup Ph) \ne \emptyset$. We call two space-time locations $p, q \in Q^n$ causally separated,
in symbols $p \sim_c q$, iff $time(p, q) \ge space(p, q)$, i.e. iff
$(p_1 - q_1)^2 \ge (p_2 - q_2)^2 + \dots + (p_n - q_n)^2$. Visually, $p \sim_c q$ means that $q$ is inside or on the
light-cone emanating from $p$. In Specrel, we have $\langle Events, \sim_c\rangle \cong \langle Q^n, \sim_c\rangle$; in
fact $loc_m$ and $ev_m$ are isomorphisms between these structures, for any $m \in Ob$.
Let $\mathfrak{M} \models$ Specrel, and define its causal-geometry as


$$Cg(\mathfrak{M}) := \langle Events, \sim_c\rangle.$$


Ants and elephants may use different units of measurement (e.g., their feet). 
The following axiom expresses that we abstract from the value of the units of 
measurement. We do so by requiring that all kinds of units of measurement 
be there. (In connection with the intuition/philosophy related to the follow- 
ing “ant-elephant” axiom cf. the Incredible Shrinking Man in Nicholls, 1982, 
pp. 194-195.) 


AxDil $(\forall m \in Ob)(\forall \lambda > 0)(\exists k \in Ob)(\forall p \in Q^n)\ w_{mk}(p) = \lambda p.$

$\text{Catrel}_0 := \text{Specrel}_0 \cup Comp \cup \{AxDil\} = (\text{Catrel} - \{AxSim\}) \cup \{AxDil\}.$





Assume $\mathfrak{M} \models$ Catrel$_0$. Then every part of $\mathfrak{M}$ can be recovered from $Cg(\mathfrak{M})$,
except 0 and 1. By this we mean that $B, Ob, Ph, W, Q, <$ all can be defined (or
recovered) from $Cg(\mathfrak{M})$, but instead of $+, \times$, which are not definable, we can
define their affine ternary versions $+_3, \times_3$ where

$$+_3(x, y, z) := x + y - z, \qquad \times_3(x, y, z) := x \cdot y / z.$$

Alternatively, $\mathfrak{M}$ can be defined over $Cg(\mathfrak{M})$ parametrically only, i.e. if we add
two (arbitrary) constants to $Cg(\mathfrak{M})$. So we have here an analogue of
Theorem 11.16 working between Catrel$_0$ and its causal-geometries of form $Cg(\mathfrak{M})$.
In particular, $\mathfrak{M}$ and $Cg(\mathfrak{M})$ are definitionally equivalent in the parametric sense
of definability.
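The relation between the ternary operations and the usual field operations can be illustrated by a small sketch. It assumes exact rational arithmetic as a stand-in for the field, and the names `plus3`, `times3` and `make_field_ops` are hypothetical, introduced only for this illustration. It shows that once the constants 0 and 1 are supplied as parameters, the binary operations are recovered from the ternary ones, and that the addition-like ternary operation commutes with translations while the multiplication-like one commutes with rescalings.

```python
from fractions import Fraction


def plus3(x, y, z):
    """Affine ternary version of addition: x + y - z."""
    return Fraction(x) + Fraction(y) - Fraction(z)


def times3(x, y, z):
    """Affine ternary version of multiplication: x * y / z (z must be nonzero)."""
    return Fraction(x) * Fraction(y) / Fraction(z)


def make_field_ops(zero, one):
    """Recover binary + and * from the ternary operations, given 0 and 1 as parameters."""
    def add(x, y):
        return plus3(x, y, zero)

    def mul(x, y):
        return times3(x, y, one)

    return add, mul


add, mul = make_field_ops(Fraction(0), Fraction(1))
```

With the true constants supplied, `add` and `mul` agree with ordinary rational addition and multiplication; the ternary operations themselves never mention 0 or 1.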

Instead of stating the precise analogue of Theorem 11.16 for this intimate 
connection between causal-geometries and Catrel$_0$, we turn to the question of
what the definable relations in causal-geometries are. 

Algebraic logic is a branch of logic that investigates the structure of the
definable concepts in a theory, or in a model of a theory. Let us consider
$Cg(\mathfrak{M})$ for an arbitrary $\mathfrak{M} \models$ Catrel$_0$. The definable relations in $Cg(\mathfrak{M})$ are
exactly those absolute properties of events which do not involve concrete values
of the metric $\mu$. We will call these relations causal-relations.

The unary (i.e. one-place) causal-relations are Events and $\emptyset$. What are the
binary (i.e. two-place) causal-relations?

We call events $e, e'$ timelike (lightlike, spacelike) separated, in symbols
$e \sim_t e'$ ($e \sim_\ell e'$, $e \sim_s e'$) iff [$e \ne e'$ and $e \cap e' \cap Ob \ne \emptyset$
($e \cap e' \cap Ph \ne \emptyset$, $e \cap e' \cap (Ob \cup Ph) = \emptyset$, respectively)]. On the “coordinate-side”,
we call space-time locations $p, q$ timelike (lightlike, spacelike) separated, in
symbols $p \sim_t q$ ($p \sim_\ell q$, $p \sim_s q$) iff [$p \ne q$ and $time(p, q) > space(p, q)$
($time(p, q) = space(p, q)$, $time(p, q) < space(p, q)$, respectively)]. These
are corresponding properties via the bijections $loc_m$ and $ev_m$ for $m \in Ob$, as
before.
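As a minimal sketch of the coordinate-side definitions above, the following Python fragment classifies two space-time locations by comparing time and space through their squares, so no square roots are needed. The function names `mu`, `separation` and `causally_separated` are illustrative assumptions rather than notation from the text, and exact rational arithmetic stands in for an arbitrary quadratic ordered field. The first coordinate is taken to be the time coordinate, as in the formula for the squared Minkowski metric.

```python
from fractions import Fraction


def mu(p, q):
    """Squared Minkowski metric: (p1-q1)^2 - (p2-q2)^2 - ... - (pn-qn)^2."""
    d = [Fraction(a) - Fraction(b) for a, b in zip(p, q)]
    return d[0] ** 2 - sum(x ** 2 for x in d[1:])


def separation(p, q):
    """Classify distinct locations as timelike, lightlike or spacelike separated."""
    if tuple(p) == tuple(q):
        return "identical"
    m = mu(p, q)  # equals time(p, q)^2 - space(p, q)^2
    if m > 0:
        return "timelike"
    if m == 0:
        return "lightlike"
    return "spacelike"


def causally_separated(p, q):
    """Causal separation: time(p, q) >= space(p, q), i.e. mu(p, q) >= 0."""
    return mu(p, q) >= 0
```

Note that under this reading every location is causally separated from itself, so the causal relation is the union of the identity with timelike and lightlike separation.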

Timelike, lightlike, and spacelike separability of events are all causal-relations,
i.e. they can be defined from $\sim_c$. In fact, all these four relations can be defined
from each other.

Below we sketch how $\sim_t$ can be defined from $\sim_c$. The argument is easier
to follow in the isomorphic structure $\langle Q^n, \sim_c\rangle$. The points causally separated
from a point $x$ are in the “solid” light-cone emanating from $x$. This light-cone
consists of two separate parts, the “upward” and the “downward” parts. We
cannot distinguish with a formula the two separate parts of the light-cone, but
we can express that “$y, z$ are in the same half-cone of $x$”, in symbols $y \equiv_x z$,
as follows.





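$$y \equiv_x z \;\Leftrightarrow\; \big[\,x \sim_c y \,\wedge\, x \sim_c z \,\wedge\, \exists w\,(x \sim_c w \,\wedge\, \neg\, w \sim_c y \,\wedge\, \neg\, w \sim_c z)\,\big].$$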
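The "same half-cone" condition can be explored numerically with the following rough sketch, which works in coordinates with integer points and searches a small grid around x for the required witness w. The names `causal` and `same_half_cone` and the `radius` parameter are assumptions made for this illustration. A witness found on the grid does establish the condition; in general, failing to find one on a finite grid is only evidence, not a proof, although for points in opposite half-cones no witness exists anywhere.

```python
from itertools import product


def causal(p, q):
    """Causal separation: squared time difference >= squared spatial distance."""
    dt = p[0] - q[0]
    return dt * dt >= sum((a - b) ** 2 for a, b in zip(p[1:], q[1:]))


def same_half_cone(x, y, z, radius=3):
    """Look for w with x ~ w, not w ~ y, not w ~ z on an integer grid around x."""
    if not (causal(x, y) and causal(x, z)):
        return False
    offsets = range(-radius, radius + 1)
    for delta in product(offsets, repeat=len(x)):
        w = tuple(a + d for a, d in zip(x, delta))
        if causal(x, w) and not causal(w, y) and not causal(w, z):
            return True  # explicit witness found
    return False  # no witness on this finite grid
```

For example, with x = (0, 0, 0), the points (2, 0, 0) and (3, 1, 0) lie in the same half-cone, witnessed by w = (3, 3, 0), while (2, 0, 0) and (-2, 0, 0) lie in opposite half-cones.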


From this then we can define timelike separability as follows: 





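$$x \sim_t y \;\Leftrightarrow\; \exists z\, w\,\big(x \sim_c z \sim_c y \,\wedge\, \neg\, x \equiv_z y \,\wedge\, x \sim_c w \sim_c y \,\wedge\, \neg\, x \equiv_w y \,\wedge\, \neg\, z \sim_c w\big).$$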


We used four variables in the above definition. By using 3 variables only,
$\sim_t$ is not definable from $\sim_c$. This can be proved by using the techniques of
algebraic logic, as follows. The four relations $\sim_t, \sim_\ell, \sim_s, Id$ form the atoms of
the Boolean algebra they generate. All these relations are symmetric, i.e. they
are their own converses. Moreover, the relational composition of any two distinct
non-identity ones is $Di := -Id$, while the relational composition of any non-identity one
with itself is the unit $Events \times Events$ of the Boolean algebra. Hence they
form a relation algebra. Relation algebras are introduced and briefly discussed
in Ch. 3 (see also Henkin et al., 1985, Hirsch and Hodkinson, 2002, Andréka
et al., 2001). The elements of a concrete relation algebra are binary relations,
and the operations are the Boolean ones together with relational composition
of binary relations, taking converse of a binary relation, and the relation $Id$ as
a constant. The binary causal-relations then form a relation algebra. We can
check that in this relation algebra $\sim_t$ is not in the subalgebra generated by $\sim_c$,
hence $\sim_t$ cannot be defined from $\sim_c$ by using only three variables, by Tarski’s
theorem, Tarski and Givant, 1987 or Ch. 3, Proposition 2.4. That $\sim_t$ can be
generated from $\sim_c$ by using 4 variables is equivalent to the fact that, in the
so-called 4-dimensional cylindric algebra of 4-place causal-relations, $\sim_t$ is
indeed in the subalgebra generated by $\sim_c$. For cylindric algebras we refer to
Henkin et al., 1985, Henkin et al., 1981, Andréka et al., 2001.
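The subalgebra check mentioned above can be carried out mechanically on the abstract four-atom algebra. The sketch below represents each binary causal-relation as a set of atoms and closes a generating set under complement, union and composition; converse is omitted because every atom is its own converse. The atom labels `"t"`, `"l"`, `"s"`, `"Id"` and the function names are assumptions of this illustration, as is the identification of causal separation with the union of identity, timelike and lightlike separation (which follows from the coordinate definition, where each point is causally separated from itself).

```python
from itertools import product

ATOMS = ("Id", "t", "l", "s")
UNIT = frozenset(ATOMS)
DI = UNIT - {"Id"}  # diversity relation, the complement of Id


def compose_atoms(a, b):
    """Composition table of the atoms as described in the text."""
    if a == "Id":
        return frozenset({b})
    if b == "Id":
        return frozenset({a})
    return UNIT if a == b else DI


def compose(r, s):
    """Composition extended additively to unions of atoms."""
    out = set()
    for a, b in product(r, s):
        out |= compose_atoms(a, b)
    return frozenset(out)


def generated_subalgebra(generators):
    """Close the generators under complement, union and composition."""
    elems = {frozenset(), UNIT, frozenset({"Id"})} | {frozenset(g) for g in generators}
    while True:
        new = set(elems)
        for r in elems:
            new.add(UNIT - r)
        for r, s in product(elems, repeat=2):
            new.add(r | s)
            new.add(compose(r, s))
        if new == elems:
            return elems
        elems = new


causal = frozenset({"Id", "t", "l"})
sub = generated_subalgebra([causal])
```

Under these assumptions the generated subalgebra has only three atoms (identity, the union of timelike and lightlike, and spacelike), so the timelike atom on its own is not an element of it.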
Figure 11.20. The binary causal-relations (in Specrel$_0$) form a relation algebra.


One can prove that all the binary causal-relations are the ones occurring in
the above relation algebra, which is represented in Fig. 11.20. Thus we have
described all the binary causal-relations. Similar definability results for the
special case $Q$ = “the rational numbers” are in van Benthem, 1983, pp. 23-30.

There are infinitely many ternary (i.e. 3-place) causal-relations. The most
often used ternary and 4-place causal relations are the following ones (again, it
is easier to define their “coordinate-versions” in $\langle Q^n, \sim_c\rangle$).

Collinearity of 3 space-time locations is a causal relation; let $coll(p, q, r)$
denote that $p, q, r$ are collinear, i.e. they lie on a straight line.

Betweenness: $Bw(p, q, r)$ iff “$coll(p, q, r)$ and $q$ is between $p$ and $r$”.

Equidistance: $Eq(p, q, r, s)$ iff “$\mu(p, q) = \mu(r, s)$”. Minkowski-circles (or
Minkowski-spheres) can be defined from equidistance, cf. Fig. 11.21.

Orthogonality: $Ort(p, q, r, s)$ iff “the line connecting $p, q$ is Minkowski-orthogonal
to the line connecting $r, s$”, i.e. iff
$(p_1 - q_1)(r_1 - s_1) - (p_2 - q_2)(r_2 - s_2) - \dots - (p_n - q_n)(r_n - s_n) = 0$. For illustration see Fig. 11.22.

Betweenness with ratio $\rho$: Let $\rho$ be any rational number. Then $Bw_\rho(p, q, r)$
iff “$Bw(p, q, r)$ and $\mu(p, q) = \rho \cdot \mu(q, r)$”.
Figure 11.21. Minkowski circles.


The above are all definable from $\sim_c$. The last example shows that there are
infinitely many ternary causal-relations.

If we have time-orientation, i.e. in models of Specrel$_0$ + Ax$\uparrow$, the following
important relation can also be defined. Event $e$ causally precedes event $e'$,
in symbols $e <_c e'$, iff ($e \sim_c e'$ and $(\exists m \in Ob)\ time_m(e) < time_m(e')$).
In Specrel$_0$ + Ax$\uparrow$, $e <_c e'$ is equivalent with the simpler formula
$(\forall m \in Ob)\ time_m(e) < time_m(e')$, assuming $n > 2$. It can be proved (analogously
to Corollary 11.5) that the corresponding property in space-time locations is:
$p <_c q$ iff ($p \sim_c q$ and $p_1 < q_1$). The relation $<_c$ is also called the causality relation, or
“after”; the first axiomatization of special relativity, in Robb, 1914, axiomatized
this causality relation. The general relativistic version of $<_c$ is quite important,
too, and is more intricate than the Specrel$_0$ version; cf., e.g., works of Penrose,
Malament, Busemann.

Finally, we list some absolute relations that can be defined in $Mg(\mathfrak{M})$, but
cannot be defined in $Cg(\mathfrak{M})$, for $\mathfrak{M} \models$ Catrel. Clearly, $\mu$ is such.

Minkowski scalar-product: $g_4(p, q, r, s) := (p_1 - q_1)\cdot(r_1 - s_1) - (p_2 - q_2)\cdot(r_2 - s_2) - \dots - (p_n - q_n)\cdot(r_n - s_n)$.
We note that $g_4(p, q, p, q) = \mu(p, q)$
and $g_4(p, q, r, s) = 0$ iff $Ort(p, q, r, s)$. Hence, $g_4$ “codes” both Minkowski-distance
and “relativistic angle”.

Relativistic (non-squared) distance of causally separated points:
$rd(p, q) := \sqrt{(p_1 - q_1)^2 - (p_2 - q_2)^2 - \dots - (p_n - q_n)^2}$. This is a partial
function defined exactly when the expression in the argument of the square
root is nonnegative, i.e. when $p \sim_c q$. For events, $rd(e, e') > 0$ is the proper time elapsed
between $e$ and $e'$ for any observer who takes part in both $e$ and $e'$ (and there
is such an observer); i.e. any observer who takes part in both $e$ and $e'$ measures
the elapsed time between $e$ and $e'$ as $rd(e, e')$.
We note that $\mu, g_4, rd$ are definable from each other. Actually, we will make
use of this in our section on general relativity, in Sec. 3.4, where we will use
the binary version $g$ of $g_4$, which is defined by

$$g(p, q) := g_4(p, 0, q, 0),$$

where 0 in the arguments of $g_4$ denotes the origin of $Q^n$.
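The relations among the scalar product, the squared metric, orthogonality and the relativistic distance can be checked on concrete points with the following sketch. The Python names `g4`, `mu`, `ort`, `g` and `rd` mirror the symbols of the text but are otherwise assumptions of this illustration; rationals stand in for the field, and `rd` falls back on a floating-point square root, so it is only an approximation of the field-valued function.

```python
import math
from fractions import Fraction


def g4(p, q, r, s):
    """Minkowski scalar product of the difference vectors p - q and r - s."""
    d = [Fraction(a) - Fraction(b) for a, b in zip(p, q)]
    e = [Fraction(a) - Fraction(b) for a, b in zip(r, s)]
    return d[0] * e[0] - sum(x * y for x, y in zip(d[1:], e[1:]))


def mu(p, q):
    """Squared Minkowski metric, obtained as g4(p, q, p, q)."""
    return g4(p, q, p, q)


def ort(p, q, r, s):
    """Minkowski-orthogonality of the lines through p, q and through r, s."""
    return g4(p, q, r, s) == 0


def g(p, q):
    """Binary version of g4, taken relative to the origin."""
    origin = (0,) * len(p)
    return g4(p, origin, q, origin)


def rd(p, q):
    """Relativistic distance; None when p and q are not causally separated."""
    m = mu(p, q)
    return math.sqrt(m) if m >= 0 else None
```

One direction of the mutual definability can be seen from the polarization identity g(p, q) = (mu(p + q, 0) - mu(p, 0) - mu(q, 0)) / 2, which expresses the scalar product through the squared metric and vector addition.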




















Figure 11.22. $\ell$ is Minkowski-orthogonal to each straight line in $P$.


For more concrete definitions and for intuition for the above relations we refer 
to e.g. Goldblatt, 1987, or Madarász, 2002, Sec. 4.2. 

By Theorem 11.16 and the analogue for Catrel$_0$, if we want to study special
relativity in the form of Catrel or Catrel$_0$, then this can be done equivalently by
studying the simple metric-geometries and causal-geometries $\langle Events, \mu; Q, 1\rangle$
and $\langle Events, \sim_c\rangle$, using their finitely axiomatized theories $Th_m$ and $Th_c$,
respectively. (A finite $Th_c$ can be found in Latzer, 1972.) The possibility of
switching to the geometries instead of the original models (without losing
information) is useful e.g. because the transition from special relativity to general
relativity (GR) is quite smooth on the level of geometries. The study of GR
on this causal-geometric (axiomatic) level of abstraction is promoted e.g. in
Kronheimer and Penrose, 1967. Busemann, 1967 uses the generalization of $rd$
in his approach to general relativity space-times. The study of GR on the
metric-geometric level using the generalization of $g$ is most common, see e.g. Wald,
1984, Hawking and Ellis, 1973, Rindler, 2001 (but in these “g-oriented” cases the
linguistic economy of the Kronheimer-Penrose approach is usually sacrificed).


One of the most useful and most interesting branches of mathematical logic
is, in our opinion, definability theory. Definability theory is strongly related
to relativity theory; in fact its creation was initiated by Hans Reichenbach in
1924 (Reichenbach, 1969), motivated by relativity theory. Reichenbach in his
works emphasized the need for definability theory and made the first steps in
creating it. It was Alfred Tarski who later (1930) founded and established this
branch of mathematical logic.
Very briefly, the reason for the need for definability theory (of logic) in
relativity theory is as follows. When one sets up a physical theory Th, one wants to
use only so-called observational concepts, like e.g. “meeting of two particles”.⁷
While investigating the theory Th, one defines new, so-called “theoretical”
concepts, like e.g. “relativistic distance of events”. Some defined concepts then
prove to be so useful that one builds a new theory Th’ based on the most useful
theoretical concepts, and investigates this new theory Th’ on its own merits.

The new theory Th’ usually is simple, streamlined, elegant, built so that
we satisfy our aesthetic desires. The original theory Th contains its own
interpretation, because we defined it so. The physical interpretation of the new
streamlined theory Th’ is provided by its connection with Th. The strongest
known relationship between two theories is definitional equivalence. When
Th and Th’ are definitionally equivalent, in the rigorous sense of definability
theory of first-order logic, the observation-oriented theory Th can be
recaptured completely from the theory-oriented streamlined theory Th’; and vice
versa, the theoretical concepts of Th’ can be defined (justified) over the
observational Th. Looser relationships between Th and Th’ are also very useful;
these kinds of relationships between theories are the subject of interpretability and
duality theories. Cf. van Benthem, 1982 for more on logic, definability theory, and
model theory for empirical theories.

In Secs. 2.1-2.3 when we formalized special relativity in first-order logic, we
tried to choose the basic concepts of our language as observational as possible;
and we introduced the more theoretical concepts of relativity as definitions
at later stages, when development of the theory justified them. Eventually,
in Sec. 2.6, this process led to the introduction of a new theory with new
basic concepts (new vocabulary, like $\sim_c$, $<_c$). This is a natural way of theory
development, theory “understanding”, theory analysis. In modern approaches
to logic, theories are considered as dynamic objects as opposed to the more
classical “eternally frozen” idea of theories. For approaches to the dynamic
trend in mathematical logic cf. van Benthem, 1996.





7 The concepts potentially usable in scientific theories (such as e.g. relativity) have been partially ordered in
the literature as being more observable (and less “theoretical”) or less observable and more theoretical. Here
“observable” also means primary or empirical. This observable/theoretical distinction, or rather hierarchy, is
recalled from the literature (of relativity theory) in e.g. Friedman, 1983, pp. 4-5. This observable/theoretical
hierarchy is not perfectly well defined and is known to be problematic, but as Friedman puts it, it is still better
than nothing. E.g. the motion of the oceans called tides is more observable (or closer to being observable) than
the pull of gravity of the Moon which, we think, is causing it. That is, the gravitational force field of a
mass-point (like the Moon) is a more theoretical concept than the motion of a body (e.g. the ocean’s shore-line).
Actually the gravitational force field might turn out to be a “wrong concept” and we may have to replace
it with something else like the curvature of space-time. Probably the motion of the ocean’s shore-line will
be less questionable as a “something” which one can talk about. As Friedman, 1983, p. 4 points out, the
observational/theoretical distinction is not an absolute one. E.g. what is an observational concept at a certain
stage of theory development might turn out to be a theoretical one later.
Theories form a rich structure when we investigate their interconnections.
Algebraic logic establishes a duality between hierarchies of theories (on different
vocabularies) and classes of algebras, cf. e.g., Henkin et al., 1985,
Sec. 4.3 or Andréka et al., 2001, Part II. Investigating a theory via investigating
the hierarchy of its different perspectives and subtheories is like investigating
a 3-dimensional object from all sides. This leads us to the subject of the next
subsection.


2.7 Conceptual analysis and “reverse relativity” 


In the previous subsections we strengthened Specrel to a complete theory 
Catrel+RC. But in reality, when working with a theory, we do not want to 
make our axioms generate a complete theory. Our purpose is just the opposite: 
we want to make our axioms as weak, simple, and intuitively acceptable and 
convincing as possible while still strong enough for proving interesting theo- 
rems of relativity theory. Similar striving for economy of assumptions is e.g. 
in Szabó, 2006, Szabó, 2002, Ax, 1978. The reasons for wanting to study weak 
theories as opposed to strong ones are, among others, the desire for answering 
the “why-type questions”, and seeking a conceptual analysis of the theory. For 
more on this we refer to Andréka et al., 2002, Sec. 1.1. Further reasons for 
striving for weak physical theories having many models are presented in van 
Benthem, 1982. Namely, e.g. “small” mechanical systems like our solar system 
or another one or our galaxy can be regarded as many different “small models” 
of e.g. Newtonian mechanics. 

Among other things, we can use logic to find out which axioms are responsi- 
ble for certain surprising predictions of relativity theory like e.g. “no observer 
can move faster than the speed of light”, “the twin paradox” or issues concern- 
ing the possibility of time travel. We can call such studies “reverse relativity” 
alluding to the analogy with the highly successful direction called reverse math- 
ematics, cf. e.g. Simpson, 2005, Friedman, 2004. 

In reverse relativity, we single out an interesting prediction of relativity
theory like “observers cannot move faster than light (NoFTL)”, or one of our
paradigmatic effects in Theorem 11.6, and ask ourselves which axioms (of e.g.
Specrel) are responsible for the prediction in question, cf. Theorem 11.7. Let $\varphi$
denote the prediction in question. So typically we know that Specrel $\models \varphi$ and
that $\varphi$ “is interesting”. Then we ask ourselves whether the whole of Specrel is
needed for proving $\varphi$. (Recall, the axioms of Specrel are “assumptions”, hence
they cost money so to speak.) A further question is to ask which fragment
of Specrel is needed/sufficient for proving $\varphi$. This type of research has been
carried through for several interesting choices of $\varphi$, e.g. in Andréka et al., 2002,
Madarász, 2002, Andréka et al., 2004.
Let us take as an example for $\varphi$ the prediction NoFTL (i.e. that no observer
can move faster-than-light relative to another) established as Theorem 11.7(ii),
p. 636. Certainly, NoFTL is an interesting prediction; indeed, many thinkers
tried to get rid of NoFTL either by using “tachyons” or by circumnavigating
it by using wormholes, cf. Sec. 4 for the latter. An instructive “saga” of such
efforts is provided in Thorne, 1994. The analysis of NoFTL tells us that Specrel
can be considerably weakened without losing NoFTL. By Theorem 11.7 in this
chapter, the assumption $n > 2$ is needed, however. The two key axioms of
Specrel are the Light Axiom, AxPh, and AxEvent (in some sense). It turns
out that both of these are needed for NoFTL. However, both of them can be
weakened considerably without losing NoFTL. In the case of AxPh, isotropy is
not needed for NoFTL. Of AxPh, it is enough to assume that photons are not
like bullets, they do not race with each other, and they can be sent from each
point in each direction; i.e. that for any observer $m$ in each direction $d$ there is
a number $c_m(d) \in Q$ representing the speed of light for $m$ in direction $d$. Of
AxEvent, it is enough to assume that if $m$ sees an event on the worldline of $k$,
then $k$ also sees that event; and that if $m$ sees an event that $k$ sees then $m$ sees
all events in a neighborhood (in $k$’s coordinate system) of this event. Some
reflection reveals that this is a more natural, milder assumption than AxEvent
was. As it turns out, the rest of the axioms of Specrel$_0$ can also be weakened
without losing NoFTL.

Careful analysis of the NoFTL prediction can be found in Madarász, 2002,
2.8.25, 3.2.13, 3.2.14, Madarász et al., 2004, Theorem 3, Theorem 5, Andréka
et al., 2002. Similar pieces of conceptual analysis, analysing predictions as
interesting as NoFTL, can be found in Andréka et al., 2002, Sec. 4.2,
Andréka et al., 2004, Madarász et al., 2004, Madarász et al., 2006b, Madarász
et al., 2006a. Predictions that have been analyzed in these works include the
twin paradox, the paradigmatic effects, and the effect of gravity on clocks.


3. General relativistic space-time 


In this section we extend our logic-based study of relativity from special 
relativity to general relativistic space-time (GR space-time). In particular, in 
Sec. 3.6 we present a purely first-order logic axiomatization Genrel for GR 
space-time. Theorem 11.28 is a kind of completeness theorem for Genrel. 
Besides providing a first-order logic axiomatization of GR space-times (anal- 
ogously to Sec. 2) and comparing it with Specrel, we will put extra emphasis 
on discussing the exotic properties of various distinguished examples of GR 
space-times in Sec. 4. One of the reasons for this is that these exotic GR space- 
times are at the center of attention nowadays, e.g. because of their fantastic 
properties and because astronomers have been discovering examples of these, 
e.g. finding observational evidence for huge black holes in the last 15 years. 
Another reason for putting emphasis on examples is that while Specrel has
basically one intended model, general relativity (GR) has many different intended
models (e.g. various kinds of exotic black holes, wormholes, timewarps,
models for the expanding universe, the Big Bang, to mention a few). This contrast
between special relativity and GR motivates our shifting the emphasis to
distinguished models in what comes below. This kind of motivation is further
elaborated in Taylor and Wheeler, 2000.

A motivation for the logical analysis of GR is that, in principle, GR
space-times permit such counter-common-sense arrangements as time travel (in one
form or another). This was discovered by Kurt Gödel during his cooperation
with Einstein. But the so-called paradoxes of time travel offer themselves for
a logical analysis, since these kinds of circularity have been the “bread-and-butter” of
the logician ever since Gödel’s incompleteness proof or since the first logical
analysis of the liar paradox and its variants. Even if we wanted to exclude
time travel by some axiom like one or another form of the so-called Cosmic
Censor Hypothesis, it remains a question how to find and justify a natural
axiom to this effect without making unjustified assumptions. This dilemma
is illustrated by the debates about the various forms of the Cosmic Censor
Hypothesis and related assumptions discussed e.g. in Earman, 1995.


3.1 Transition to general relativity: accelerated 
observers in special relativity 


In Specrel, we restricted attention to inertial observers. It is a natural idea
to generalize the theory to include accelerated observers as well. Actually,
when creating general relativity, Einstein emphasized that accelerated observers
should be included, cf. Einstein, 1961, pp. 59-62. Indeed, the usual transition
from special relativity to the general theory of relativity goes as follows. First
special relativity is generalized to accommodate accelerated observers, and then
one introduces Einstein's principle of equivalence (EPE) which states that the
phenomena of acceleration and gravity are equivalent (in a carefully specified
concrete sense). Then, at this point, our language is rich enough to talk about
gravity in the form of acceleration. After this point, one refines the theory,
arriving at GR, and then it all hangs together to form a worldview broader than
special relativity and also broader than Newtonian gravitation theory. The above
is illustrated by e.g. Einstein, 1961, the classic general relativity book Misner
et al., 1970, pp. 163-165, Rindler, 2001, e.g. p. 72, §3.8, §12.4, pp. 267-272.
Even works intending to venture to the unknown beyond GR use the
above “methodology” of starting with accelerated observers, cf. e.g. Smolin,
2001, pp. 77-80.
The same is done in the research area reported in the present work. Namely,
in Sec. 2 and in related works, the logical analysis of special relativity is done,
yielding e.g. the hierarchy of theories containing Specrel$_0$, Specrel, Catrel.
The next stage extends Specrel by considering new kinds of entities called
accelerated observers and stating further axioms governing their behavior. This
yields a new theory Accrel which can be regarded as an extension and refinement
of Specrel. Gravity can be studied in Accrel in the form of acceleration; this is done
e.g. in Madarász et al., 2006a, in the spirit outlined above. The works Andréka
et al., 2006b, Madarász et al., 2006b, Madarász et al., 2006a which study Accrel
stay inside the purely first-order logic based approach represented by Specrel in
Sec. 2 of this chapter. Using the experience and motivation gained by studying
Accrel, in Sec. 3.6 we introduce a first-order logic theory Genrel for general
relativistic space-time. All this converges to a logical analysis of GR.

Instead of recalling Accrel, which is very similar in spirit to the axiom
system Genrel in Sec. 3.6, we summarize its main features relevant to Genrel.
In Accrel, “accelerated observer” means “not necessarily inertial observer”,
and “observer” means a body that has a worldview, i.e. which occurs in the
domain of the worldview relation W. Thus, an accelerated observer has a
worldview. Roughly, the worldview of an accelerated observer k is obtained
from the worldview of an inertial one, m, by re-coordinatizing it along a smooth
bijection w_{mk} with an open subset of Q^n as its domain. Thus k may use only
part of Q^n for coordinatizing events, and more importantly, the worldlines of
inertial observers and photons are no longer straight lines in an accelerated
observer’s worldview. I.e., AxEvent and AxLine cease to hold. They hold
in generalized, weaker forms only. Specifically, an accelerated observer can
recognize worldlines of inertial bodies as so-called “geodesic curves”; this is
the motivation for Sec. 3.3 and for the axiom AxLine in Genrel.

The key axiom of accelerated observers states that at each moment of his life,
each accelerated observer sees the nearby world for a short while as an inertial
observer does. Technically, in Accrel we formulate this as stating that at each
moment of the life of an accelerated observer k there is a so-called co-moving
inertial observer m such that the linear approximation (i.e. the differential) of the
worldview transformation w_{mk} at this space-time point is the identity function.
This axiom, called AxAcc in the quoted works, is the connecting point between
the worldviews of inertial and non-inertial observers. The counterpart of AxAcc
in the present work will be discussed next.

We can think of an accelerated observer in special relativity as a spaceship
which uses fuel for accelerating (in a space where there is no gravity). When
the drive is switched off, the ship will transform into an inertial ship; this is
the co-moving inertial observer at the event of switching off the ship-drive. Or,
equivalently, we can think of the co-moving inertial observer as a spaceprobe
which was let go from the ship: a metaphorical apple dropped in space. The
ship can measure its own acceleration by measuring the acceleration of the
spaceprobe, just as here on Earth we can measure gravity by measuring the
acceleration of a dropped apple. EPE then implies that the spaceship can interpret
the “falling of the metaphorical apples” either by thinking that it is accelerating
in an empty space, or by thinking that it is suspended in a space where there
is gravity, and dropped apples fall because they are no longer suspended. By
EPE, the worldview of an accelerated observer in special relativity is similar
to that of a “suspended” observer in a space-time where there is gravity. The
gravitational counterpart, by EPE, of AxAcc is Einstein’s Locally Special Relativity
Principle, which we recall at the beginning of Sec. 3.2; it will be our starting
point in defining GR space-times.

Summing up: by EPE, investigation of gravity can be reduced to the investigation
of the worldlines of inertial bodies in a GR space-time.

Let us turn to the reasons why the transition from special relativity to
general relativity goes via accelerated observers and EPE. In Sec. 2, we chose
to derive special relativity from the outcome of the famous Michelson-Morley
experiment, i.e. from the Light Axiom. However, as we already mentioned,
relying on the Light Axiom is not really necessary. As Einstein always emphasized
(e.g. in Einstein, 1961), relativity can be derived from a deep philosophical
principle called the special principle of relativity (SPR). SPR has been around
in our culture for 2500 years (roughly), hence it is well understood and it blends
nicely with our best understanding of the world. Roughly, SPR says that the
Laws of Nature are the same for all inertial observers. The modern form of
SPR was articulated by the Norman-French Nicole d’Oresme around 1300
(Paris) and (a bit more thoroughly) by Galileo Galilei (around 1600). After
Olaf Roemer, James Bradley and followers discovered that the speed of light
is finite (and related issues were clarified), SPR could have been used⁸ to show
inconsistency with the Newtonian worldview and then to derive special relativity
(analogously to the train of thought we used in Sec. 2). This in turn would
have predicted the outcome of the Michelson-Morley experiment.⁹ Einstein
elaborated this idea in detail, and in particular, emphasized that special relativity
can be derived from SPR (in place of the Light Axiom). The same kind
of philosophical taste led Einstein to ask why SPR is restricted to inertial
observers only. Why aren’t the Laws of Nature the same for all observers (not
only the inertial ones)? After all, we ourselves, sitting on the surface of the
Earth (and fighting gravity all the time), are not inertial observers according to
the definition used in SPR.





⁸ With hindsight, this possibility was there around the 1830’s or so. Roemer made the discovery around
1680, but it was not generally accepted until approximately 1750.

⁹ Of course, for this, light propagation needs to be regarded as a Law of Nature, but as Einstein points out, this
is absolutely natural.
So, Einstein started working towards GR by generalizing SPR to the general
principle of relativity (PR) which says that the Laws of Nature are the same for
all observers, including accelerated ones. This move creates some extra tasks to
handle, because accelerated observers experience the existence of a new “force-field”,
namely gravity. So Einstein introduced his principle of equivalence
EPE, unifying the “gravity” created by acceleration with “ordinary gravity”. Now, to
uphold PR we have to regard gravity (and light propagation of course) as part
of what constitutes the Laws of Nature. This is the source of the extra tasks mentioned
above, since in Specrel properties of gravity were not part of the picture. As
we will see in the next section, this extra work can be handled, leading to a
unification broader than that provided by special relativity. The new theory
GR unifies space, time, motion, light-propagation, and gravitation into a single
purely geometrical perspective.


3.2 Einstein’s “locally special relativity principle” 


Einstein’s locally special relativity principle, which says that General Relativity
is locally Special Relativity, is the following. Let p be a point in a GR space-time.
If we drop a small enough spaceship at p and put in it an experimental scientist
who lives for a short enough time, then the experimentalist will
find special relativity true in the spaceship. This holds true even on the event
horizon of a spinning black hole or wherever you want. Of course, it is crucial
that the spaceship is small enough and that its life is considered only for a small
enough time interval. See Fig. 11.23. These local tiny spaceships will appear
later as “local reference frames”, LFR’s. They play the same role in GR as
co-moving observers did in AxAcc.

Next we implement Einstein’s locally special relativity principle formulated
above for formalizing GR space-times. In this and in the next few subsections,
for simplicity, we will use R in place of an arbitrary linearly ordered quadratic
field Q, and also we will use n = 4. Later, in Sec. 3.6, we will return to the
generality of Q and n ≥ 2.

For general relativity, we will use global coordinate frames, GFR’s. A global
coordinate frame is based on an open subset of R^4. So a global coordinate frame
looks like a special relativity frame; we even call one of the coordinates time,
the others space, etc. The difference is that in a global frame the coordinates do
not carry any physical or intuitive meaning.¹⁰ They serve only as a matter of
convention in gluing the local special relativity frames, LFR’s, together. For
simplicity, at the beginning we will pretend that the coordinate system of our
global frame is the whole of R^4. Later we will refine this to saying that the
global frame is an open subset of R^4. And even later, in Sec. 3.6, we will
generalize this to be a manifold. Since the differences are extremely minor and
secondary from the point of view of the basic notions we are going to introduce
now, let us first pretend that the global frame is R^4.

¹⁰ To be precise, the topology of the global frame will be relevant.








Figure 11.23. Einstein’s locally special relativity principle: wherever we drop a small enough
spaceship, for a short enough time it will experience special relativity.


Imagine a general relativistic coordinate system, a GFR, representing the
whole universe, with a black hole in the middle etc. So we are looking at
the bare coordinate grid of R^4 intending to represent the whole of space-time.
What is the first thing we want to specify for our readers about the points of this
grid R^4? Well, it is how the local tiny little special relativistic space-times are
associated to the points p of R^4, in accordance with Einstein’s locally special
relativity principle formulated at the beginning of this subsection. Thus, to every
point p of R^4 we want to specify how the local special relativity space-time at
point p is squeezed into the local neighborhood of p. The point is in specifying
how the clocks of the LFR slow down or speed up at p, and which axis of the
local LFR points in what direction and is distorted (shortened/lengthened) to
what degree. The LFR at p corresponds to the metaphorical spaceship dropped
at p as in Fig. 11.23.

How do we specify the local frames LFR’s? A local frame L_p at p will be
a bijective mapping L_p : R^4 → R^4 such that L_p(0) = p. We will think of
the first R^4 as the coordinate system of special relativity, or of the Minkowski
space represented by LFR, and of the second R^4 as the global frame GFR upon
which we want to build our GR space-time. Thus we write L_p : LFR → GFR,
where formally LFR := GFR := R^4. Since we want to use our local frames
to specify how the tiny clocks slow down in the “linear limit” (roughly, in an
“infinitesimally small” neighborhood of p), we will choose these L_p’s to be
affine mappings.

Figure 11.24. The local frame at p is an affine mapping L_p of R^4 to R^4 taking 0 to p. We will
use L_p in small neighborhoods of 0 only.

So, the key device in building our GR space-time is associating to each point
p of our global coordinate grid GFR an affine transformation L_p mapping the
Minkowski space represented by LFR to the global frame GFR.


DEFINITION 11.18 (GENERAL RELATIVISTIC SPACE-TIME) By a general
relativistic space-time we understand a pair (M, L) where


M ⊆ R^4 is open and


L is a function defined on M such that L(p) : R^4 → R^4 is a bijective
affine mapping which takes the origin 0 to p, for each p ∈ M. Further,
we require that L (as a function of p ∈ R^4) be infinitely many times
continuously differentiable, in short smooth.


For better readability, we will write L_p to denote L(p). We will use local
special relativity frames (the LFR’s) for importing the notions of special relativity
to our GR space-times. We will use the inverse mapping L_p^{-1} of the affine
transformation L_p to translate our general relativistic problems to special relativity,
and we will use L_p to bring back the answers special relativity gives us.
Though L_p^{-1} is defined on the whole of R^4, we will use it only in small enough
neighborhoods of p (i.e. we will use it in the limit, more and more accurately
as we close in on p). Restricting attention to small neighborhoods of p is what is
meant by saying that GR can be locally reduced to special relativity, but only
locally. If we want to solve a problem at a point q farther away from p, then
we will have to use the mapping L_q associated to q in place of using L_p.

We can specify the local frame L_p by the images of the four unit-vectors 1_i,
as follows: G(p) = (G_t(p), ..., G_z(p)) is a 4-tuple of vectors (i.e. elements
of R^4) such that L_p : R^4 → R^4 is the affine transformation which maps the
origin 0 to p, and 1_i to G_i(p) + p, for i ∈ {t, x, y, z}, see Fig. 11.24. Thus, we
can specify a GR space-time (M, L) by simply specifying four vector-fields.
Here we use the word “field” as in analysis and not as in algebra. Thus “field”
in “vector-field” means that we have a vector at each point of M ⊆ R^4. This
gives us the following equivalent definition for a GR space-time.


DEFINITION 11.19 (GENERAL RELATIVISTIC SPACE-TIME IN VECTOR-FIELDS
FORM) By a general relativistic space-time in vector-fields form we
understand a four-tuple (G_t, G_x, G_y, G_z) of vector-fields such that


each G_i : M → R^4 is smooth, where M = Dom(G_i) ⊆ R^4 is open (i ∈
{t, x, y, z}) and


the vectors G_t(p), ..., G_z(p) at each point p ∈ M are linearly independent
in the usual sense. (This means that the affine mapping they specify is a
bijection.)


An advantage of Def. 11.19 is that it is simple, and that it is in this form that 
we can “draw” or visualize a general relativistic space-time. See Figs. 11.30, 
11.31, 11.33, 11.40, 11.34. 
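
The passage from a tetrad to the local frame it specifies can be made concrete with a few lines of code. The following is only a minimal sketch: the names `det` and `local_frame` and the sample tetrad are illustrative choices and not notation from this chapter. It assumes coordinates are ordered (t, x, y, z) and uses floating-point numbers in place of R.

```python
# Sketch only: `det`, `local_frame` and the sample data are hypothetical names/values.

def det(m):
    """Determinant by cofactor expansion along the first row (adequate for 4x4)."""
    if len(m) == 1:
        return m[0][0]
    total = 0.0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def local_frame(p, tetrad):
    """Return the affine map L_p determined by p and the vectors G_t(p), ..., G_z(p)."""
    # Linear independence of the tetrad is what makes L_p a bijection.
    if abs(det(tetrad)) < 1e-12:
        raise ValueError("tetrad is not linearly independent, so L_p is not a bijection")

    def L_p(x):
        # L_p(x) = p + sum_i x_i * G_i(p): sends 0 to p and 1_i to G_i(p) + p.
        return tuple(p[k] + sum(x[i] * tetrad[i][k] for i in range(4)) for k in range(4))

    return L_p


p = (1.0, 2.0, 0.0, 0.0)
tetrad = [
    (0.5, 0.0, 0.0, 0.0),  # G_t(p)
    (0.0, 2.0, 0.0, 0.0),  # G_x(p)
    (0.0, 0.0, 1.0, 0.0),  # G_y(p)
    (0.0, 0.0, 0.0, 1.0),  # G_z(p)
]
L_p = local_frame(p, tetrad)
print(L_p((0, 0, 0, 0)))  # image of the origin of LFR, i.e. p
print(L_p((1, 0, 0, 0)))  # image of the unit vector 1_t, i.e. G_t(p) + p
```

A full space-time in vector-fields form would supply such a tetrad at every point of M, for instance as four Python functions of p; the sketch fixes a single point only.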

Now, how do we use a GR space-time, i.e. such a 4-tuple of vector-fields,
for representing some aspects of reality? Very, very roughly, the information
content of a GR space-time G = (G_t, ..., G_z) can be visualized as follows.
The vector tetrad G_t(p), ..., G_z(p) at point p tells us how the measuring
instruments (clocks, meter-rods) of the tiny little inertial observer we imagine
as being dropped at p go crazy (go wrong) from the point of view of the big,
global general relativistic coordinate grid GFR we are using in G. This
information is very subjective, since as we said, the big global coordinate grid
carries no physical meaning, it is conventional. But some objective content can
be extracted from this subjective information. As we said, for a fixed p, the
vector tetrad G_t(p), ..., G_z(p) represents how the local frame is glued
into the holistic picture of the global frame. The important point is, however,
how the individual local frames are distorted, rotated etc. w.r.t. each other; the
big global frame grid is only a theoretical, conventional device to serve as a
common denominator in arranging the little local frames relative to each other.
What can be described in terms of the 4-tuple (G_t, ..., G_z) of vector-fields?
Well, we can describe the (potential) worldlines (parameterized with wristwatch
times) of inertial observers and the worldlines of photons. We will see that
knowing what the potential worldlines of inertial observers and photons are
tells us everything important about a GR space-time. Gravity and curvature can be
defined explicitly from knowing the above mentioned worldlines.


3.3 Worldlines of inertial observers and photons in a 
general relativistic space-time 


The worldlines of inertial observers will be described mathematically as
timelike geodesic curves. We now turn to defining these. In this subsection we
fix a general relativistic space-time (M, L), and we let GFR := M, LFR := R^4.

By a (smooth) curve f we understand a smooth mapping f : I → R^4, where
I is an open interval of R. By a point of the curve we mean a point in its range.

Intuitively, the curve f : I → GFR is called timelike at point p iff the local
frame at p “sees” an observer co-moving with the curve at p. In more detail,
the curve f is called timelike at p iff the speed of f as seen by the local frame at
p is smaller than 1. This means that the tangent of the curve L_p^{-1} ∘ f has slope
smaller than 1 at the origin; geometrically this means that the tangent straight
line at the origin is within the light-cone. The curve f is called timelike iff the
curve f is timelike at each of its points p. (Note: this is independent of how
the curve f is parameterized.) See Fig. 11.25.

Figure 11.25. The curve f is timelike at point p.

Note that talking about the tangent of L_p^{-1} ∘ f : I → LFR involves nothing
“fancy”, since (at this step) we are in a special relativity space-time and we
are using its Euclidean geometry over R^4 and we are looking at a smooth
curve in it.

We think of a timelike curve as a curve that in principle can be the worldline
of a (perhaps accelerated) observer.
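
To illustrate the phrase “the speed of f as seen by the local frame at p”, here is a small sketch. It assumes the tetrad at p and the tangent vector of f at p (in GFR coordinates) are already known; the names `solve`, `local_speed` and `classify` are illustrative and do not come from this chapter. The linear part of L_p^{-1} is applied by solving a 4 by 4 linear system, and the speed is the ratio of the spatial length to the time component of the resulting vector.

```python
import math


def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [list(map(float, a[i])) + [float(b[i])] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular matrix: the tetrad is not linearly independent")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def local_speed(tetrad, tangent):
    """Speed of a curve with the given GFR tangent vector, as seen by the local frame.

    tetrad[i] is G_i(p). The linear part of L_p^{-1} sends `tangent` to the
    vector v satisfying sum_i v_i * G_i(p) = tangent.
    """
    columns = [[tetrad[i][k] for i in range(4)] for k in range(4)]  # columns are the G_i(p)
    v = solve(columns, tangent)
    if v[0] == 0:
        return math.inf  # no progress in local time at all
    return math.sqrt(v[1] ** 2 + v[2] ** 2 + v[3] ** 2) / abs(v[0])


def classify(speed, tol=1e-9):
    """Timelike iff speed < 1, photonlike iff speed = 1 (up to a numerical tolerance)."""
    if speed < 1 - tol:
        return "timelike"
    if abs(speed - 1) <= tol:
        return "photonlike"
    return "neither"


# Hypothetical tetrad: local clocks are compressed, local x-rods are stretched.
tetrad = [(0.5, 0, 0, 0), (0, 2, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)]
print(classify(local_speed(tetrad, (1, 1, 0, 0))))
print(classify(local_speed(tetrad, (1, 4, 0, 0))))
```

With this sample tetrad, the direction (1, 1, 0, 0), which would look lightlike on the bare coordinate grid, is timelike for the local frame, in line with the remark that the global coordinates carry no physical meaning by themselves.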

When is a timelike curve f : I → GFR called a timelike geodesic? First
we have to check whether the curve f represents (or measures) relativistic time
correctly. Here, the parametrization will be important. In what follows, [a, b]
denotes the closed interval of R with endpoints a, b, i.e. [a, b] := {x ∈ R :
a ≤ x ≤ b}. For the definition of relativistic distance rd : R^4 × R^4 → R see
Sec. 2.6, p. 656.


DEFINITION 11.20 We say that f represents time correctly if the following
statement holds. For every t ∈ I, and for every positive ε, there is a positive
δ such that for all s ∈ [t − δ, t + δ] it holds that |s − t| agrees with
what rd(f(s), f(t)) is as measured by the local frame determined by L_{f(t)}, up
to an error bounded by ε · |s − t|. Here “rd(f(s), f(t)) as measured by the
local frame” is the relativistic (Minkowski) distance between L_{f(t)}^{-1}(f(s)) and
L_{f(t)}^{-1}(f(t)) understood in special relativity. Formally this is:

\[
(\forall t \in I)(\forall \varepsilon > 0)(\exists \delta > 0)(\forall s \in [t-\delta,\, t+\delta])\quad
\Bigl|\, rd\bigl(L_{f(t)}^{-1}(f(s)),\, L_{f(t)}^{-1}(f(t))\bigr) - |s-t| \,\Bigr| \le \varepsilon \cdot |s-t| .
\]


We call a curve time-faithful iff it is timelike and represents relativistic time 
correctly. Intuitively, a timelike curve is time-faithful iff at each point p of the 
curve, the local frame at p “sees” an observer co-moving with the curve such 
that the parametrization of the curve “agrees” with how time passes for this 
co-moving observer. See Fig. 11.26. 
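
The condition of Def. 11.20 can be probed numerically in the simplest possible case. The sketch below makes two assumptions that are not taken from this section: the space-time is flat, so that each L_p is the translation x ↦ x + p, and rd(a, b) is the square root of the absolute value of the Minkowski squared distance of a and b (the actual definition is in Sec. 2.6). The function name `max_relative_error` is illustrative. A numerical sample is of course no proof; it only shows what the ε-bound is measuring.

```python
import math


def rd(a, b):
    """Assumed relativistic distance: sqrt of |Minkowski squared distance|, signature (+,-,-,-)."""
    d = [x - y for x, y in zip(a, b)]
    mu = d[0] ** 2 - d[1] ** 2 - d[2] ** 2 - d[3] ** 2
    return math.sqrt(abs(mu))


def max_relative_error(f, t, delta, samples=50):
    """Largest | rd(f(s), f(t)) - |s - t| | / |s - t| over sampled s in [t - delta, t + delta]."""
    worst = 0.0
    for k in range(-samples, samples + 1):
        if k == 0:
            continue  # s = t satisfies the condition trivially
        s = t + delta * k / samples
        # In the flat case L_{f(t)} is a translation, and translations cancel inside rd,
        # so rd(L^{-1}(f(s)), L^{-1}(f(t))) equals rd(f(s), f(t)).
        err = abs(rd(f(s), f(t)) - abs(s - t)) / abs(s - t)
        worst = max(worst, err)
    return worst


def by_proper_time(t):
    # Observer moving with speed 0.6, parameterized by wristwatch time (gamma = 1.25).
    return (1.25 * t, 0.75 * t, 0.0, 0.0)


def by_coordinate_time(t):
    # The same worldline, parameterized by coordinate time instead.
    return (t, 0.6 * t, 0.0, 0.0)


print(max_relative_error(by_proper_time, 0.0, 0.1))      # stays near zero
print(max_relative_error(by_coordinate_time, 0.0, 0.1))  # stays near 0.2, however small delta is
```

Both curves have the same range and both are timelike, but only the first parametrization is time-faithful; this is the sense in which the parametrization matters in Def. 11.20.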











Figure 11.26. A time-faithful curve at p. (Labels in the figure: LFR, GFR, co-moving inertial observer.)
We imagine that a time-faithful curve f is the worldline of an observer b such
that the parameter t measures proper time of b; or in other words, t shows the
time on the wristwatch of b. We imagine the motion of b such that f(t) in GFR is
the space-time location of observer b at his wristwatch time t. The condition in
Def. 11.20 serves to ensure that wristwatch time t of observer b (whose motion
is represented by the curve f) agrees with the relativistic time interval measured
by the relativistic metric rd at the local frame which is situated at the location
f(t). More precisely, small time intervals on the wristwatch of b agree with
the relativistic time interval measured by the rd’s of the local special relativity
frames. Thus f is the general relativistic analogue of the special relativistic
wline_m(b) + parametrization with “proper time” or “inner time” of b.

Put differently, we think that a timelike curve can be the worldline of (the
mass-center of) a spaceship which uses fuel (i.e. uses its ship-drive) for
accelerating and decelerating. The curve is time-faithful if the parametrization of
the curve agrees with “inner time” of the spaceship.


DEFINITION 11.21 (TIMELIKE GEODESIC) By a timelike geodesic we understand
a time-faithful curve f : I → GFR satisfying the following. For any
t in I there is a neighborhood S of f(t) (understood in R^4) such that inside S,
f is a “straightest possible” curve in the following sense: For any two points
p, q of S connected by f, the distance of p and q as measured by f is maximal
among the distances measured by “competing” time-faithful curves inside S.

Formally, this maximality condition is expressed by the following. Assume
that h is a time-faithful curve with range inside S. Assume that p = f(s) =
h(s'), q = f(r) = h(r') and h(t') is in S for all t' which are between s' and
r'. Then |s − r| ≥ |s' − r'|. See Fig. 11.27.


If f is a timelike geodesic, then we imagine that an inertial observer b can 
move along it. By an inertial observer b we imagine (the mass-center of) a 
spaceship with ship-drive switched off, i.e. a spaceship which does not use fuel 
for influencing its motion. 

The above definition of timelike geodesics is the natural reformulation of 
the Euclidean notion of geodesics (straight curves in a possibly curved surface 
of Euclidean 3-space) with “minimal” replaced by “maximal” (cf. Hawking 
and Ellis, 1973, Proposition 4.5.3, p. 105). If one thinks about this maximality 
condition, one will find that it is strongly connected to the Twin Paradox of 
special relativity. Indeed, the Twin Paradox is exactly the statement that in 
special relativity, worldlines of inertial observers are timelike geodesics in the 
sense of Def. 11.21, cf., e.g., Madarász et al., 2006b, Theorem 3.1. 

By the above definition of timelike geodesics, we can express what worldlines
of inertial observers are in GR space-times (M, L). The definition of
photonlike geodesics is analogous, and goes as follows.

Figure 11.27. f is a timelike geodesic curve.


The curve f is called photonlike at p iff the speed of f as seen by the local
frame at p equals 1. In more detail, this means that the tangent of the curve
L_p^{-1} ∘ f at the origin has slope 1. The curve f is called photonlike iff the curve
f is photonlike at each of its points p.

We imagine that a photonlike curve can be the worldline of a photon perhaps 
directed (diverted) by suitably many mirrors. 

A photonlike geodesic is a photonlike curve f with the property that each
point in the curve has a neighborhood in which f is the unique photonlike curve
through any two points of f. (In more detail, let F denote the range of f. Then
any point in F has a neighborhood S such that whenever f' is a photonlike
curve connecting two points of F ∩ S and such that F', the range of f', is inside
S, we have that F' ⊆ F, cf. Hawking and Ellis, 1973, Proposition 4.5.3.)

We imagine that photonlike geodesics are worldlines of photons. Let us 
notice at this point that a GR space-time (M, L) determines “inertial motion” 
and also determines how photons move. 


3.4 The global grid seen with the eyes of the local grids: 
general relativistic space-time in metric-tensor field 
form 


In the previous subsection, we imported special relativistic notions by using
the local frames, the L_p’s. We used the inverse L_p^{-1} of the affine transformation
L_p to translate our general relativistic “problems”, or “questions”, to special
relativity, and then we used L_p to bring back the answers special relativity gave
us. Since we will use the notions of special relativity this way all the time,
it is useful to “transport”, via L_p, the most useful notions of special relativity
themselves to our general relativistic frame GFR to be ready for use when we
need them. Thus, in each point p of GFR, we can “store”, e.g., the relativistic
squared distance μ of special relativity transported via L_p; we will denote this
by μ_p and we will call it the “local special relativistic squared distance μ at
p”. This way we will get “fields” of notions, where we use the word “field” as
in analysis and not as in algebra (as mentioned on p. 667). Special relativistic
squared distance (i.e. Minkowski distance) μ and scalar product g_μ were defined
on p. 638, p. 656. Here we use the simplified version g(p, q) = g_μ(p, 0, q, 0)
of g_μ as introduced on p. 657.


DEFINITION 11.22 (FIELDS OF “TRANSPORTED” SPECIAL RELATIVITY
NOTIONS) Let (M, L) be a general relativistic space-time. For any p ∈ M we
define μ_p : R^4 × R^4 → R as follows: for any q, r ∈ R^4,

\[ \mu_p(q,r) := \mu\bigl(L_p^{-1}(q),\, L_p^{-1}(r)\bigr); \]

and similarly we define

\[ g_p(q,r) := g\bigl(L_p^{-1}(q),\, L_p^{-1}(r)\bigr). \]

See Fig. 11.28.











Figure 11.28. The metric μ_p of local special relativity at p. d = μ_p(q, r). (Labels in the figure: LFR, GFR.)


In the literature, the most often used form for specifying a GR space-time is by
transporting the Minkowski scalar product g to each point p of GFR. The reason
for this is that it is easy to make calculations with these data. From (M, L) we then
get (M, g_p)_{p ∈ M}, usually just written as (M, g). We call g := ⟨g_p : p ∈ M⟩
the metric-tensor field of (M, L), and we call (M, g) the metric-tensor field
form of (M, L).

Since g is linear in its two arguments, the most convenient way of specifying
g_p : R^4 × R^4 → R is to specify g_p(1_i + p, 1_j + p) for all 1 ≤ i, j ≤ 4:

\[ g_{ij}(p) := g_p(1_i + p,\, 1_j + p) = g\bigl(L_p^{-1}(1_i + p),\, L_p^{-1}(1_j + p)\bigr), \quad \text{for all } 1 \le i, j \le 4 . \]
Then we can specify (M, g) by associating the 4 by 4 matrix (g_{ij}(p) : 1 ≤
i, j ≤ 4) to each point p ∈ M. What is the meaning of the g_{ij}(p)’s? Well,
√|g_{ii}(p)| tells us how long the i-th unit-vector of the big grid GFR is in the eye
of the local special relativity frame at p. On the other hand, g_{ij}(p) for i ≠ j tells
us what the “angle” between the unit-vectors 1_i and 1_j of the big GFR is as seen by
the local special relativity at p. If g_{ij}(p) = 0, then the local special relativity
at p (also) thinks that 1_i and 1_j are orthogonal to each other. If g_{tt}(p) = 1,
then time at the local special relativity at p flows just as in the GFR grid. If,
say, g_{tt}(p) = 4, then two hours pass in the local special relativity at p while
in the big grid only one hour passes, hence local special relativity LFR-time
at p is twice as fast as coordinate GFR-time. In general, g_{tt}(p) > 0 tells us
how the local special relativity sitting at p sees “time of the coordinate grid
GFR” to flow (how much slower or faster). If g_{tt}(p) is negative, then the local
special relativity at p “sees” the time-axis of the GFR as a spatial direction, and
not as a “temporal direction”. This means that in the local special relativity
at p, no observer can “move/live” in the space-time direction 1_t of the GFR.
If g_{xx}(p) = −1, then the local special relativity at p sees that spatial distance
along the x-axis behaves like the one in the big GFR-grid. Also, g_{ii}(p) > 0 iff
1_i of GFR is timelike as seen by the LFR. Etc.
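
As an illustration of how the matrix of g at p arises from a local frame, the following sketch computes it from a tetrad. It assumes that g is the Minkowski scalar product with signature (+, −, −, −) on vectors ordered (t, x, y, z), and it uses the fact that the linear part of L_p^{-1} sends the i-th unit vector of GFR to the solution u of sum_k u_k · G_k(p) = 1_i. The function names and the sample tetrads are illustrative only.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [list(map(float, a[i])) + [float(b[i])] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular matrix: the tetrad is not linearly independent")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def metric_from_tetrad(tetrad):
    """Return the 4x4 matrix (g_ij(p)) for the local frame whose tetrad is G_t(p), ..., G_z(p)."""
    columns = [[tetrad[i][k] for i in range(4)] for k in range(4)]  # columns are the G_i(p)
    # u[i] is the local-frame image of the i-th unit vector of the global grid.
    u = [solve(columns, [1.0 if k == i else 0.0 for k in range(4)]) for i in range(4)]
    eta = (1.0, -1.0, -1.0, -1.0)  # assumed Minkowski signature
    return [[sum(eta[k] * u[i][k] * u[j][k] for k in range(4)) for j in range(4)]
            for i in range(4)]


# Local clock unit is half a coordinate hour; local x-rod is two coordinate units long.
stretched = [(0.5, 0, 0, 0), (0, 2, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)]
for row in metric_from_tetrad(stretched):
    print(row)

# A tilted time axis produces a nonzero off-diagonal entry g_tx.
tilted = [(1, 0.5, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)]
for row in metric_from_tetrad(tilted):
    print(row)
```

In the first example g_tt(p) comes out as 4, matching the reading given in the text: one coordinate hour corresponds to two hours of the local frame.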

By definition, μ_p(q, r) is the relativistic squared distance between q and r
as seen by the LFR at p, and μ_p can be expressed using the g_{ij}’s as follows:

\[ \mu_p(q,r) = \sum \{\, g_{ij}(p)\cdot(q_i - r_i)\cdot(q_j - r_j) : 1 \le i,j \le 4 \,\}. \]


The “infinitesimal version” of the above formula is called the line-element,
(⋆) below. In this chapter, we will use the line-element only as an economic
linguistic device for specifying the matrix (g_{ij}(p) : 1 ≤ i, j ≤ 4). Namely, for
p ∈ M, the line-element at p is

\[ (\star)\qquad ds^2(p) = \sum \{\, a_{ij}\, di\, dj : 1 \le i \le j \le 4 \,\}. \]

In the above, we consider ds^2, d1, ..., d4 as “specific linguistic markers”; the
information content of the line-element (⋆) above is

\[ g_{ii}(p) = a_{ii} \ \text{ for } 1 \le i \le 4, \quad \text{and} \quad g_{ij}(p) = g_{ji}(p) = \tfrac{1}{2}\, a_{ij} \ \text{ for } 1 \le i < j \le 4 . \]


E.g. if at p ∈ M the line-element is ds^2(p) = dt^2 − dx^2 − dy^2 − dz^2, then
g_{tt}(p) = 1, g_{xx}(p) = g_{yy}(p) = g_{zz}(p) = −1, and g_{ij}(p) = 0 for i ≠ j. For
more examples see Sec. 4. In Sec. 4 we will use the line-element in specifying
a given space-time (M, L), not only because of its economy, but also in order
to keep comparability with the literature. We want to emphasize that, in this
chapter, the line-element is just a convenient linguistic way of specifying g_p;
we will not attach independent meanings to ds^2, d1, ..., d4.
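
The reading of a line-element as a compact description of the matrix (g_{ij}(p)) is mechanical enough to be sketched in code. In the sketch below, the dictionary format for the coefficients and the function name `metric_from_line_element` are illustrative conventions, not something fixed by the chapter: each unordered pair of coordinate names is listed once, and a mixed coefficient a_{ij} is split evenly between g_{ij} and g_{ji}.

```python
def metric_from_line_element(coeffs, names=("t", "x", "y", "z")):
    """Turn the coefficients a_ij of a line-element at p into the symmetric matrix (g_ij(p)).

    `coeffs` maps pairs of coordinate names to numbers, e.g. ("t", "t") for the
    dt^2 term and ("t", "x") for the dt dx term; missing pairs count as 0.
    """
    index = {name: k for k, name in enumerate(names)}
    g = [[0.0] * len(names) for _ in names]
    for (a, b), value in coeffs.items():
        i, j = index[a], index[b]
        if i == j:
            g[i][i] = float(value)             # g_ii = a_ii
        else:
            g[i][j] = g[j][i] = value / 2      # g_ij = g_ji = a_ij / 2
    return g


# ds^2 = dt^2 - dx^2 - dy^2 - dz^2 (the example from the text)
minkowski = metric_from_line_element(
    {("t", "t"): 1, ("x", "x"): -1, ("y", "y"): -1, ("z", "z"): -1}
)
for row in minkowski:
    print(row)

# A made-up line-element with a mixed term: ds^2 = dt^2 + 2 dt dx - dx^2 - dy^2 - dz^2
mixed = metric_from_line_element(
    {("t", "t"): 1, ("t", "x"): 2, ("x", "x"): -1, ("y", "y"): -1, ("z", "z"): -1}
)
print(mixed[0][1], mixed[1][0])
```

The halving of mixed coefficients reflects that the term a_{ij} di dj with i < j accounts for both g_{ij} and g_{ji} in the symmetric matrix.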




Let (M, L) be a GR space-time and let G = (G_t, ..., G_z) and (M, g) be
its vector-fields and metric-tensor field forms, respectively. Now, G contains
the same information as (M, L), but (M, g) contains slightly less information.
However, as we will see in the next section, the really relevant “information” of
a space-time (M, L) is what is contained in (M, g). We use GR space-times in
the form (M, L) or G because these are easy to draw (visualize), see Figs. 11.30,
11.31, 11.33, 11.40, 11.34; while a GR space-time in the form of (M, g) is not
so convenient to draw.

From the perspective of the present subsection, in a GR space-time (G_t, ...,
G_z), the tetrad (G_t(p), ..., G_z(p)) tells us how the big GFR sees the local
special relativity unit-vectors of an arbitrarily chosen observer in the local
special relativity space-time that “sits” at p. On the other hand, the matrix
(g_{ij}(p) : 1 ≤ i, j ≤ 4) tells us how the local special relativity space-time
sitting at p “sees” the unit-vectors of the big global frame GFR!


3.5 Isomorphisms between general relativistic space-times 


We said that the big global frame carries no physical meaning, and that only
timelike and photonlike geodesics carry physical meaning; everything else
(e.g. gravity) can be defined from these geodesics. We give “meaning” to
this statement (or claim) by defining what isomorphisms of GR
space-times are.


DEFINITION 11.23 Let G = (M, L) and G' = (M', L') be two general
relativistic space-times. An isomorphism between these two GR space-times is
a bijection Iso : M → M' such that (i)-(iii) below hold.


(i) Both Iso and the inverse of Iso are smooth.


(ii) Iso preserves timelike geodesics. In more detail, for any curve f : I → M,
f is a timelike geodesic in G iff Iso ∘ f is a timelike geodesic in G'.


(iii) Iso preserves photonlike geodesics (in the above sense).


We note that we could omit (iii), because one can prove that it follows from 
(i)-(ii) above. 

By using Def. 11.23, it is difficult to check whether a bijection Iso : M → M'
is an isomorphism if we know only L and L' and we did not compute what
the geodesics are in (M, L) and in (M', L'). Below we give an equivalent
definition that uses only the “building blocks” L and L' of the general relativistic
space-times.

We will use the notion of the differential of a differentiable function. When
f : R^4 → R^4 is differentiable, the differential of f at p ∈ R^4 is the affine
transformation D(f)_p which is closest to f at p. Of the properties of a differential
we will mostly use that f and D(f)_p take p to the same point, and they take a
curve g passing through p to tangent curves, i.e. f(g) and D(f)_p(g) are tangent
at f(p).


THEOREM 11.24 (EQUIVALENT FORM OF DEFINITION OF ISOMORPHISMS)
Let G = (M, L) and G' = (M', L') be general relativistic space-times
and let (M, g) and (M', g') be their metric-tensor field forms, respectively.
Let Iso : M → M' be a smooth bijection such that its inverse is also smooth.
Then (i)-(iii) below are equivalent.


(i) Iso is an isomorphism between G and G' in the sense of Def. 11.23.


(ii) For any p ∈ M, the differential of L'(Iso(p))^{-1} ∘ Iso ∘ L(p) at the origin
is a Lorentz transformation (on LFR) perhaps composed with a space-isometry.
See Fig. 11.29.


(iii) For any p, q, r ∈ M we have that

\[ g_p(q,r) = g'_{Iso(p)}\bigl(D(Iso)_p(q),\, D(Iso)_p(r)\bigr). \]








Figure 11.29. Isomorphism between general relativistic space-times. 


In connection with Theorem 11.24(ii) above we note the following. Here,
the general pattern is $L'(\mathrm{Iso}(p))^{-1} \circ \mathrm{Iso} \circ L(p) : \mathrm{LFR} \to \mathrm{LFR}$ because
$L(p) : \mathrm{LFR} \to M$, $\mathrm{Iso} : M \to M'$, and $L'(\mathrm{Iso}(p))^{-1} : M' \to \mathrm{LFR}$. (More
precisely, $L(p) : \mathrm{LFR} \to \mathbb{R}^4 \supseteq M$ etc., but that does not matter here.) Recall
that $\mathrm{LFR} = \mathbb{R}^4$ refers to the special relativistic frame we are using (in the
present section).
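
Condition (iii) of Theorem 11.24 can be tried out numerically in the simplest possible case. The following is a minimal sketch, assuming that both space-times are 2-dimensional Minkowski space-time (so the metric is the constant form diag(1, -1) at every point) and that Iso is linear, in which case its differential at every point is Iso itself. The function names and the sample velocity 0.6 are illustrative choices and do not come from the text.

```python
import math

def minkowski(q, r):
    """Minkowski form g(q, r) = q_t * r_t - q_x * r_x in 2 dimensions."""
    return q[0] * r[0] - q[1] * r[1]

def boost(v):
    """Return the Lorentz boost with velocity v (|v| < 1) as a function on R^2."""
    gamma = 1.0 / math.sqrt(1.0 - v * v)
    def apply(p):
        t, x = p
        return (gamma * (t + v * x), gamma * (x + v * t))
    return apply

def preserves_metric(linear_map, q, r, tol=1e-12):
    """Check g(q, r) == g(D q, D r) for a linear map (which is its own differential)."""
    return abs(minkowski(q, r) - minkowski(linear_map(q), linear_map(r))) < tol

def stretch(p):
    """Doubling the time coordinate: a smooth bijection that is not an isometry."""
    return (2.0 * p[0], p[1])

q, r = (1.0, 0.0), (0.3, -2.0)
print(preserves_metric(boost(0.6), q, r))  # the boost passes the test of (iii)
print(preserves_metric(stretch, q, r))     # the stretch does not
```

A single pair (q, r) of course only illustrates the condition; (iii) requires it for all p, q, r.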

By the above, we have the basic building blocks of General Relativity in
place and can start discussing black holes, wormholes, the Expanding
Universe, etc. So, from here the reader can jump right to
Sec. 4, which discusses examples of GR space-times, e.g. black holes. In the
next subsection we present a FOL axiomatization of GR space-times which is
analogous to Specrel and which can serve as a starting point for logic-oriented
investigations of GR similar to the ones in Sec. 2.

In Sec. 3.6 we will use the following. 


DEFINITION 11.25 A Lorentz manifold is a system of GR space-times connected
by commuting partial isomorphisms. I.e. a Lorentz manifold is a system
$((G_m, \psi_{mk}) : m, k \in J)$ such that for all $m, k \in J$ the following hold.


(i) $G_m = (M_m, g_m)$ is the metric-tensor field form of a GR space-time $(M_m, L_m)$
in the sense of Def. 11.22, except that we do not assume smoothness of
$L_m$, we only assume that $g_m$ is smooth.


(ii) $(\psi_{mk} : m, k \in J)$ is a commuting system of smooth functions with open
domains. Commuting means that $\psi_{kh} \circ \psi_{mk} \subseteq \psi_{mh}$ for all $m, k, h \in J$.


(iii) $\psi_{mk}$ is a partial isomorphism between $G_m$ and $G_k$. This means that $\psi_{mk}$
is an isomorphism between $G_m$ and $G_k$ restricted to the domain and the
range of $\psi_{mk}$, respectively, in the sense of Theorem 11.24(iii).


Thus, a Lorentz manifold could be called an “organized system of GR
space-times” or a “patched space-time”. It is not difficult to show that the above
definition of Lorentz manifolds is equivalent to the usual definition in the
literature, e.g. to the one in Wald, 1984, pp. 12, 23.


3.6 Axiomatization Genrel of general relativistic 
space-time in first-order logic 


In this subsection we give an axiom system called Genrel for general
relativistic space-times. This axiom system is formulated in first-order logic and is
analogous to Specrel, even at the level of the individual axioms.

The vocabulary (or language) of GR space-time models is the same as that in
the case of special relativity, with the same intuition as in Sec. 2.1; the motivation
for the axioms in Genrel is in Secs. 3.2, 3.3. In the present subsection we will
use arbitrary ordered fields and arbitrary dimensions $n \ge 2$ for the space-time
models as in Sec. 2 (and not only $\mathbb{R}$ and $n = 4$). We will use the same notation
as in Sec. 2, e.g. we will use $ev_m$, PLines, etc.

Convention: The elements of Ob are called inertial observers. The elements
in the domain $\mathrm{Dom}(W) = \{b : (\exists k, p)\, W(b, k, p)\}$ of $W$ are called (ordinary)
observers.

We will no longer require that an observer $m$ uses the whole coordinate
grid $Q^n$ for coordinatizing the events; the part he uses is denoted by $Cd(m)$,
and is defined as $Cd(m) := \{p \in Q^n : ev_m(p) \ne \emptyset\}$. If $p, q \in \bar{t}$, then
$[p, q] := \{r \in \bar{t} : p_t \le r_t \le q_t\}$.

AxSelf⁻: An observer $m$ in his own coordinate system is motionless in the
origin, and his worldline is connected, i.e.

$$(\forall m \in B)\bigl[\, wline_m(m) = \bar{t} \cap Cd(m) \ \wedge\ (\forall p, q \in wline_m(m))\ [p, q] \subseteq wline_m(m) \,\bigr].$$


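We formalize when two subsets $h$ and $g$ of $Q^n$ are tangent at $p \in Q^n$:
tangent$(h, g, p)$ means that

$$p \in h \cap g \ \text{ and } \ (\forall \varepsilon > 0)(\exists \delta > 0)(\forall s \in [p_t - \delta, p_t + \delta])(\forall q \in h)(\forall r \in g)\ [\, q_t = r_t = s \ \Rightarrow\ |q - r| \le \varepsilon \cdot |s - p_t| \,].$$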
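
The quantifier pattern in tangent(h, g, p) says that the gap between the two sets, divided by the time-distance from p, can be made as small as we like by shrinking the time window around p. The following is a rough numerical sketch of that ratio, assuming n = 2, floating-point numbers in place of the ordered field Q, and curves given as graphs x = h(t) and x = g(t) that meet at time p_t. The helper name `tangency_ratio` is hypothetical.

```python
def tangency_ratio(h, g, p_t, delta, samples=1000):
    """Largest |h(s) - g(s)| / |s - p_t| over sampled s in [p_t - delta, p_t + delta]."""
    worst = 0.0
    for i in range(samples + 1):
        s = p_t - delta + 2.0 * delta * i / samples
        if s == p_t:
            continue  # at s = p_t both sides of the inequality are 0
        worst = max(worst, abs(h(s) - g(s)) / abs(s - p_t))
    return worst

def parabola(s):
    return s * s

def time_axis(s):
    return 0.0

def diagonal(s):
    return s

# Parabola versus time axis: the ratio shrinks with delta, so they are tangent at 0.
for delta in (0.1, 0.01, 0.001):
    print(delta, tangency_ratio(parabola, time_axis, 0.0, delta))

# Diagonal versus time axis: the ratio stays near 1, so no epsilon < 1 can be met.
print(tangency_ratio(diagonal, time_axis, 0.0, 0.001))
```

Sampling can only suggest tangency; the definition itself quantifies over all s in the interval.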





AxPh⁻: An inertial observer $m$ at the origin, where he stands, sees photons
move in each direction with speed 1, and each photon meeting $m$ moves with
speed 1, i.e.

$$(\forall m \in Ob)\bigl[\, (\forall \ell \in PLines)(\forall p \in \ell \cap wline_m(m))(\exists ph \in Ph)\ \mathrm{tangent}(\ell, wline_m(ph), p) \ \wedge\ (\forall ph \in Ph)(\forall p \in wline_m(ph) \cap wline_m(m))(\exists \ell \in PLines)\ \mathrm{tangent}(\ell, wline_m(ph), p) \,\bigr].$$





AxThEx⁻: An inertial observer $m$ at the origin, where he stands, sees inertial
observers move in each direction with speeds $< 1$, and sees at least one inertial
observer in each event, i.e.

$$(\forall m \in Ob)(\forall \ell \in TLines)(\forall p \in \ell \cap wline_m(m))(\exists k \in Ob)\ \mathrm{tangent}(\ell, wline_m(k), p) \ \text{ and } \ (\forall p \in Cd(m))(\exists k \in Ob)\ k \in ev_m(p).$$








AxSelf⁻, AxPh⁻, AxThEx⁻ express that an inertial observer experiences
special relativity in the space-time location where he is. Next we formalize
a generalization of AxLine to general relativity. It will say that in each iner- 
tial observer’s worldview, the worldlines of inertial observers and photons are 
timelike and photonlike geodesics, respectively. In formulating this axiom, we 
will follow the definitions given in Sec. 3.3. We quantified over curves in the 
definition of geodesics. Since we want to use the language of first-order logic, 
instead of arbitrary (smooth) curves, we will quantify over bodies representing 
special curves; namely 3 times continuously differentiable curves which can 
be defined by first-order logic formulas. This will be the only difference in the 
definition. 

Let us call a curve r-smooth if it is r-times continuously differentiable. In
general relativity, it is enough to use 3-smooth curves in place of arbitrarily
smooth curves. E.g. for defining curvature, the Riemann tensor, etc., one needs
only 3-smooth ingredients in place of smooth ingredients.

Let $\psi$ be a first-order logic formula in our present vocabulary, and assume that
the free variables of $\psi$ are among $t, x_1, \ldots, x_n, y_1, \ldots, y_r$ where $t, x_1, \ldots, x_n$
are variables of sort $Q$. We will denote this assumption as $\psi = \psi(t, \bar{x}, \bar{y})$. We
can easily express in first-order logic that, at parameter $\bar{y}$, the formula $\psi$ defines
a 3-smooth curve; we will denote this formula by curve$(\psi)$. To give a flavor
for this definition, we start formulating curve$(\psi)$.








fn$(\psi)$ denotes the formula $\forall t\,(\exists \bar{x}\, \psi(t, \bar{x}, \bar{y}) \to \exists! \bar{x}\, \psi(t, \bar{x}, \bar{y}))$. This expresses
that $\psi$ defines a (partial) function at parameter $\bar{y}$. When this is the case,
we will denote by $\psi(t)$ the value of this function at $t$. In a similar way,
we can express that the domain of the function defined by $\psi$ is an open
interval.





vel$(\psi, t) = v$ denotes the formula fn$(\psi) \wedge (\forall \varepsilon > 0)(\exists \delta > 0)(\forall s \in [t - \delta, t + \delta])\,(s \ne t \to |[(\psi(s) - \psi(t))/(s - t)] - v| < \varepsilon)$.
This formula expresses
that the velocity vector of the function defined by $\psi$ at $t$ is $v \in Q^n$. Then
we can express that the velocity vector changes with $t$ continuously, i.e.
the function defined by $\psi$ is 1-smooth. Similarly, we can express that it is
3-smooth. Let curve$(\psi)$ denote that $\psi$ defines a 3-smooth function, and
the domain of this function is an open interval. We note that vel$(\psi, t)$ is
the tangent-vector of the curve $\psi$ at $t$. The length of this vector depends
on the parametrization of the curve.
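
The formula for vel is the usual epsilon-delta statement that the difference quotient converges to v. A small numerical sketch of this, assuming floating-point numbers in place of the field Q and a hypothetical example curve in Q^2 (a parabola, chosen only for illustration):

```python
import math

def difference_quotient(curve, t, s):
    """(curve(s) - curve(t)) / (s - t), computed componentwise; requires s != t."""
    return tuple((a - b) / (s - t) for a, b in zip(curve(s), curve(t)))

def distance(u, v):
    """Euclidean distance between two vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def parabola(t):
    # Hypothetical curve: time coordinate t, space coordinate t^2 / 2.
    return (t, 0.5 * t * t)

t, v = 0.3, (1.0, 0.3)  # candidate velocity vector of the parabola at t
for delta in (1e-1, 1e-2, 1e-3):
    err = max(distance(difference_quotient(parabola, t, t + d), v)
              for d in (-delta, delta))
    print(delta, err)  # the error shrinks together with delta
```

For this curve the error at the ends of the window is delta / 2, so for a given epsilon any delta below 2 * epsilon witnesses the formula.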


By the worldcurve of observer $k$ in $m$’s worldview we understand the worldline
$wline_m(k)$ parameterized with the wristwatch time of $k$. We can define
this worldcurve by the formula $\gamma_{mk} = \gamma_{mk}(t, \bar{x}) = \gamma(t, \bar{x}, m, k)$ as follows:


$\gamma_{mk} := \gamma(t, \bar{x}, m, k)$ denotes the formula “$w_{km}(t, \bar{0}) = \bar{x} \wedge W(k, k, t, \bar{0})$”.


Intuitively, $\gamma_{mk}$ holds for $t, \bar{x}$ iff $m$ sees $k$ present at coordinates $\bar{x}$ such that
$k$’s wristwatch shows $t$ when $k$ is present at $\bar{x}$. Below we use $\gamma_{mk}(t)$ as a
function of $t$.

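Finally, we express that $\psi = \psi(t, \bar{x}, \bar{y})$ defines a time-faithful curve, or that
$\psi$ defines a photonlike curve, respectively as


timef$(\psi)$ denotes the formula “curve$(\psi) \wedge \forall t\,(\exists \bar{x}\, \psi(t, \bar{x}) \to
(\exists k \in Ob)(\exists s \in Q)[\, \psi(t) = \gamma_{mk}(s) \wedge \mathrm{vel}(\psi, t) = \mathrm{vel}(\gamma_{mk}, s) \,])$”.


phot$(\psi)$ denotes the formula “curve$(\psi) \wedge \forall t\,(\exists \bar{x}\, \psi(t, \bar{x}) \to
(\exists ph \in Ph)[\, \psi(t) \in wline_m(ph) \wedge \mathrm{tangent}(\psi, wline_m(ph), \psi(t)) \,])$”.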


To be able to quantify conveniently over parametrically defined timelike and
photonlike curves, we will use the following axiom schema, which is an analogue
of the Comprehension axiom schema in Set Theory. Below, we are thinking of
$\psi(t, \bar{x})$ as defining a curve, hence $t \in \mathrm{Dom}\,\psi$ abbreviates the formula $\exists \bar{x}\, \psi(t, \bar{x})$,
or equivalently, $\exists \bar{x}\, \psi(t, \bar{x}, \bar{y})$. We systematically do not indicate $\bar{y}$ because in
$\exists \bar{x}\, \psi(t, \bar{x}, \bar{y})$ the variables in $\bar{y}$ are free variables; they remain free variables
while we use $\psi$ in building up new formulas like $Ax\exists_\psi$ below, and eventually, in
postulating the axioms, all free variables become universally quantified. Hence
e.g. $Ax\exists_\psi$ looks like $\forall \bar{y}\,(\ldots \psi(t, \bar{x}, \bar{y}) \ldots)$.


In each inertial observer’s worldview, the parametrically definable time-faithful
curves are worldcurves of (not necessarily inertial) observers; and the
photonlike curves are worldlines of bodies. Formally: Let $\psi(t, \bar{x}, \bar{y})$ be a formula.
Then


$Ax\exists_\psi$: $(\mathrm{timef}(\psi) \to (\forall m \in Ob)(\exists b \in B)(\forall t \in \mathrm{Dom}\,\psi)\ \psi(t) = \gamma_{mb}(t)) \ \wedge$
$(\mathrm{phot}(\psi) \to (\forall m \in Ob)(\exists b \in B)\ \{\psi(t) : t \in \mathrm{Dom}\,\psi\} = wline_m(b))$.


COMPR $:= \{Ax\exists_\psi : \psi$ is a formula of our vocabulary$\}$.


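For any $p \in Q^n$ and $\varepsilon > 0$ let $S(p, \varepsilon) := \{q \in Q^n : |q - p| < \varepsilon\}$, the open
ball (or sphere) of radius $\varepsilon$ with center $p$. We now can formulate


“$\gamma_{mk}$ is a timelike geodesic” iff

$$\mathrm{timef}(\gamma_{mk}) \wedge (\forall p \in wline_m(k))(\exists \varepsilon > 0)(\forall q, r \in wline_m(k) \cap S(p, \varepsilon))(\forall b \in B)\bigl[\, \mathrm{timef}(\gamma_{mb}) \wedge wline_m(b) \subseteq S(p, \varepsilon) \wedge q = \gamma_{mb}(t') = \gamma_{mk}(t) \wedge r = \gamma_{mb}(s') = \gamma_{mk}(s) \ \to\ |t - s| \ge |t' - s'| \,\bigr].$$

“$wline_m(ph)$ is a photonlike geodesic” can be expressed analogously (see p. 671
where photonlike geodesics were defined for $(M, L)$).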





AxLine⁻: In each inertial observer’s worldview, the worldlines of inertial
observers and photons are geodesics. Formally:


$(\forall m, k \in Ob)$ “$\gamma_{mk}$ is a timelike geodesic” and
$(\forall m \in Ob)(\forall ph \in Ph)$ “$wline_m(ph)$ is a photonlike geodesic”.


Now we turn to formulating a generalization of AxEvent. It expresses that if 
m observes k participate in an event, then k himself “sees” this event. Further, 
if k sees an event that m sees, then k sees all events which occur “near” this 
event in m’s worldview. 


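AxEvent⁻:

$$(\forall m, k \in Ob)\bigl[\, (\forall p \in Q^n)(k \in ev_m(p) \Rightarrow (\exists q \in Q^n)\ ev_k(q) = ev_m(p)) \ \wedge\ (\mathrm{Dom}(w_{mk}) \text{ is open and } w_{mk} \text{ is a 3-smooth function}) \,\bigr].$$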





To formulate a generalization of AxSim, we will use a variant of AxSim that
works for $n = 2$, too. For more on this variant we refer to Andréka et al., 2002,
Secs. 2.8, 3.9, Andréka et al., 2006b, p. 162, Andréka et al., 1999.

AxSim⁻: Any two inertial observers see each other’s wristwatches run slow
with the same ratio when they meet:

$$(\forall m, k \in Ob)(\forall t, s \in Q)\bigl[\, ev_m(\gamma_{mk}(t)) = ev_k(\gamma_{km}(s)) \ \Rightarrow\ |\mathrm{vel}(\gamma_{mk}, t)| = |\mathrm{vel}(\gamma_{km}, s)| \,\bigr].$$


To be able to use the notions of continuity and differentiability etc. in arbitrary 
fields in place of R properly, we need the axiom schema of Continuity. Reasons 
and details for this can be found e.g. in Madarász et al., 2006b, Goldblatt, 1987, 
van Benthem, 1983, p. 29. 


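AxSup$_\psi$ is a formula expressing that every subset of $Q$ defined by $\psi(t, \bar{y})$
with parameter $\bar{y}$ has a supremum if it is non-empty and bounded.
Formally AxSup$_\psi$ is:

$$\exists t'\, \forall t\,[\psi(t, \bar{y}) \to t \le t'] \ \to\ \exists t'\bigl(\forall t\,[\psi(t, \bar{y}) \to t \le t'] \ \wedge\ \forall t''\,(\forall t\,[\psi(t, \bar{y}) \to t \le t''] \to t'' \ge t')\bigr).$$

CONT $:= \{$AxSup$_\psi$ : $\psi$ is a formula in our vocabulary$\}$.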


The formula schema CONT above is a variant of Tarski’s first-order logic
version of Hilbert’s axiom of continuity in his axiomatization of Euclidean
geometry. It is also strongly related to the induction axiom schema in the
dynamic logic of actions in the sense of Sain, 1986, Andréka et al., 1982.


Genrel := {AxSelf⁻, AxLine⁻, AxThEx⁻, AxPh⁻, AxEvent⁻, AxSim⁻}
∪ {AxField} ∪ CONT ∪ COMPR.


As a first theorem we state that special relativity is the special case of general
relativity where the worldlines of all observers and photons are straight lines,
and where the Light Axiom holds. We
conjecture that a much weaker axiom suffices in place of the Light Axiom.


THEOREM 11.26 Genrel ∪ {AxLine, AxPh} is equivalent to Specrel ∪ CONT
∪ COMPR in the sense that they have the same models, if $n \ge 3$.


Proof outline Assume Genrel + AxLine + AxPh. First one proves that
$(\forall m \in Ob)\ Cd(m) = Q^n$, by AxLine. Then AxEvent can be proved from
AxLine, AxPh and AxEvent⁻ along the lines of the proof in Andréka et al.,
2002, pp. 98-100. Now the axiom AxEOb used in Madarász et al., 2004 holds, so
one gets that the worldview transformations are affine mappings, by Madarász
et al., 2004, Theorem 1. It is proved in Andréka et al., 2002, Theorem 3.9.11 that
AxSim⁻ implies AxSim when the worldview transformations are affine. This
proves one direction of Theorem 11.26. In the other direction we have to prove
in an axiomatic setting that timelike and photonlike straight lines are timelike
and photonlike geodesics, respectively. This is done, basically, in Madarász
et al., 2006b, Theorem 3.1. QED

Next we show that the metric-geometric forms of the models of Genrel are
Lorentz manifolds and each Lorentz manifold is the metric-geometric form of
a model of Genrel. This will be the analogue of Theorem 11.16 in Sec. 2.6.


DEFINITION 11.27 Let $\mathfrak{Q} = (Q, +, \cdot, <)$ be an ordered quadratic field. By
a 3-smooth n-dimensional Lorentz manifold over $\mathfrak{Q}$ we understand
$((M_m, g_m, \psi_{mk}, \mathfrak{Q}) : m, k \in J)$ where $(((M_m, g_m), \psi_{mk}) : m, k \in J)$ is a Lorentz
manifold in the sense of Def. 11.25 with the following changes:


(a) in place of $\mathbb{R}^4$ we use $Q^n$, and


(b) in place of requiring the metric-tensor fields $g_m$ and the partial
isomorphisms $\psi_{mk}$ to be smooth, we only require that they be 3-smooth.


The following theorem says that the models of Genrel are exactly the 3- 
smooth Lorentz manifolds over ordered real-closed fields, up to ignoring some 
“decorations” on the Genrel side. Theorem 11.28 can be considered as a com- 
pleteness theorem for Genrel. 


THEOREM 11.28 (COMPLETENESS THEOREM FOR Genrel) Assume
$n \ge 3$.


(i) There is a theory Comp⁻ analogous to Comp such that Genrel ∪ Comp⁻
is definitionally equivalent with the class LM of all 3-smooth Lorentz
manifolds over ordered real-closed fields.


(ii) There is a definable function Lm that maps the class of all models of Genrel
onto the class LM. Moreover, if $\mathfrak{M} \models$ Specrel then $Lm(\mathfrak{M})$ is
definitionally equivalent with the Minkowski geometry $Mg(\mathfrak{M})$ of $\mathfrak{M}$.


The proof of Theorem 11.28 is based on the following proposition, which 
states that the models of Genrel are “locally special relativistic”. 


PROPOSITION 11.29 Assume that $\mathfrak{M} \models$ Genrel and $m, k \in Ob$. If $p \in
wline_m(k) \cap \bar{t}$ then $D(w_{mk})_p$ exists and it preserves relativistic (Minkowski)
distance $\mu$.


Proof outline Assume $p \in wline_m(k) \cap \bar{t}$. Then $k, m \in ev_m(p)$ by AxSelf⁻,
and so $p \in \mathrm{Dom}(w_{mk})$ and $F := D(w_{mk})_p$ exists by AxEvent⁻. By AxPh⁻
and the definition of $w_{mk}$, $F$ takes PLines and only PLines going through $p$
to PLines going through $p' := w_{mk}(p)$. Thus $F$ is a bijection and by AxSim⁻
it preserves $\mu$. QED


To give an idea for the proof of Theorem 11.28, we define $Lm(\mathfrak{M})$ for
$\mathfrak{M} \models$ Genrel. Let us fix a model $\mathfrak{M} = (Q, +, \cdot, <; B, Ob, Ph; W) \models$ Genrel
and let $\mathfrak{Q} = (Q, +, \cdot, <)$. Let $m \in Ob$ and $p \in Cd(m)$. Let $k \in ev_m(p) \cap Ob$
be arbitrary (such a $k$ exists by AxThEx⁻) and let $p' := w_{mk}(p)$. We define


$L_{p,k} := D(w_{km})_{p'} \circ T(p')$, where $T(p') : Q^n \to Q^n$ is “translation with $p'$”,

and $g_{p,k}$ is the metric-tensor field belonging to this $L_{p,k}$ as defined in Def. 11.22,
i.e. $g_{p,k}(q, r) = g(L_{p,k}^{-1}(q), L_{p,k}^{-1}(r))$ for all $q, r \in Q^n$.

It follows from Proposition 11.29 that, though $L_{p,k}$ depends on how we choose
$k \in ev_m(p)$, the metric-tensor $g_{p,k}$ does not depend on how we choose
$k \in ev_m(p)$. Therefore we will omit the index $k$ from the notation:


$g_p := g_{p,k}$, $g_m := (g_p : p \in Cd(m))$, $G_m := (Cd(m), g_m)$, and
$Lm(\mathfrak{M}) := ((Cd(m), g_m, w_{mk}, \mathfrak{Q}) : m, k \in Ob)$.


CLAIM 11.30 Assume $\mathfrak{M} \models$ Genrel. The following (i),(ii) hold.


(i) $Lm(\mathfrak{M})$ is a 3-smooth Lorentz manifold over $\mathfrak{Q}$, and $\mathfrak{Q}$ is an ordered
real-closed field.


(ii) The worldlines $wline_m(k)$, $wline_m(ph)$ of observers and photons in $m$’s
worldview are timelike and photonlike geodesics in $G_m$. Conversely,
any timelike geodesic $\ell$ is a worldline of an observer locally, i.e.
$(\forall p \in \ell)(\exists \varepsilon > 0)(\exists k \in Ob)\ \ell \cap S(p, \varepsilon) \subseteq wline_m(k)$. The same converse
holds for photonlike geodesics, too.

We outlined above that the geometry of each model of Genrel is a Lorentz
3-smooth manifold. The converse is also true: each Lorentz 3-smooth manifold
over an ordered real-closed field is isomorphic (as a manifold) to the geometry
$Lm(\mathfrak{M})$ of a model $\mathfrak{M}$ of Genrel.

The extension Comp⁻ for Genrel is completely analogous with Comp. A
typical axiom in Comp⁻ states that if the worldlines of two photons $ph_1$ and
$ph_2$ coincide for all observers, then $ph_1 = ph_2$. Further, the worldline of a
photon is a maximal photonlike geodesic. The point of the axioms in Comp⁻ is
to eliminate things like the multiplicity of otherwise indistinguishable objects
(like $ph_1$ and $ph_2$ above) which cannot be defined over LM because they are
indistinguishable. In some sense these statements are incarnations of Occam’s
razor.

We omit the rest of the idea for proof of Theorem 11.28. QED


What we call the worldview of an inertial observer $m \in Ob$ in Genrel
corresponds to “spaceships with ship-drive switched off”: the worldline of the
center of mass is a geodesic, but we did not care about whether the spaceship
rotates or not. One can base an axiomatization of GR on worldviews of the
so-called “local inertial frames” (LIF’s, cf. Rindler, 2001, pp. 177-179) which
correspond to nonrotating inertial spaceships. LIF’s reflect the local special
relativity more closely (e.g. they do not rotate). However, LIF’s can be defined
in models of Genrel and the axiomatization based on LIF’s would provide the
same geometrical entities $Lm(\mathfrak{M})$ behind the models as the present Genrel
does. The role of Einstein’s field equations in interpreting Genrel is touched
upon in Sec. 4.5.


4. Black holes, wormholes, timewarp. Distinguished 
general relativistic space-times 


In Sec. 3.6 we introduced the first-order logic theory Genrel of general
relativity. In such a situation, the next natural thing to do is to turn to the intended
models of the theory in question, and to discuss what these models look like.
We will do this in the present section by studying some of the intended
models of the theory Genrel. A difference between special relativity and
GR is that while the special theory had basically one intended model (namely
Minkowski space-time), the general theory has many nonisomorphic intended
models, as we will see below.

It will be convenient for us to study the models of Genrel in their geometric
forms. Hence we will speak about GR space-times $(M, L)$, but it is important
to remember that in Sec. 3.6 we saw that such a space-time $(M, L)$ is equivalent
to a Genrel model $\mathfrak{M} \models$ Genrel. So each one of the GR space-times in this
section represents a distinguished Genrel model, and discussing these will shed
some light on the semantic aspects of Genrel.

For discussing the models of Genrel we leave the realm of first-order logic
and work in mathematics proper. The reason is that, by Tarski’s
theorem, one cannot satisfactorily describe the semantics of a language $\mathcal{L}$ inside
the framework of $\mathcal{L}$ itself. To study the semantics of $\mathcal{L}$, we have to rise above
$\mathcal{L}$ and use the meta-language of $\mathcal{L}$. In our case, this meta-language is ordinary
mathematics (or equivalently Set Theory, say ZF).


4.1 Special relativity as special case of general relativity 


Minkowski space-time is $G_{sr} = (M, L)$ where $M = \mathbb{R}^4$ and $L(p)$ is
translation with $p$ (i.e. $L_p(q) = p + q$ for any $q \in \mathbb{R}^4$), for all $p \in \mathbb{R}^4$. In vector-fields
form this is $(G_1, \ldots, G_4)$ where $G_i(p) = 1_i$ for all $p \in \mathbb{R}^4$ and $1 \le i \le 4$, see
upper part of Fig. 11.30. In metric-tensor field form Minkowski space-time is
$(M, g)$ where the 4 by 4 matrix $(g_{ij}(p) : 1 \le i, j \le 4)$ at $p \in \mathbb{R}^4$ is

$$g(p) = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix},$$

i.e. the line-element is $ds^2 = dt^2 - dx^2 - dy^2 - dz^2$. The timelike and
photonlike geodesics in this space-time are the straight lines of slope $< 1$ and
of slope 1, respectively. The automorphisms (i.e. isomorphisms onto itself) of
Minkowski space-time $G_{sr}$ are exactly the possible worldview transformations
in a model of Specrel (where $(Q, \ldots) = (\mathbb{R}, \ldots)$), cf. Theorem 11.10.
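
The slope conditions above amount to the sign of the line-element evaluated on the direction of the line. A minimal sketch, assuming floating-point coordinates and a small tolerance for the borderline case; the function name `classify` is a hypothetical choice:

```python
def classify(direction, tol=1e-12):
    """Classify a direction (dt, dx, dy, dz) by the sign of dt^2 - dx^2 - dy^2 - dz^2."""
    dt, dx, dy, dz = direction
    ds2 = dt * dt - dx * dx - dy * dy - dz * dz
    if ds2 > tol:
        return "timelike"    # slope (speed) below 1
    if ds2 < -tol:
        return "spacelike"   # slope above 1
    return "photonlike"      # slope exactly 1

print(classify((1.0, 0.5, 0.0, 0.0)))
print(classify((1.0, 0.6, 0.8, 0.0)))
print(classify((0.0, 1.0, 0.0, 0.0)))
```

Straight lines in these directions are then timelike geodesics, photonlike geodesics, or neither, in the sense of the paragraph above.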


Worldview of a uniformly accelerated observer in Specrel: 


We let $n = 2$, for simplicity. Consider the following space-time $G_{ua} =
(M, L)$ where $M = \{(t, x) \in \mathbb{R}^2 : x > 0\}$ and $G_t(p) = (1/p_2, 0)$, $G_x(p) =
(0, 1)$ for all $p = (p_1, p_2) \in M$, cf. the lower part of Fig. 11.30. Thus the
line-element is

$$ds^2 = x^2\,dt^2 - dx^2, \quad \text{and the } g_{ij}\text{-matrix is} \quad g(t, x) = \begin{pmatrix} x^2 & 0 \\ 0 & -1 \end{pmatrix}.$$

In this space-time, as we approach the origin, local LFR clocks tick slower and
slower beyond any limit compared with coordinate GFR time, and local LFR
clocks tick faster and faster beyond any limit compared with coordinate GFR
time as we move away from the origin. On the other hand, local meter-rods do
not change along the x-axis: local LFR spatial distances agree with coordinate
GFR spatial distances. This space-time looks different from Minkowski
space-time $G_{sr}$, but in fact it is isomorphic to a sub-space-time of 2-dimensional $G_{sr}$.
The isomorphism denoted by Iso is represented in Fig. 11.30; it maps $M$ of
$G_{ua}$ bijectively onto $\{(t, x) : |t| < x\}$. The space-time $G_{ua}$ is the worldview
(or space-time) of a uniformly accelerated observer $k$ who lives in Minkowski
space-time, with uniform (relativistic) acceleration $a = 1$. (The space-time for
arbitrary uniform acceleration $a$ is given by $G_t(p) = (1/(a p_2), 0)$, $G_x(p) =
(0, 1)$.) One can think of $G_{sr}$ as the worldview of an inertial observer $m$ in
special relativity, and then Iso is the worldview transformation $w_{km}$ between
the worldview $G_{ua}$ of accelerated $k$ and the worldview $G_{sr}$ of $m$.
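
That Iso is an isomorphism can be checked numerically through the metric condition of Theorem 11.24(iii). The sketch below assumes acceleration a = 1 and the explicit formula Iso(t, x) = (x sinh t, x cosh t); this formula is our reading of the description of Iso in the caption of Fig. 11.30 (the point at relativistic arc-length x·t on the Minkowski-circle of radius x), not a formula stated in the text. Function names are illustrative.

```python
import math

def iso(t, x):
    """Assumed form of Iso for a = 1: rapidity t on the hyperbola of radius x > 0."""
    return (x * math.sinh(t), x * math.cosh(t))

def jacobian_columns(f, t, x, h=1e-6):
    """Central-difference approximations of the partial derivatives of f at (t, x)."""
    d_dt = [(a - b) / (2 * h) for a, b in zip(f(t + h, x), f(t - h, x))]
    d_dx = [(a - b) / (2 * h) for a, b in zip(f(t, x + h), f(t, x - h))]
    return d_dt, d_dx

def minkowski(u, v):
    """2-dimensional Minkowski form u_t * v_t - u_x * v_x."""
    return u[0] * v[0] - u[1] * v[1]

def pulled_back_metric(t, x):
    """Entries (g_tt, g_tx, g_xx) of the Minkowski form pulled back along iso."""
    d_dt, d_dx = jacobian_columns(iso, t, x)
    return minkowski(d_dt, d_dt), minkowski(d_dt, d_dx), minkowski(d_dx, d_dx)

print(pulled_back_metric(0.7, 2.0))  # close to (x**2, 0, -1) = (4, 0, -1)
```

The result agrees, up to numerical error, with the matrix of the line-element $ds^2 = x^2 dt^2 - dx^2$ of the accelerated observer, and the image of iso lies in the wedge |t| < x because |sinh| < cosh.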

$G_{ua}$ is rather similar to the exterior of the (2-dimensional tr-slice of the)
simplest black hole $G_{sp}$ below, which, in turn, is no longer partially isomorphic
to any open part of $G_{sr}$. Studying the simple space-time $G_{ua}$ of accelerated
observers can lead to a deeper understanding of the space-time $G_{sp}$ of the
important Schwarzschild black hole to which we turn now. This connection is
elaborated e.g. in the textbook Rindler, 2001, Secs. 3.7, 12.4.


[Figure 11.30 diagram. Upper part: Minkowski space-time with the worldline of a
uniformly accelerated observer k. Lower part: space-time $G_{ua}$ of uniformly accelerated observer.]


Figure 11.30. Isomorphism from the worldview of a uniformly accelerated observer to
Minkowski space-time. (The example in the text is given for uniform acceleration $a = 1$,
the figure is for $a = 1/3$.) Iso$(t, x)$, for $t > 0$, is the point $p$ on the Minkowski-circle $M(0, x)$
of radius $x$ and with center the origin such that the relativistic arc-length of $M(0, x)$ from $(0, x)$
to $p$ is $axt$. $1/(ax)$ is the length of $G_t$ at $x$.


4.2 The Schwarzschild black hole 


There is now overwhelming observational evidence for the existence of large
black holes in our universe. In the last 15 years astronomers have observed
them. At the same time, black holes have really fantastic properties. Black
hole physics is at the cutting edge of modern science.

There are many kinds of black holes; the Schwarzschild black hole is the
simplest one. We will recall more exotic ones in Sec. 4.3. Why is the Schwarzschild
black hole important? Here are four reasons: it is the simplest form of
relativistic gravity (all the mass is in one point), it is an idealization of the
gravitational space-time of our own Sun, it is a typical general relativistic space-time,
and many other GR space-times build on it.

Worldview of a suspended observer far away from the black hole: 

Schwarzschild (black hole) space-time is $G_{sp} := (M, L)$ where $M =
\{p \in \mathbb{R}^4 : |(p_2, p_3, p_4)| \ne 0, 1\}$ and $L$ is given as follows. For any $p =
(p_1, p_2, p_3, p_4) \in \mathbb{R}^4$ let $\bar{r} := \bar{r}(p) := (0, p_2, p_3, p_4)$, $r := |\bar{r}|$, and $1_r :=
1_r(p) := \frac{1}{r} \cdot \bar{r}$. Here, “r” stands for “radius”. Now $L$ is specified by the
following four vector-fields.


For $r > 1$:
$G_t(p) = \sqrt{\frac{r}{r-1}} \cdot 1_t$, $G_x(p) = \sqrt{\frac{r-1}{r}} \cdot 1_r$, the lengths of $G_y(p)$ and $G_z(p)$ are
1, and $G_t(p)$, $G_x(p)$, $G_y(p)$, $G_z(p)$ are pairwise orthogonal.

For $r < 1$:
$G_t(p) = \sqrt{\frac{1-r}{r}} \cdot 1_r$, $G_x(p) = \sqrt{\frac{r}{1-r}} \cdot 1_t$, the lengths of $G_y(p)$ and $G_z(p)$ are
1, and $G_t(p)$, $G_x(p)$, $G_y(p)$, $G_z(p)$ are pairwise orthogonal.
See Fig. 11.31.

$G_{sp}$ in metric-tensor form is the following. We use cylindric-polar
coordinates because they are more convenient (by spatial spherical symmetry of the
space-time). The line-element is

$$ds^2 = \left(1 - \tfrac{1}{r}\right) dt^2 - \left(1 - \tfrac{1}{r}\right)^{-1} dr^2 - r^2\, d\varphi^2,$$

where $d\varphi$ represents two coordinates the usual Euclidean way. Namely, $\varphi$
represents “space-angle”, i.e. $\varphi$ is the usual Euclidean combination of two polar
coordinates $\theta$ (longitude) and $\eta$ (latitude). Formally, $d\varphi^2 = d\theta^2 + \sin(\theta)^2\, d\eta^2$
(metric on Euclidean unit 2-sphere). We note that by defining the line-element
we also defined the metric-tensor field $g$ of Schwarzschild space-time $G_{sp}$.

In the general form of Schwarzschild space-time, there is a parameter that
we chose to be 1 in the above. Namely, the general form of the line-element
for the Schwarzschild black hole is

$$ds^2 = \left(1 - \tfrac{M}{r}\right) dt^2 - \left(1 - \tfrac{M}{r}\right)^{-1} dr^2 - r^2\, d\varphi^2,$$

the parameter $M \in \mathbb{R}$ is thought of as the “mass” of the black hole. $M$ is also
called the radius or size of the black hole. (For historical reasons $2m$ is used
for $M$ in the literature, but this does not matter when one wants to understand
the “logic” of the black holes.) Similarity with the accelerated observer can be
discovered by choosing $x = r - 1$. Then the accelerated line-element (i.e. that
of $G_{ua}$) becomes $ds^2 = (r - 1)^2 dt^2 - dr^2$. For lack of space we do not discuss
this more here, but we note that there is more in this direction in Andréka et al.,
2006a.

Figure 11.31. Illustration for Schwarzschild space-time. $G_t$ gets longer as we approach $r = 1$
from above, i.e. local time runs slower and slower as we approach $r = 1$. $G_x$ gets shorter as
we approach $r = 1$, i.e. there is more and more space (compared to “coordinate-space”) as we
approach $r = 1$. The length of $G_y$ stays 1; this means that spatial distances orthogonal to the
radius agree with coordinate-distances. Time and (radial) space are interchanged in the interior
of the black hole; this means that, in the interior, the r-axis is the worldline of an observer, but
lines parallel to the time-axis are not possible worldlines. The singularity is in the future of an
observer inside the black hole.


The set of coordinate points $p \in \mathbb{R}^4$ with $r = 0$ is called the singularity
(this coincides with the time-axis), and the set of coordinate points $p \in \mathbb{R}^4$ with
$r = 1$ is called the event horizon (EH). These are disjoint from the domain
of $G_{sp}$, i.e. we did not associate local clocks and meter-rods to these points.
Thinking in terms of the global coordinate frame, the event horizon is a sphere
of radius 1 (or $M$ in the more general case), and the singularity is the center of
this sphere. Loosely speaking, we will refer to the EH and its interior as the
black hole (BH). When we want to be more careful, we will refer to the part
outside of the EH as the exterior of the BH and we will refer to the inside part
of the EH as the interior of the BH. Later we will see that the interior and the
exterior of the BH behave, in some sense, like two different universes connected
by a one-way membrane, namely by the EH. Here one-way membrane means
that an observer may fall through the EH into the interior of the BH, but nothing,
not even light, can come out from the interior. This effect is not yet clear from
our present space-time diagram (Fig. 11.32) but it will be clear after we apply
the so-called Eddington-Finkelstein re-coordinatization (and extension) to it,
cf. Fig. 11.35.

Let us think for a while in terms of the global coordinate system, and let
us see what the exterior of the BH looks like. Infinitely far away everything
is normal: the farther away we are from the EH, the more “normal life is”,
e.g. both local time and local meter-rods agree with the global GFR coordinate
ones asymptotically. Space-times with this property are called asymptotically
flat. Asymptotically flat means asymptotically Minkowski (or asymptotically
special relativistic), namely as we move away from the BH, space-time becomes
more and more like Minkowski space-time. This can be formalized by saying
that as $r(p)$ tends to infinity, the metric-tensor $g(p)$ tends to “Minkowski g”.

Convention: by coordinate properties (e.g. coordinate time) we always mean 
the global, GFR-coordinate properties. 

Let us now approach the EH from far away. As we get close, local clocks
begin to tick radically slower (compared to the global coordinate time), beyond
any limit; so metaphorically local clocks “stop” or freeze at the EH. (This is only
metaphorical because we did not associate local clocks and meter-rods to the
points of the EH. However, this will be remedied by the upcoming
Eddington-Finkelstein re-coordinatization, and then time will really freeze on the EH.) At
the same time, local meter-rods in the radial direction get smaller (beyond any
limit) as we approach the EH, but local meter-rods orthogonal to the radius of
the EH continue to agree with the coordinate meter-rods.

Far away from the BH, GFR-coordinate-speed of light tends to be the same,
namely 1, in all directions, but as we approach the EH, the coordinate-speed of
light in the radial direction gets radically smaller compared to the
coordinate-speed of light in directions orthogonal to the radius (this coordinate “anisotropy”
is the reason why the light-cones in Fig. 11.31 close to the EH but in the exterior
get “flattened out”); and as we get closer to the EH, the coordinate-speed of
light tends to 0 in all directions. We note that the fact that $g_{ti}(p) = 0$ for
$i \ne t$ everywhere means that at each event, the coordinate-speed of light in a
spatial direction $d$ and in its opposite direction $-d$ is the same, if measured by
the global frame. The above means that the so-called light-cones get infinitely
narrow towards the EH but they do not get tilted as seen by the GFR, cf.
Fig. 11.31, Fig. 11.32.
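
These statements about coordinate-speeds can be read off from the line-element with M = 1 by setting $ds^2 = 0$. For a radial photon ($d\varphi = 0$) this gives $|dr/dt| = 1 - 1/r$, and for a photon moving orthogonally to the radius ($dr = 0$) it gives $r\,|d\varphi/dt| = \sqrt{1 - 1/r}$. The following sketch tabulates both in the exterior; the derivation and the function names are ours, under the assumption that the line-element is the one given above.

```python
import math

def radial_light_speed(r):
    """Coordinate speed |dr/dt| of a radial photon at radius r > 1 (from ds^2 = 0)."""
    return 1.0 - 1.0 / r

def tangential_light_speed(r):
    """Coordinate speed r * |dphi/dt| of a photon moving orthogonally to the radius."""
    return math.sqrt(1.0 - 1.0 / r)

for r in (1000.0, 3.0, 1.1, 1.001):
    radial, tangential = radial_light_speed(r), tangential_light_speed(r)
    # The ratio radial / tangential measures the coordinate anisotropy.
    print(r, radial, tangential, radial / tangential)
```

Far away both speeds are close to 1; towards r = 1 both tend to 0, and the radial speed does so faster, which is the “anisotropy” mentioned above.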

Since a timelike curve must stay inside the local light-cones, this means that
observers stay all the coordinate time outside the EH, they never “enter” the EH,
as seen via the global coordinate grid. We will see that this is only an “artifact” of
Schwarzschild’s particular choice of global coordinate system (GFR), similar to
the “artifact” mentioned earlier and also in Fig. 11.30. This artifact
will be avoided via the so-called Eddington-Finkelstein re-coordinatization, which
will be presented soon.


[Figure 11.32 diagram. Labels: ingoing photons, observer m, observer k.]


Figure 11.32. The “tr-slice” of a Schwarzschild black hole. This is a geodesically closed
2-dimensional sub-space-time of $G_{sp}$. The worldline of $m$ is a geodesic; it “bulges” outward
because $m$ can maximize his time by bulging outwards. The worldline of $k$ is not a geodesic
(because it does not “bulge” outwards). Photons and inertial observers moving in radially
“freeze” to the EH. The wristwatch of an in-falling inertial observer slows down “infinitely”,
and will show times which are bounded.


Let us see what the worldlines of observers and photons in Schwarzschild 
space-time look like in the exterior of the EH. 

Consider, for example, the curve f : R → R⁴ where f(τ) = (√1.5 · τ, 3, 0, 0)
for all τ ∈ R. (This f is represented in Fig. 11.32 as “suspended
observer”.) This is a time-faithful curve, so we can imagine an observer k
whose worldcurve this f is; the worldline of k (in G_sb) is the vertical line
going through (0, 3, 0, 0) and k’s wristwatch shows τ at the coordinate point
(√1.5 · τ, 3, 0, 0). Is this curve f a geodesic? Local clocks at coordinate
points farther away from the EH tick faster, so one can “gain time” by moving
outwards a little, clocking up a lot of wristwatch time while out there, and then
coming back. Special relativistic time-dilation dampens this effect somewhat
since the clocks of a fast moving observer slow down. But it is not hard to
show that by moving outwards and then coming back with small velocities
all the time, one gains time, and there is an optimum “outward-bulging” with
maximum gain of time. Hence, vertical lines are not geodesics and the timelike
geodesics always “bulge a little outwards”, i.e. they accelerate (or “turn”), as
seen in GFR, towards the BH. So, f is not a geodesic.
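
The two claims of this paragraph can be checked numerically. The following is a minimal
Python sketch, assuming the tr-part of the Schwarzschild line-element with event horizon at
r = 1, i.e. dτ² = (1 − 1/r)dt² − (1 − 1/r)⁻¹dr². The sample “bulging” path, the time span T
and all function names are our own illustrative choices, not taken from the text.

```python
import math

def g_tt(r):
    """Time-time metric component 1 - 1/r (event horizon at r = 1)."""
    return 1.0 - 1.0 / r

def wristwatch_time(r_of_t, t_end, steps=20000):
    """Wristwatch time along the radial curve t -> (t, r(t)), 0 <= t <= t_end.

    Integrates d(tau) = sqrt(g_tt - (dr/dt)^2 / g_tt) dt with the midpoint rule.
    """
    dt = t_end / steps
    total = 0.0
    for i in range(steps):
        t_mid = (i + 0.5) * dt
        # central-difference estimate of the coordinate velocity dr/dt
        v = (r_of_t(t_mid + 0.5 * dt) - r_of_t(t_mid - 0.5 * dt)) / dt
        f = g_tt(r_of_t(t_mid))
        total += math.sqrt(f - v * v / f) * dt
    return total

T = 20.0  # coordinate-time span (illustrative value)

def suspended(t):
    # the vertical worldline of observer k at r = 3
    return 3.0

def bulging(t):
    # leaves r = 3 slowly, reaches r = 3.5 at t = T/2, and is back at t = T
    return 3.0 + 0.5 * math.sin(math.pi * t / T) ** 2

# time-faithfulness of f(tau) = (sqrt(1.5) * tau, 3, 0, 0): this product should be 1
clock_check = math.sqrt(1.5) * math.sqrt(g_tt(3.0))
tau_suspended = wristwatch_time(suspended, T)
tau_bulging = wristwatch_time(bulging, T)
print(clock_check, tau_suspended, tau_bulging)
```

Between the same two events, the slowly bulging path collects more wristwatch time than
the vertical one, which is why the vertical line cannot be a (time-maximizing) geodesic.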

Radial timelike geodesics are similar to the worldlines of pebbles thrown up
into the air here on the Earth, with “upward” replaced by “outward”: a geodesic
curve which begins to move away from the EH loses (coordinate) speed,
eventually stops, and “falls back” towards the EH. According to GR, things
thrown up fall back not because gravity of the Earth pulls them with myriad 
small invisible “hands”, but because time ticks slower near the Earth, and faster 
farther away from the Earth. Newton’s apple falls from the tree because of the 
“gravitational time-dilation” (known also as gravitational red-shift)! This is the 
first explanation for gravity since its behavior was described by Newton. 
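
The “pebble” behavior can be sketched numerically. The code below is an illustration, not
the method of the text: it assumes the standard fact that a radial timelike geodesic of the
Schwarzschild space-time with event horizon at r = 1 satisfies d²r/dτ² = −1/(2r²) in
wristwatch time τ, and it uses a simple velocity-Verlet integrator. The starting values and
function names are hypothetical.

```python
def radial_acceleration(r):
    """d^2 r / d tau^2 of a radial timelike geodesic (horizon radius 1)."""
    return -0.5 / (r * r)

def throw_outward(r0, v0, dtau=1e-3, max_steps=1_000_000):
    """Integrate until the pebble is back at r0; return (largest r, elapsed tau)."""
    r, v, tau = r0, v0, 0.0
    a = radial_acceleration(r)
    r_max = r0
    for _ in range(max_steps):
        # one velocity-Verlet step
        r_new = r + v * dtau + 0.5 * a * dtau * dtau
        a_new = radial_acceleration(r_new)
        v += 0.5 * (a + a_new) * dtau
        r, a = r_new, a_new
        tau += dtau
        r_max = max(r_max, r)
        if v < 0.0 and r <= r0:
            break  # the pebble has fallen back to its starting radius
    return r_max, tau

# thrown outward from r = 3 with dr/dtau = 0.2 (illustrative values)
r_max, tau_back = throw_outward(r0=3.0, v0=0.2)
# turning radius from the conserved quantity (dr/dtau)^2 - 1/r
r_turn = 1.0 / (1.0 / 3.0 - 0.2 ** 2)
print(r_max, r_turn, tau_back)
```

The geodesic moves outward, stops at the turning radius, and falls back, just as a pebble
does near the Earth.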

There is a similar reason for a photonlike geodesic with a velocity in nonradial 
direction to bend toward the EH as if the EH “pulled” the geodesic towards itself. 
So, even photons “fall” (gravitational “light-bending” effect). 

Since the BH attracts, if an observer wants to stay at a constant distance 
from the EH, he has to use fuel for accelerating away from it. We call an 
observer like k above a “suspended observer”. Suspended observer means 
that the worldline of the observer is parallel with the time-axis in the present 
Schwarzschild GFR coordinate system. However, the notion of “suspended 
observer” is coordinatization-independent, i.e. observer-independent, because 
these “vertical” worldlines can be defined by a first-order logic formula in 
the language of local clocks and meter-rods (i.e. in the language of Genrel in 
Sec. 3.6). In other words, there is an experiment with which an observer can 
check whether he is suspended or not. In still other words, being suspended is 
an observational concept. 

Let us consider a (timelike or photonlike) geodesic curve that starts out
towards the EH in a radial direction. By cylindrical symmetry of the
space-time, there is “no reason” for the geodesic to bend right or left, since “right”
and “left” are completely alike by symmetry. This implies that a geodesic curve
which starts in a radial direction will always move in this radial direction, i.e.
the (range of the) curve will be a subset of a tr-plane. Thus a tr-plane is a
geodesically closed sub-space-time of G_sb.

Let us review briefly the interior of the EH. In the interior of the EH, time and
(radial) space are interchanged; the r-axis is in fact the worldline of a possible
inertial observer. Hence the global r-axis is the local time-axis for some LFR
“living” in the interior of the BH. Similarly, the t-axis of the GFR is a spatial
direction for this LFR. The singularity is no longer a “place” for this observer
(or for any observer in the interior); instead, it is a future time instance like
the Big Crunch in usual cosmology, something that will happen in the distant
future but is not “present” in the “now” of the in-falling observer. Similarly, for
this inertial observer inside the BH, the EH is like a time instance or a
“time-slice of space-time” which happened sometime in the past, like the Big Bang in
cosmology. We see the light coming from the Big Bang or from the EH in our
past but we cannot influence it causally because it is in our past. Local time
at the EH (in the interior) is “infinitely fast” compared to coordinate-time and
local meter-rods in the t-direction are “infinitely long”. As we move towards
the singularity, local time “slows down” (compared to GFR) beyond any limit,
and local meter-rods in the t-direction get shorter, approaching zero
coordinate-length at the singularity. Thus local light-cones in the interior of the EH are
“infinitely wide” at the EH, and get “infinitely narrow” at the singularity, see
Fig. 11.32.
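
The widths of the light-cones described above can be read off from the radial photon
condition (1 − 1/r)dt² = (1 − 1/r)⁻¹dr², which follows from the tr-part of the Schwarzschild
line-element with event horizon at r = 1. The following small Python sketch tabulates them;
the function names are ours and the sample radii are arbitrary.

```python
def time_axis_is_timelike(r):
    """True where the coordinate direction 1_t is timelike, i.e. where g_tt > 0."""
    return 1.0 - 1.0 / r > 0.0

def exterior_cone_halfwidth(r):
    """|dr/dt| of radial photons for r > 1: opening of the cone around the t-axis."""
    if r <= 1.0:
        raise ValueError("exterior means r > 1")
    return 1.0 - 1.0 / r

def interior_cone_halfwidth(r):
    """|dt/dr| of radial photons for 0 < r < 1: opening of the cone around the r-axis."""
    if not 0.0 < r < 1.0:
        raise ValueError("interior means 0 < r < 1")
    return r / (1.0 - r)

for r in (1000.0, 3.0, 1.001):
    print("outside", r, exterior_cone_halfwidth(r))
for r in (0.999, 0.5, 0.001):
    print("inside ", r, interior_cone_halfwidth(r))
```

Outside, the cones around the t-axis close up towards the EH; inside, the cones around the
r-axis are arbitrarily wide near the EH and close up towards the singularity.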

We could say that the space-time G_sb is the worldview of an observer k
suspended far away from the black hole. How far away? We measure “far” 
by multiples of the radius of the EH (which we chose to be 1 presently), and 
the farther away the suspended observer is, the more he will experience special 
relativistic space-time on his own worldline. He sees that inertial observers 
fall towards the black hole, but he never sees them reach the black hole, for 
him (k) they stay outside for the eternity of k. As they fall towards the black 
hole, they move slower and slower as they approach the event-horizon, and 
eventually they “freeze” onto the event-horizon. Also the wristwatches of the 
inertial observers tick slower and slower towards the EH, and “stop altogether 
at the EH”. The same happens to the photons sent towards the EH in a radial 
direction: the photons slow down as they approach the EH, and eventually, and
metaphorically, they “freeze” onto the event horizon. Our suspended observer 
k observes things this way both via photons (i.e. visually by his eyes) and via 
his coordinate system. 

Since an in-falling inertial observer m lives for an infinite GFR-time, it is 
possible in theory that there is no upper bound for the time his wristwatch 
shows, i.e. that m also will experience that all his infinite time passes outside 
the EH. However, this is not the case: the wristwatch readings of an in-falling 
inertial observer m are bounded, e.g. the wristwatch of m may approach 12 as 
he falls in, tick slower and slower and never reach 12 o’clock, cf. Fig. 11.32. So 
what happens in the in-falling inertial observer’s own worldview or spaceship
when his wristwatch reaches 12 o’clock? 
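
To make the contrast between bounded wristwatch time and unbounded GFR-time concrete, here
is a minimal Python sketch. It is an illustration under a stated assumption that is not in
the text: the observer m is dropped from rest at infinity, so that (with event horizon at
r = 1) dr/dτ = −1/√r and dt/dr = −r^(3/2)/(r − 1). Both integrals have closed forms; the
function names are ours.

```python
import math

def wristwatch_time(r_start, r_end=1.0):
    """Wristwatch time of m while falling from r_start to r_end (integral of sqrt(r))."""
    return (2.0 / 3.0) * (r_start ** 1.5 - r_end ** 1.5)

def _antiderivative(r):
    # antiderivative of r**1.5 / (r - 1), valid for r > 1
    s = math.sqrt(r)
    return (2.0 / 3.0) * r ** 1.5 + 2.0 * s + math.log((s - 1.0) / (s + 1.0))

def coordinate_time(r_start, r_end):
    """GFR-time elapsed while m falls from r_start to r_end, with r_end > 1."""
    return _antiderivative(r_start) - _antiderivative(r_end)

print("wristwatch time from r = 4 to the EH:", wristwatch_time(4.0))
for eps in (1e-3, 1e-6, 1e-9):
    print("GFR-time from r = 4 to r = 1 +", eps, ":", coordinate_time(4.0, 1.0 + eps))
```

The wristwatch reading stays below a fixed bound (here 14/3), while the GFR-time grows
without limit, logarithmically in the remaining coordinate distance to the EH.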

We will see that he falls through the EH into the interior of the BH. For 
our suspended observer k, the interior of the BH is not visible by photons, he 
cannot get information about the inside of the BH while suspended. However, 
he may wonder what might be inside of that “big black ball”, i.e. the EH. While 
suspended he cannot find out the answer. But, in principle, he may choose to 
fall in, and because of this, the interior of the BH is a “reality” for him. 

In many ways this worldview G_sb of the suspended observer k is similar to
the worldview G_ua of an accelerated observer in special relativity. However,
the following is an important difference between G_ua and G_sb.
Assume two inertial observers m, h fall radially into the black hole, along the same
“path”, i.e. in the same direction, and that they started to fall at the same GFR-time,
close to each other. During their fall, one of them, say m, measures the
distance between them by sending photons to h, who mirrors them back to m.
Then m constantly measures the time it takes for the photon to return.
The result of this measurement is called radar-distance. In G_ua this
radar-distance remains the same, since the distance of parallel geodesics in G_sr
does not change (this means that it is “flat”). However, in G_sb, m will find
that the radar-distance between him and h is growing; and since G_sb is the
relativistic version of a gravitational source, this is to be expected because
in Newtonian gravity, things closer to a gravitational source fall faster than
things more distant. Technically speaking, timelike geodesics that started out
in a parallel way will increase their distance from each other in the tr-plane;
the space-time G_sb is curved.¹¹ This shows that there is no partial isomorphism
between G_sb and G_sr with an open domain.


Worldview of an observer falling into the black hole: 


In the worldview of an observer falling into the black hole, the worldline of
the in-falling observer m ought to be a straight line parallel with the time-axis.
Instead of aiming for this, it is more convenient to “re-coordinatize” G_sb in
such a way that the worldline of a radially ingoing photon will be a straight
line of slope 1. There are no essential differences between such a worldview
and a worldview where the worldline of m would be a straight line.¹² In fact,
since the EH and the singularity in its middle are special “marked” places in
this worldview, it is quite natural to make a worldview where these “do not
move” in the GFR.





11 Divergence of parallel geodesics means negative curvature. Hence the tr-plane of G_sb is negatively
curved. This is an instance of the so-called tidal forces which G_sb (and, in general, GR) inherited from
the Newtonian theory of gravity.

12 “Parallel with the time-axis” is inessential here, since it is easy to rotate a picture.
In G_sb, the worldline of a radially in-falling photon in the tr-plane and outside
the EH is {(−r − ln(r − 1) + constant, r) : r > 1}, and the worldline of a
photon inside the EH is {(−r − ln(1 − r) + constant, r) : 0 < r < 1}. (This
is not difficult to show by using the definition of a photonlike curve given in
Sec. 3.3, and by knowing that the worldline is a subset of the tr-plane.) Thus
the following simple (partial) function Iso : R⁴ → R⁴ will take these photon
worldlines to straight lines of slope 1:


Iso(t, x, y, z) := (t + ln|r − 1|, x, y, z) where r = √(x² + y² + z²).
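
Both statements can be checked numerically in the tr-plane. The following Python sketch
assumes the Schwarzschild line-element with event horizon at r = 1, so that a radial curve
t(r) is photonlike exactly when (1 − 1/r)(dt/dr)² = (1 − 1/r)⁻¹. The helper names and the
sample radii are ours.

```python
import math

def photon_t_outside(r, c=0.0):
    """t-coordinate of the ingoing photon worldline for r > 1."""
    return -r - math.log(r - 1.0) + c

def photon_t_inside(r, c=0.0):
    """t-coordinate of the ingoing photon worldline for 0 < r < 1."""
    return -r - math.log(1.0 - r) + c

def iso_t(t, r):
    """New t-coordinate under Iso; the spatial coordinates are left unchanged."""
    return t + math.log(abs(r - 1.0))

def null_defect(t_of_r, r, h=1e-6):
    """(1 - 1/r) (dt/dr)^2 - 1/(1 - 1/r); zero for a photonlike radial curve."""
    slope = (t_of_r(r + h) - t_of_r(r - h)) / (2.0 * h)
    f = 1.0 - 1.0 / r
    return f * slope * slope - 1.0 / f

print(null_defect(photon_t_outside, 2.0), null_defect(photon_t_inside, 0.5))
for r in (1.5, 2.5, 4.0):
    # after Iso the worldline is the straight line t' = -r + c (here c = 1)
    print(r, iso_t(photon_t_outside(r, 1.0), r))
```

After applying Iso, the image of each ingoing photon worldline is a straight line at 45
degrees to the axes, on both sides of r = 1.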


Let us look for the isomorphic image of G_sb along Iso. By Theorem 11.24(ii)
(p. 675), the simplest way of defining the isomorphic image G'_ef = ⟨M′, L′⟩
of G_sb = ⟨M, L⟩ is by letting L′(Iso(p)) = D(Iso)_p ∘ L_p for all p ∈ M. By
doing so we get the following definition (see Fig. 11.33).


[Figure 11.33 labels: t-axis; r = 1]

Figure 11.33. Eddington-Finkelstein re-coordinatization as an isomorphism between
space-times.


The Schwarzschild black hole in Eddington-Finkelstein coordinates is G'_ef =
⟨M′, L′⟩ where M′ = {p ∈ R⁴ : r(p) ≠ 0, 1}, and L′ is specified by the
vector-tetrad G′(p) = (G′_t(p), G′_r(p), G′_φ(p), G′_θ(p)) where


$G'_r(p) = \sqrt{\tfrac{r-1}{r}}\,1_r + \tfrac{1}{\sqrt{r(r-1)}}\,1_t, \quad G'_t(p) = G_t(p)$ for r > 1,


$G'_r(p) = \sqrt{\tfrac{1-r}{r}}\,1_r - \tfrac{1}{\sqrt{r(1-r)}}\,1_t, \quad G'_t(p) = G_t(p)$ for r < 1, and

$G'_\varphi(p) = G_\varphi(p), \quad G'_\theta(p) = G_\theta(p)$ for all r.


(A few words on how we got G′(p): Assume r = r(p) > 1 and let p′ = Iso(p).
Now D(Iso)_p takes 1_t + p to 1_t + p′ and 1_r + p to 1_r + (1/(r−1))·1_t + p′. It can be seen
in Fig. 11.33 that we get G′_r(p) by considering its “slope (relative to 1_r)” and by
considering the length of its “r-projection”. The slope is given by the derivative
of Iso, thus it is 1/(r−1), and the r-projection of G′_r(p) is G_r(p) = √((r−1)/r) · 1_r.
We obtain (1/(r−1))·|G_r| = 1/√(r(r−1)) as the t-component of G′_r. The case r < 1 is
analogous. G′_t, G′_φ, G′_θ are obtained similarly.) Thus, G′(p) specifies the
LFR at p, see Fig. 11.34.


[Figure 11.34 labels: singularity r = 0; event horizon r = 1]

















Figure 11.34. Schwarzschild space-time in Eddington-Finkelstein coordinates. 


To see what the metric-tensor field g_ef of G'_ef is we have to “look at the
coordinate unit vectors 1_i” with the eye of this LFR. This was explained in
Sec. 3.4, on p. 672.

In the coordinate system specified by G′(p), the coordinates of 1_t and 1_r
are (√((r−1)/r), 0, 0, 0) and (−1/√(r(r−1)), √(r/(r−1)), 0, 0) respectively. Hence for
r = r(p) > 1,


$g_{tt}(p) = g_p(1_t, 1_t) = \left(\sqrt{\tfrac{r-1}{r}}\right)^2 = \tfrac{r-1}{r},$

$g_{rr}(p) = g_p(1_r, 1_r) = \left(-\tfrac{1}{\sqrt{r(r-1)}}\right)^2 - \left(\sqrt{\tfrac{r}{r-1}}\right)^2 = -\tfrac{r+1}{r},$

$g_{tr}(p) = g_{rt}(p) = g_p(1_t, 1_r) = \sqrt{\tfrac{r-1}{r}} \cdot \left(-\tfrac{1}{\sqrt{r(r-1)}}\right) = -\tfrac{1}{r}.$


For r < 1 we obtain the same final values for g_ij(p). For this reason, the
metric-tensor field g_ef belonging to G'_ef is given by the following line-element

$ds^2 = (1 - \tfrac{1}{r})\,dt^2 - \tfrac{2}{r}\,dt\,dr - (1 + \tfrac{1}{r})\,dr^2 - r^2\,d\varphi^2.$

In this metric-tensor, g_tr ≠ 0 because the coordinate unit vectors 1_t, 1_r are not
orthogonal in the eye of the local LFR specified by the vector-tetrad G′(p), see
Figs. 11.33, 11.34. This, g_tr ≠ 0, means that the light-cones in the tr-planes
are tilted as illustrated in Figs. 11.34, 11.35.
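
The tr-components of this metric-tensor can be cross-checked by a direct change of
coordinates. The sketch below is our own illustration and uses a route different from the
tetrad argument of the text: it pulls the Schwarzschild components diag(1 − 1/r, −(1 − 1/r)⁻¹)
back along t = t′ − ln|r − 1| and compares the result with g_tt = 1 − 1/r, g_tr = −1/r,
g_rr = −(1 + 1/r). The function names are hypothetical.

```python
def schwarzschild_tr(r):
    """tr-block of the Schwarzschild metric-tensor (event horizon at r = 1)."""
    f = 1.0 - 1.0 / r
    return [[f, 0.0], [0.0, -1.0 / f]]

def ef_tr_by_pullback(r):
    """tr-block in the new coordinates (t', r), where t = t' - ln|r - 1|."""
    # Jacobian J[i][j] = d(old coordinate i) / d(new coordinate j)
    J = [[1.0, -1.0 / (r - 1.0)],
         [0.0, 1.0]]
    g = schwarzschild_tr(r)
    # g' = J^T g J, written out for 2 x 2 matrices
    return [[sum(J[k][i] * g[k][l] * J[l][j] for k in range(2) for l in range(2))
             for j in range(2)] for i in range(2)]

def ef_tr_closed_form(r):
    """g_tt, g_tr, g_rr as read off from the Eddington-Finkelstein line-element."""
    return [[1.0 - 1.0 / r, -1.0 / r],
            [-1.0 / r, -(1.0 + 1.0 / r)]]

for r in (3.0, 1.5, 0.4):
    print(r, ef_tr_by_pullback(r), ef_tr_closed_form(r))
```

The two computations agree on both sides of r = 1, in line with the remark that the case
r < 1 gives the same final values.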

We can see that the above g_ef can smoothly be extended to the event horizon,
i.e. to EH = {p ∈ R⁴ : r(p) = 1}. The reason for this is that the above
formula for g_ef is not degenerate for r = 1. Hence we can extend G'_ef to EH,
and this way we get Eddington-Finkelstein space-time, in short EF-space-time,
G_ef = ⟨M_ef, g_ef⟩ where M_ef = {p ∈ R⁴ : r(p) ≠ 0}. This is an extension of
G'_ef, and Iso is a partial isomorphism between G_sb and G_ef. The event horizon
EH is part of the space-time here; and in fact the EH in a tr-plane is the worldline
of a photon! This extended G_ef explains what happens on the event horizon
and shows how the inside of the BH can be connected to the outside. (We
note that the above given G′ cannot be smoothly extended to EH, but one can
smoothly change G′ to G″ which gives the same metric-tensor field and which
can be smoothly extended to EH.)
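
Non-degeneracy at r = 1 and the photon character of the EH can be verified with a few
lines of Python. This sketch assumes the tr-components g_tt = 1 − 1/r, g_tr = −1/r,
g_rr = −(1 + 1/r) of the Eddington-Finkelstein line-element; the closed form for the
outgoing photon velocity is obtained by solving the quadratic null condition and, like the
function names, is our own addition.

```python
def ef_metric_tr(r):
    """tr-block of the Eddington-Finkelstein metric-tensor."""
    return [[1.0 - 1.0 / r, -1.0 / r],
            [-1.0 / r, -(1.0 + 1.0 / r)]]

def det_tr(r):
    """Determinant of the tr-block; nonzero means the block is non-degenerate."""
    g = ef_metric_tr(r)
    return g[0][0] * g[1][1] - g[0][1] * g[1][0]

def interval(r, dt, dr):
    """ds^2 of the displacement (dt, dr) in the tr-plane."""
    g = ef_metric_tr(r)
    return g[0][0] * dt * dt + 2.0 * g[0][1] * dt * dr + g[1][1] * dr * dr

def radial_photon_velocities(r):
    """Coordinate velocities dr/dt of the (ingoing, outgoing) radial photons."""
    return -1.0, (r - 1.0) / (r + 1.0)

for r in (0.3, 1.0, 2.5):
    v_in, v_out = radial_photon_velocities(r)
    print(r, det_tr(r), v_in, v_out, interval(r, 1.0, v_in), interval(r, 1.0, v_out))
```

The determinant equals −1 for every r > 0, including r = 1. At r = 1 the outgoing photon has
dr/dt = 0, so it stays on the EH, and for r < 1 both photon velocities point inwards, which is
the tilting of the light-cones.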

What will an in-falling inertial observer experience in G_ef? Throughout we
assume that the BH in question is big enough so that the tidal forces on the
EH and also well inside the EH are negligible. The present “animation” is
based largely on G_ef in Fig. 11.35, but Fig. 11.32 and Fig. 11.36 are also taken into
account. So, it is useful to consult these figures, with an emphasis on Fig. 11.35,
before reading on. For visualizability, we assume that the BH is like the ones
in centers of galaxies in that there are some stars (suns) orbiting our BH. So 
there is the EH, outside that there are the nearby suns orbiting the EH, and far 
away, there are the distant galaxies. As explained in Rindler, 2001, Sec. 12.5, 
pp. 267-271, Fig. 12.6, it is observationally possible to decorate the exterior of 
our BH with a latticework or “scaffolding” consisting of suspended observers 
(spaceships using fuel for maintaining their latitude) which surround the EH 
(only the exterior) and which maintain constant radar-distance from each other 
by using rockets. Gyroscopes are used to avoid rotation. We will think of these 
suspended observers as milestones, telling our in-falling observer where he is 
and what his speed is. It is impossible to have suspended observers on the EH 
or inside the EH, so one sign telling the in-falling observer that he is already 
inside will be the nonexistence or disappearance of the milestones. 
Figure 11.35. The “tr-slice” of the space-time of a Schwarzschild black hole in
Eddington-Finkelstein coordinates.


As the in-falling observer m approaches the EH, he will see the milestones 
flashing by him, so to speak, faster and faster approaching the speed of light. If 
there were a milestone on the EH, then it would flash by (i.e. move relative to the 
observer) with the speed of light. However, this milestone cannot be realized 
by an observer. When all the milestones have flashed by our in-falling m, he 
will notice that there are no more milestones and even the BH has disappeared. 
Then m finds himself in basically empty flat-looking space¹³ with no BH and
no singularity in any of his spatial directions. More precisely, if he knows what 
to look for, then he can still observe some traces of the EH just as we can “see” 
our Big Bang (via the cosmic microwave background radiation) but it is all in 
the past, gone so to speak, and not influenceable causally, just as history is not
changeable. The nearby suns (say, of different colors for fun) are also visible,
even moving as m watches them, but they are like ghosts, their light comes 
from before the (Big Bang like) EH and they are causally not “touchable”, 
since they all are in the distant past. All of the outside world, even the future 





13 Space may become flat (inside EH, of course), space-time remains curved. 
of the suspended observers outside the EH, are in the past for m inside the BH,
moving, living, dynamical, and changing but in the distant past before the Big 
Bang like EH, causally unreachable. What is interesting about this is that it is 
tempting to say that for m safely inside the EH the exterior of the EH does not 
exist. He is in a different universe, period. But that would not be the complete 
truth. Namely, the nearby suns circling the BH are still visible for m, they are 
just not influenceable causally. For more on this experience of seeing several 
universes via a BH we refer to the professional physics movie “Falling into a 
BH” (Hamilton, 2001), and the Prologue of Thorne, 1994. What we described 
so far is nothing but a decoding of the space-time diagrams Figs. 11.32-11.36 
and of the metric of G_ef. This is the meaning of the mathematical expression
“space and time get interchanged”. Soon we will discuss more subtle
BH’s where m can avoid the fate of eventually hitting the singularity. 

If we want to concentrate on the causal structure of a space-time, e.g. of the
Eddington-Finkelstein black hole in Fig. 11.35, then that can be represented
more compactly by a so-called conformal diagram (or Penrose diagram) of
G_ef. Such a conformal diagram (of Fig. 11.35) is shown in Fig. 11.36. In
a conformal diagram, photonlike geodesics are represented as straight lines of
slope 1 and local time flows upwards.

G_sb and G_ef are two worldviews of the Schwarzschild black hole, connected
by Iso. In a way analogous to how we obtained G_ef, we can obtain
a re-coordinatization where the worldlines of the outgoing photons will be
straight lines of slope 1. In this worldview, the interior of the EH will behave like
a so-called white hole: things can come out of the interior but cannot move
inside. If we go on completing the worldlines of observers wherever we can, we
will also obtain a so-called hypothetical dual universe. All of these fit into one
worldview called the Kruskal-Szekeres space-time, whose conformal structure
is illustrated in Fig. 11.36.


4.3 Double black holes, wormholes 


After an observer falls into a Schwarzschild black hole, he has only a finite 
time to live inside, and he must meet the singularity. There are many more 
friendly kinds of black holes, where he can live for an infinite time inside 
the black hole, he can avoid meeting a singularity, and he can even come out 
into an asymptotically flat region. (The expression “wormhole” refers to this 
last property.) We briefly describe here two such black holes, the electrically 
charged black hole and the rotating black hole. 


Electrically charged black holes 
This black hole is also called the Reissner-Nordström black hole in the literature.
Its line-element is


$ds^2 = (1 - \tfrac{1}{r} + \tfrac{e}{r^2})\,dt^2 - (1 - \tfrac{1}{r} + \tfrac{e}{r^2})^{-1}\,dr^2 - r^2\,d\varphi^2,$


where 0 < e < 1/4. (Notice the strong analogy with Schwarzschild space-time
G_sb.) Here, e represents the square of the electric charge. In this space-time
r = 0 is the singularity, and there are two event horizons at


$r^- = \tfrac{1}{2} - \sqrt{\tfrac{1}{4} - e}$ and $r^+ = \tfrac{1}{2} + \sqrt{\tfrac{1}{4} - e}.$


[Figure 11.36 labels: future singularity (spacelike, unavoidable); this light signal does not
reach m; event horizon]

Figure 11.36. Conformal or Penrose diagram of the “completed” Schwarzschild black hole. The
shaded area consisting of blocks I, II is the conformal diagram of the EF-black hole. Region I is
the exterior of the BH, region II is the interior of the BH, region III is the white hole, and region
IV is the dual universe. In some sense, regions III, IV may or may not exist but regions I, II have a
stronger ontological status, they probably exist.


The exterior of the outer event horizon is similar to the Schwarzschild black
hole: the light-cones get narrower towards the outer EH, and they are “infinitely
narrow” at the EH. Inside the outer EH, the space-time remains similar to G_sb
till about halfway towards the inner EH: time and space get interchanged and as
we move inwards, local time gets faster and faster. But after a while, local time
begins to slow down again, and local time “stops” at the inner event horizon,
where time and space get interchanged once more. The innermost part, the
interior of the inner event horizon, is somewhat similar to the exterior of the
outer EH, but time runs faster and faster towards the singularity, beyond any
limit. The singularity can be avoided inside the black hole, so the in-falling
observer can “live forever”.¹⁴ The coordinatization represented by the above
line-element is analogous to the Schwarzschild coordinatization of the simple black
hole, where the events of the in-falling inertial observers’ entering the outer EH





14 Actually, it is extremely difficult to go near the singularity (because of the repelling effect), so the in-falling 
observer is safe, will not be hurt by the BH. 
are not included. An Eddington-Finkelstein-type re-coordinatization of the
space-time where the worldlines of the ingoing photons are straight lines of 
slope 1 is illustrated in Fig. 11.37. 
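
The three regions of the charged black hole can be located with a short Python sketch. It
assumes that the time-time component of the line-element of the electrically charged black
hole is g_tt = 1 − 1/r + e/r² with 0 < e < 1/4 (our reading of the line-element, in analogy
with G_sb), so that the event horizons are the roots of r² − r + e = 0. The sample value
e = 0.2 and the function names are ours.

```python
import math

def g_tt(r, e):
    """Time-time component 1 - 1/r + e/r^2 of the charged black hole."""
    return 1.0 - 1.0 / r + e / (r * r)

def horizons(e):
    """Inner and outer event horizons, i.e. the roots of r^2 - r + e = 0."""
    if not 0.0 < e < 0.25:
        raise ValueError("two distinct horizons require 0 < e < 1/4")
    d = math.sqrt(0.25 - e)
    return 0.5 - d, 0.5 + d

def region(r, e):
    """Name the region containing r, judged by the horizons and the sign of g_tt."""
    r_minus, r_plus = horizons(e)
    if r > r_plus:
        return "exterior (t is timelike)"
    if r > r_minus:
        return "between the horizons (time and space interchanged)"
    return "inside the inner horizon (t is timelike again)"

e = 0.2
r_minus, r_plus = horizons(e)
print(r_minus, r_plus)
for r in (1.0, 0.5, 0.1):
    print(r, g_tt(r, e), region(r, e))
```

The sign of g_tt is positive outside the outer EH, negative between the two horizons, and
positive again inside the inner EH, matching the two interchanges of time and space
described above.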

The conformal diagram of the electrically charged BH is shown in Fig. 11.39. 
An observer falling into this BH may come out to an asymptotically flat region 
(after crossing the EH’s) as indicated in Fig. 11.39. 


[Figure 11.37 labels: t-axis; z = r⁻ (inner event horizon); z = r⁺ (outer event horizon)]








Figure 11.37. The “tr-slice” of the electrically charged black hole. (Also the “tz-slice” of the
space-time of a slowly rotating black hole in coordinates where z is the axis of rotation of the
black hole.) r⁺ is the outer event horizon, r⁻ is the inner event horizon, z = 0 is the “center” of
the black hole. The tilting of the light-cones indicates that not even light can escape through these
horizons. That there is an outward push counteracting gravity can be seen via the shape of the
light-cones in region III (central region of the black hole). The time measured by m is finite
(measured between an event outside the inner EH and the event when m meets the inner event
horizon) while the time measured by k is infinite.
Rotating (spinning) black holes

The space-times of slowly rotating black holes, called slow Kerr space-times
in the literature, are similar to the electrically charged ones in that there are two
EH’s. We can think that the second, inner EH is the result of a repelling force
overtaking the attraction of gravity. In the case of electrically charged black
holes, the repelling force can be thought of, roughly, as the result of an excess
of electrons (or protons) “in the BH”, cf., e.g., d’Inverno, 1983, pp. 239-244
or Hawking and Ellis, 1973, p. 156 for a more careful explanation. In the case
of rotating black holes, the repelling force can be thought of as the centrifugal
force of rotation. The metric-tensor g(p) of the Kerr black hole at p = (t, r, φ, ϑ)
is given by the 4 by 4 matrix


$g_{kerr} = \begin{pmatrix} -1+\mu & 0 & -\mu a \sin^2\vartheta & 0 \\ 0 & \Sigma/\Delta & 0 & 0 \\ -\mu a \sin^2\vartheta & 0 & g_{\varphi\varphi} & 0 \\ 0 & 0 & 0 & \Sigma \end{pmatrix}$


where Σ = r² + a² cos² ϑ, Δ = r² − Mr + a², μ = Mr/Σ, and g_φφ = (r² +
a² + μ a² sin² ϑ) sin² ϑ. We used the so-called Boyer-Lindquist coordinates
(t, r, φ, ϑ), which are a kind of polar-cylindric coordinates, r being
radius (to be precise, r is the logarithm of the radius) and φ, ϑ being angular
coordinates like the ones on p. 686. In the Kerr metric g_kerr, M, a ∈ R
are parameters, M corresponding to mass and a to the angular momentum of
the rotating singularity. Indeed, choosing a = 0 yields the metric of the simple
Schwarzschild BH. A two-dimensional slice of a slowly rotating black hole
is very similar to the one in Fig. 11.37, and a “spatial” representation is in
Fig. 11.38.
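
As a small illustration, the matrix above can be coded directly. The following Python
sketch builds the Kerr metric-tensor from Σ, Δ and μ as defined in the text (coordinate
order t, r, φ, ϑ) and checks that a = 0 gives the Schwarzschild values with 1 − M/r. The
location of the two event horizons at the roots of Δ = 0 is a standard property of the Kerr
metric that is not stated in the text; the sample parameter values and function names are
ours.

```python
import math

def kerr_metric(r, theta, M, a):
    """4 x 4 Kerr metric-tensor in Boyer-Lindquist coordinates (t, r, phi, theta)."""
    s2 = math.sin(theta) ** 2
    sigma = r * r + a * a * math.cos(theta) ** 2
    delta = r * r - M * r + a * a
    mu = M * r / sigma
    g_phiphi = (r * r + a * a + mu * a * a * s2) * s2
    return [[-1.0 + mu, 0.0, -mu * a * s2, 0.0],
            [0.0, sigma / delta, 0.0, 0.0],
            [-mu * a * s2, 0.0, g_phiphi, 0.0],
            [0.0, 0.0, 0.0, sigma]]

def kerr_horizons(M, a):
    """Roots of Delta = r^2 - M r + a^2 (assumed here to mark the two event horizons)."""
    disc = 0.25 * M * M - a * a
    if disc <= 0.0:
        raise ValueError("slow rotation means |a| < M/2")
    root = math.sqrt(disc)
    return 0.5 * M - root, 0.5 * M + root

g = kerr_metric(r=3.0, theta=math.pi / 3.0, M=1.0, a=0.0)
print(g[0][0], g[1][1], g[2][2], g[3][3])   # Schwarzschild values for a = 0
print(kerr_horizons(M=1.0, a=0.3))
```

With a = 0 the off-diagonal term vanishes and the diagonal reduces to −(1 − M/r),
(1 − M/r)⁻¹, r² sin² ϑ and r², while for 0 < |a| < M/2 there are two distinct horizons, as in
the electrically charged case.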

We meet two interesting features in these black holes. The first interesting 
feature is that there are so-called Malament-Hogarth events in these space-times. 
An event e is called a Malament-Hogarth event if in the causal past of e there is 
a time-faithful curve which is infinite in the future direction. The words “past”
and “future” are important here; they refer to a time-orientation of the
space-time, as follows. A time-orientation on a GR space-time ⟨M, L⟩ is a smooth
vector-field each member of which is timelike (formally, a time-orientation is
a smooth T : M → R⁴ such that μ_p(T(p) − p) > 0 for all p ∈ M). All the
GR space-times mentioned in Sec. 4 have natural time-orientations. Given a 
time-orientation, the notion of a future-oriented timelike curve can be defined. 
The causal past of an event e is defined to be the set of events e’ which can be 
connected with e by a future-oriented timelike curve such that e’ is “earlier” in 
this curve than e is. 

One could think that in Malament-Hogarth events “actual infinity” is an 
observable physical reality. This phenomenon raises lots of intriguing questions 
to think over and has consequences even for the foundation of mathematics. 
Figure 11.38. A slowly rotating (Kerr) black hole has two event horizons and a ring-shaped
singularity (the latter can be approximated/visualized as a ring of extremely dense and thin
“wire”). The ring singularity is inside the inner event horizon in the “equatorial” plane of
axes x, y. The time coordinate is suppressed. Fig. 11.37 is a space-time diagram of this with x, y
suppressed. Rotation of the ring is indicated by an arrow. The orbit of an in-falling observer m is
indicated; it enters the outer event horizon at point e, and meets the inner event horizon at point b.
For more on the basics of this figure cf. O’Neill, 1995, Fig. 2.2, p. 63.


For more on this we refer to Németi and David, 2006, Németi and Andréka, 
2006, and to Etesi and Németi, 2002. There are Malament-Hogarth events in 
both the charged and the rotating black holes, see Fig. 11.39. 

Another intriguing feature is the presence of closed timelike curves (CTC’s) 
in Kerr space-time. CTC’s raise the question of time-travel into the past, and 
offer themselves for a logical treatment like the Liar Paradox. For more on 
this we refer to Earman, 1995. There are CTC’s in the space-time of a rotating 
black hole, see e.g. O’Neill, 1995, pp. 76-77, Proposition 2.4.7, Wüthrich,
1999, Andréka et al., 2006c. There are many other kinds of space-times with
CTC’s, e.g. Tipler-van Stockum’s rotating cylinder, Gödel’s universe, the ones
described in Thorne, 1994 and Novikov, 1998, to mention a few. Cf. Fig. 11.40.


4.4 Black holes with antigravity (i.e. with a cosmological
constant Λ). Triple black holes


One can combine the idea of a BH with a universe in which the vacuum
regions have a nonzero curvature characterized by Einstein’s cosmological term
Λ ∈ R. Λ may be positive or negative, but |Λ| is small. Recent cosmological
observations suggest that Λ or something like it might be out there, i.e. might
be important for understanding the acceleration of our expanding universe. The
Figure 11.39. Penrose diagram of electrically charged black hole (and also of slowly rotating
black hole). The red line represents a segment of the life-line of an in-falling inertial observer m, 
and the blue line represents the life-line of a suspended observer k. The time passed on the red 
line is finite, while the time passed on the blue line, i.e. for the suspended observer, is infinite. In 
principle, the in-falling observer has access in a finite wristwatch-time of his to all of the future 
history of the suspended observer k (an ultimate effect of “slow time” caused by BH’s). 


line-element is a generalization of the one of the charged BH (generalizing G_sb)
on p. 697:


$ds^2 = (1 - \tfrac{M}{r} + \tfrac{e}{r^2} - \Lambda r^2)\,dt^2 - (1 - \tfrac{M}{r} + \tfrac{e}{r^2} - \Lambda r^2)^{-1}\,dr^2 - r^2\,d\varphi^2.$


Here M is the mass of the BH, e is (the square of) its electric charge, and Λ
represents the hypothetical antigravitational property of intergalactic vacuum. The
parameters M, e, Λ can be chosen independently of each other, yielding various
special cases. Λ > 0 causes the timelike geodesics outside the outer
EH to behave as if an antigravitational force were pushing them outwards,
away from the EH, cf. Rindler, 2001, Sec. 14.4, pp. 304-306. If Λ = 0, we
Figure 11.40. Starting point for Gödel’s rotating cosmological model. This is a GR space-time,
the vector fields and local light-cones representing the local special relativity frames (LFR’s) are 
indicated. CTC’s can be seen in the figure. 


are in an asymptotically flat universe; Λ > 0 means negative curvature, hence
what is called a de-Sitter universe, while Λ < 0 means positive curvature (for
our vacuum), and a so-called anti-de-Sitter universe.

The choice Λ < 0 causes distant clocks to speed up¹⁵ (via the −Λr² term in
g_tt), while small Λ > 0 causes them to run slow (assuming Λr² < 1; at Λr² = 1
there is a “coordinate-event horizon”¹⁶ if M = e = 0). The behavior (speed) of
distant clocks determines the behavior of geodesics (gravitation) according to the
same “logic” as explained at the Schwarzschild BH on pp. 689-690. The choice
of M = e = 0 yields de-Sitter and anti-de-Sitter space-times respectively,
depending on the sign of Λ.
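
The effect of Λ on clocks and horizons can be explored numerically. The sketch below
assumes that the time-time component of the line-element of this section is
g_tt = 1 − M/r + e/r² − Λr² (our reading of the line-element), and locates its positive zeros
by a plain sign-change scan followed by bisection. The parameter values, the scan range and
the function names are illustrative choices of ours.

```python
import math

def g_tt(r, M, e, lam):
    """Time-time component 1 - M/r + e/r^2 - Lambda r^2."""
    return 1.0 - M / r + e / (r * r) - lam * r * r

def clock_rate(r, lam):
    """Rate of a suspended clock at radius r relative to coordinate time, for M = e = 0."""
    return math.sqrt(g_tt(r, 0.0, 0.0, lam))

def horizons(M, e, lam, r_min=1e-3, r_max=50.0, samples=50000):
    """Positive zeros of g_tt in [r_min, r_max], found by scanning and bisection."""
    roots = []
    step = (r_max - r_min) / samples
    a, fa = r_min, g_tt(r_min, M, e, lam)
    for i in range(1, samples + 1):
        b = r_min + i * step
        fb = g_tt(b, M, e, lam)
        if fa * fb < 0.0:
            lo, hi, flo = a, b, fa
            for _ in range(200):  # bisection on the bracketing interval
                mid = 0.5 * (lo + hi)
                fm = g_tt(mid, M, e, lam)
                if flo * fm <= 0.0:
                    hi = mid
                else:
                    lo, flo = mid, fm
            roots.append(0.5 * (lo + hi))
        a, fa = b, fb
    return roots

print(horizons(M=1.0, e=0.2, lam=0.01))   # electric BH in a de-Sitter space-time
print(horizons(M=1.0, e=0.2, lam=0.0))    # plain electrically charged BH
print(clock_rate(5.0, -0.01), clock_rate(5.0, 0.01))
```

For these sample values a small positive Λ adds a third, far-out zero of g_tt to the two
horizons of the charged black hole, and with M = e = 0 distant clocks run fast for Λ < 0 and
slow for Λ > 0.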





15 Hence Malament-Hogarth computers breaking the Turing Barrier become possible, cf. Németi and David,
2006, Hogarth, 2004.

16 The event horizon at r² = 1/Λ (i.e. where g_tt becomes 0 because of Λ) is in many respects like a huge
Schwarzschild BH turned inside out, cf. Rindler, 2001, p. 306, lines 11-22. An electric BH in a de-Sitter
space-time possesses three distinct event horizons, the innermost one caused, roughly, by e, the middle one
caused, roughly, by M, and far out the outermost one caused by Λ. As we move away from the singularity
lying on the time-axis t, in the positive r direction, time and space get interchanged at crossing each one of
the three event horizons.
4.5 Einstein’s field equations


In the present work we concentrate on the space-time aspects of general 
relativity (GR). One of the reasons for this choice is that to study GR, it is 
reasonable to start with studying GR space-time, this enables one to study 
advanced and exotic examples of GR space-times like black holes, wormholes, 
cosmological models etc. as done e.g. in Taylor and Wheeler, 2000, and then 
turn to studying Einstein’s field equations EFE and the rest of GR together 
with its “borderlines”. This order is followed in, e.g., Penrose, 2004. About 
this possible continuation of studies we note the following. EFE is not a new
axiom in the language of GR space-times restricting these. Instead, EFE is
a definitional expansion of GR space-times in the sense of definability theory
of mathematical logic (described in Sec. 2.6). EFE comes in two versions, a
more flexible version, EFE⁺, permitting the use of an extra parameter Λ for
“fine-tuning” our space-time, and a less flexible one, EFE, in which Λ = 0 is
assumed (or equivalently Λ is not used).

First we consider the Λ = 0 version. In this case, EFE is an explicit definition
associating a tensor field denoted as ⟨T_ij(p) : p ∈ M⟩, or briefly T_ij, to every
GR space-time G = ⟨M, g⟩. From the logical point of view, T_ij is a brand
new symbol not occurring in Genrel (or in the language of ⟨M, g⟩ or ⟨M, L⟩).
Hence Genrel + EFE can be regarded as a new theory expanding Genrel with
new kinds of entities not mentioned in Genrel (or in its manifold oriented
forms). Since EFE is an explicit definition (over Genrel) of the new entities
called T_ij, the new theory Genrel + EFE is a conservative extension of Genrel
(in the logical sense). This is the reason why we said earlier that EFE does
not restrict the generality of Genrel, though it introduces a new linguistic (or
conceptual) device to add such restrictions later if/when wanted and justified.

The physical role of EFE is the following. EFE helps us in elaborating the 
connections between Genrel and other physical theories (such as electrodynamics 
or mechanics). This is so because the new concept $T_{ij}$ (or new 
property $T_{ij}$ of $(M, g)$) can be interpreted in the various physical theories as 
representing typical physical quantities like mass-energy-momentum density 
at points $p \in M$. In other words, the “new” tensor field $T_{ij}$ can be regarded 
as associating various physical properties (or entities) to points p of the 
space-time under investigation. It is in this connection that $T_{ij}$ makes it possible for 
related theories of physics to induce restrictions on the models of Genrel via 
Genrel + EFE. 

The more flexible theory Genrel + EFE* is also a conservative extension 
of Genrel, but in EFE* we introduce two new concepts, $T_{ij}$ and Einstein’s 
cosmological parameter $\Lambda$. What is $\Lambda$? It is intended to specify the curvature 
of vacuum (see the footnote on the curvature of GR space-times). What do we 
mean by referring to the vacuum? In Genrel there 
was no such concept as the vacuum. Again, using the concept of vacuum 
is connected to the physical interpretations of the theory. Roughly, vacuum 
consists of those points p of the space-time where $T_{ij}(p) = 0$. The assumption 
$\Lambda = 0$ amounts to assuming that the curvature of vacuum is the same as the 
curvature of Minkowski space-time, i.e. as that of special relativity. Intuitively, 
Genrel + EFE* permits us to choose an arbitrary but fixed value $\Lambda \in \mathbb{R}$ for 
the whole space-time. Usually $|\Lambda|$ is small. It was this extra flexibility which 
made it possible for Kurt Gödel to specify his rotating universe (Gödel, 1949) 
as a universe containing only pressureless dust. 

EFE* can be written in the following form: 


(EFE*) $T_{ij} - \Lambda \cdot g_{ij} = \text{expression of } (g_{ij} \text{ and derivatives of } g_{ij}).$ 


Cf. e.g. Rindler, 2001, item 14.15, p. 303 or Wald, 1984, item 5.2.17, p. 99. In 
(EFE*) we suppressed the constants deriving from units of measurement. By 
inspecting (EFE*) above, one can see that instead of determining $T_{ij}$ 
(matter-energy-momentum-etc. density), it determines only the difference of $T_{ij}$ and $\Lambda$; 
more precisely, it tells us the value of $T_{ij} - \Lambda \cdot g_{ij}$ (from knowing $g_{ij}$ and its 
behavior). Hence (EFE*) leaves us a certain degree of freedom for distributing 
effects between $T_{ij}$ and $\Lambda$. Further, $g_{ij}$ occurs on both sides of the equation, 
hence (EFE*) is only an implicit circumscription, not an explicit definition. 
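
For orientation, the unspecified right-hand side of (EFE*) can be made explicit. The following is only a sketch in one common textbook convention, not the formulation used in this chapter: the Ricci tensor $R_{ij}$, the scalar curvature $R$, the Einstein tensor $G_{ij}$ and the coupling constant $\kappa$ are symbols that do not occur in the text above, and signs and constants differ between textbooks.

```latex
% Field equations with cosmological term (one common convention; an assumption here)
G_{ij} + \Lambda\, g_{ij} = \kappa\, T_{ij},
\qquad
G_{ij} := R_{ij} - \tfrac{1}{2} R\, g_{ij}.

% Suppressing the constant (\kappa = 1) and rearranging gives the schematic shape of (EFE*):
T_{ij} - \Lambda\, g_{ij} = G_{ij}.
```

Since $R_{ij}$ and $R$ are built from $g_{ij}$ and its first and second derivatives, the right-hand side is an expression of $g_{ij}$ and its derivatives, as the schematic form requires.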

For completeness, we note that EFE* can also be used for a kind of 
classification of space-times, roughly, in terms of what they may “contain”. An 
example is “vacuum space-times” which refer to space-times compatible with 
$T_{ij} = 0$ (uniformly). A complication here is that in principle the “division of 
labor” between $T_{ij}$ and $\Lambda$ is up to the interpreter’s mind to choose. E.g. de Sitter 
space-time (having a constant negative curvature) can be classified as a vacuum 
space-time with $\Lambda \neq 0$, or equivalently as one with $\Lambda = 0$ and $T_{ij}$ nonzero. 
This classification can be further stretched to associate “realisticity” or 
“physicality” to space-times but such judgements often turn out to be subjective later. 
For illustration we note that Minkowski space-time is vacuum and so are Gsp, 
Ge, the rotating BH’s space-time, but the electrically charged BH’s space-time 
is not vacuum (because the presence of an electric field at p implies $T_{ij}(p) \neq 0$). 
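
The “division of labor” between $T_{ij}$ and $\Lambda$ can be illustrated by a short calculation. This is a sketch under the assumption that the right-hand side of (EFE*) is abbreviated as a single tensor $E_{ij}$ built from $g_{ij}$ and its derivatives; the names $E_{ij}$, $\Lambda_0$ and $\widetilde{T}_{ij}$ are introduced here only for illustration.

```latex
% Reading 1: vacuum, with a nonzero cosmological parameter
T_{ij} = 0,\quad \Lambda = \Lambda_0 \neq 0
\;\Longrightarrow\;
E_{ij} = T_{ij} - \Lambda\, g_{ij} = -\Lambda_0\, g_{ij}.

% Reading 2: zero cosmological parameter, with a nonzero tensor field
\widetilde{T}_{ij} := -\Lambda_0\, g_{ij},\quad \Lambda = 0
\;\Longrightarrow\;
E_{ij} = \widetilde{T}_{ij} - 0 \cdot g_{ij} = -\Lambda_0\, g_{ij}.
```

Both readings impose the same condition on $g_{ij}$, so (EFE*) by itself cannot distinguish between them.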





7 The curvature of a GR space-time $G = (M, g)$ is a definable property of G. In more detail, the curvature 
tensor field of G is defined from the behavior of the geodesics of G. 


5. Connections with the literature 


Elaborating the logical foundations of relativity goes back to Hilbert’s 
6th Problem. Most of the (logic-oriented) work we are aware of concentrates 
on special relativity or on its fragments. Probably the first axiomatization for 
special relativity was given by Alfred Arthur Robb in 1914 (Robb, 1914), and 
his work is the model or starting point of many later axiomatizations. There 
are many works in which an axiom system for special relativity is given, a 
small sample (which is far from being complete) of these is: Robb, 1914, 
Reichenbach, 1969, Carathéodory, 1924, Alexandrov and his school starting 
with 1950 (Alexandrov, 1974, Guts, 1982), Suppes and his school starting with 
1959 (Suppes, 1959, Suppes, 1968, Suppes, 1972), Szekeres, 1968, Winnie, 
1977, Ax, 1978, Friedman, 1983, Mundy, 1986, Goldblatt, 1987, Schutz, 1997, 
Latzer, 1972. Of these, only Ax, 1978 and Goldblatt, 1987 are in first-order 
logic. These works usually stop with a kind of completeness theorem for their 
axiomatizations. What we call the analysis of the logical structure of relativ- 
ity theory begins with proving such a completeness theorem but the real work 
comes afterwards, during which one often concludes that we have to change the 
axioms. Very roughly, one could phrase this as “we start off where the others 
stopped (namely, at completeness)”. Most of this literature concentrates on what 
we call Specrel$_0$, namely the causal fragment of special relativity without its 
metric aspect (which is present in Specrel). We note that there are interesting 
works connecting modal logic with special relativity, e.g. Goldblatt, 1980, van 
Benthem, 1983, p. 4, pp. 22-29, Casini, 2002, Shehtman and Shapirovsky, 2003. 
For more on the literature see Andréka et al., 2006b. 

In contrast with special relativity, we know of only a few attempts at 
providing a logical analysis of general relativity. Examples are Basri, 1966, 
Kronheimer and Penrose, 1967, Schröter, 2007, Busemann, 1967, Ehlers et al., 1972, 
Walker, 1959. None of these examples tries to stay within the framework of first- 
order logic (or even something like that, say, second-order logic) or attempts 
proving something like a completeness theorem. In Sec. 3.6 of the present work 
we propose a relatively simple first-order logic axiomatization Genrel for gen- 
eral relativistic space-times, and in Theorem 11.28 we formulate a completeness 
theorem for Genrel. What remains as a future research task is doing “reverse 
mathematics” for Genrel, i.e. providing a conceptual analysis for Genrel which 
would be analogous to the conceptual analysis provided for Specrel in Sec. 2 
and in Andréka et al., 2002. A related future research task is to extend 
the present logic-based conceptual analysis to theories that do not yet exist 
but are conjectured to lie beyond general relativity, such as quantum gravity. 


Acknowledgments 


We are indebted to the participants of our 2004 and 2006 courses on “Relativ- 
ity and Logic” at Eötvös University, Budapest, and especially to Zalán Gyenis, 
Ramon Horvath, Gergely Székely, and Renata Tordai. We greatly profited from 
discussions and correspondence with Johan van Benthem, Gyula David, Robin 
Hirsch, Péter Németi, Miklós Rédei, László E. Szabó and the participants of the 
Oxford Space-time Workshop in 2004 and its follow-up conference in 2005. 
This research was supported by Hungarian Research grant OTKA T43242 as 
well as by a Bolyai Grant for Judit X. Madarász. 


References 


Aiello, M. and van Benthem, J. (2002). A modal walk through space. Journal 
of Applied Non-Classical Logics, 12(3-4):319-363. 

Alexandrov, A. D. (1974). On foundations of space-time geometry. I. Soviet 
Math. Dokl., 15:1543-1547. 

Andréka, H., Madarász, J. X., and Németi, I. (1998-2002). On the logical structure 
of relativity theories. Technical report, Rényi Institute of Mathematics, 
Budapest. http://www.math-inst.hu/pub/algebraic-logic/Contents.html. 

Andréka, H., Madarász, J. X., and Németi, I. (1999). Logical analysis of special 
relativity theory. In Gerbrandy, J., Marx, M., de Rijke, M., and Venema, Y., 
editors, Essays dedicated to Johan van Benthem on the occasion of his 50th 
birthday. Vossiuspers, Amsterdam University Press. CD-ROM, ISBN: 90 
5629 104 1, http://www.illc.uva.nl/j50. 

Andréka, H., Madarász, J. X., and Németi, I. (2004). Logical analysis of rela- 
tivity theories. In Hendricks, V., Neuhaus, F., Pedersen, S. A., Scheffler, U., 
and Wansing, H., editors, First-order Logic Revisited, pages 7-36. Logos 
Verlag, Berlin. 

Andréka, H., Madarász, J. X., and Németi, I. (2006a). Logical axiomatizations 
of space-time. Course material on the Internet, http://ftp.math-inst.hu/pub/ 
algebraic-logic/kurzus-2006/kurzus-h-2006.htm. 

Andréka, H., Madarász, J. X., and Németi, I. (2006b). Logical axiomatizations 
of space-time. Samples from the literature. In Prékopa, A. and Molnár, E., 
editors, Non-Euclidean Geometries: Janos Bolyai Memorial Volume, volume 
581 of Mathematics and Its Applications, pages 155-185. Springer Verlag. 

Andréka, H., Németi, I., and Sain, I. (1982). A complete logic for reasoning 
about programs via nonstandard model theory, Parts I-II. Theoretical Computer 
Science, 17:193-212, 259-278. 

Andréka, H., Németi, I., and Sain, I. (2001). Algebraic Logic. In Gabbay, 
D. M. and Guenthner, F., editors, Handbook of Philosophical Logic, vol- 
ume 2, pages 133-247. Kluwer Academic Publishers, second edition. See 
also www.math-inst.hu/pub/algebraic-logic/handbook.pdf. 

Andréka, H., Németi, I., and Wüthrich, C. (2006c). A twist in the geometry of 
rotating black holes: seeking the cause of acausality. Manuscript, Budapest 
and Berne. 

Ax, J. (1978). The elementary foundations of space-time. Found. Phys., 
8(7-8):507-546. 

Barbour, J. B. (1989). Absolute or relative motion? Cambridge University Press. 




Basri, S. (1966). A deductive theory of space and time. North-Holland, 
Amsterdam. 

Busemann, H. (1967). Time-like spaces, volume 53 of Dissertationes Math. 
(Rozprawy Math.). Mathematical Institute of Polish Academy of Sci. 

Carathéodory, C. (1924). Zur Axiomatik der speziellen Relativitätstheorie. 
Sitzungsber. phys. math., 14:12-27. 

Casini, H. (2002). The logic of causally closed space-time subsets. Classical and 
Quantum Gravity, 19(24):6389-6404. http://arxiv.org/abs/gr-qc/0205013. 

Chang, C. C. and Keisler, H. J. (1973). Model theory. North-Holland. 

d’Inverno, R. (1983). Introducing Einstein's Relativity. Oxford University Press. 

Earman, J. (1995). Bangs, crunches, whimpers, and shrieks. Singularities and 
acausalities in relativistic spacetimes. Oxford University Press, Oxford. 

Ehlers, J., Pirani, F. A. E., and Schild, A. (1972). The geometry of free fall and 
light propagation. In General relativity, Papers in Honor of J.L. Synge, pages 
63-84. Clarendon Press, Oxford. 

Einstein, A. (1961). Relativity (The special and the general theory). Wings 
Books, New York, Avenel, New Jersey. 

Etesi, G. and Németi, I. (2002). Non-Turing computations via Malament-Hogarth 
space-times. International Journal of Theoretical Physics, 41(2):341-370. 

Friedman, H. (2004). On foundational thinking 1, Posting in FOM (Foundations 
of Mathematics). Archives www.cs.nyu.edu. 

Friedman, M. (1983). Foundations of Space-Time Theories. Relativistic Physics 
and Philosophy of Science. Princeton University Press. 

Gödel, K. (1949). An example of a new type of cosmological solutions of 
Einstein’s field equations of gravitation. Reviews of Modern Physics, 21: 
447-450. 

Goldblatt, R. (1980). Diodorean modality in Minkowski spacetime. Studia Log- 
ica, 39:219-236. 

Goldblatt, R. (1987). Orthogonality and space-time Geometry. Springer-Verlag. 

Guts, A. K. (1982). Axiomatic relativity theory. Russian Math. Survey, 37(2): 
41-89. 

Hamilton, A. (1997-2001). Falling into a black hole. Internet page, 
http://casa.colorado.edu/~ajsh/schw.shtml. 

Hawking, S. W. and Ellis, G. F. R. (1973). The large scale structure of space- 
time. Cambridge University Press. 

Henkin, L., Monk, J. D., and Tarski, A. (1985). Cylindric Algebras Parts I, II. 
North-Holland, Amsterdam. 

Henkin, L., Monk, J. D., Tarski, A., Andréka, H., and Németi, I. (1981). Cylin- 
dric Set Algebras, volume 883 of Lecture Notes in Mathematics. Springer- 
Verlag, Berlin. 

Hilbert, D. (1899/1977). Grundlagen der Geometrie. Leipzig / B. G. Teubner 
Verlag, Stuttgart. 




Hirsch, R. and Hodkinson, I. (2002). Relation algebras by games. North- 
Holland. 

Hodges, W. (1993). Model theory. Cambridge University Press. 

Hogarth, M. L. (2004). Deciding arithmetic using SAD computers. Brit. J. Phil. 
Sci., 55:681-691. 

Horvath, R. (2005). An Alexandrov-Zeeman type theorem and relativity theory. 
Paper for scientific student contest, Eötvös Loránd University, Budapest. 

Kronheimer, E. H. and Penrose, R. (1967). On the structure of causal spaces. 
Proc. Camb. Phil. Soc., 63:481-501. 

Latzer, R. W. (1972). Nondirected light signals and the structure of time. 
Synthese, 24:236-280. 

Madarász, J. X. (2002). Logic and Relativity (in the light of definability the- 
ory). PhD thesis, ELTE, Budapest. http://www.math-inst.hu/pub/algebraic- 
logic/Contents.html. 

Madarász, J. X., Németi, I., and Székely, G. (2006a). First-order logic foundation 
of relativity theories. In New Logics for the XXIst Century II, Mathemati- 
cal Problems from Applied Logics, volume 5 of International Mathematical 
Series. Springer. To appear. philsci-archive.pitt.edu/archive/00002726/. 

Madarász, J. X., Németi, I., and Székely, G. (2006b). Twin paradox and the 
logical foundation of relativity theory. Foundations of Physics, 36(5):681- 
714. www.arxiv.org/abs/gr-qc/0504118. 

Madarász, J. X., Németi, I., and Tőke, Cs. (2004). On generalizing the 
logic-approach to space-time towards general relativity: first steps. In Hendricks, 
V., Neuhaus, F., Pedersen, S. A., Scheffler, U., and Wansing, H., editors, 
First-order Logic Revisited, pages 225-268. Logos Verlag, Berlin. 

Makkai, M. (1993). Duality and definability in first order logic. Number 503 
in Memoirs of the AMS. AMS. 

Misner, C. W., Thorne, K. S., and Wheeler, J. A. (1970). Gravitation. Freeman 
and Co, New York. Twentieth Printing 1997. 

Mundy, J. (1986). The philosophical content of Minkowski geometry. British 
J. Philos. Sci., 37(1):25-54. 

Németi, I. and Andréka, H. (2006). Can general relativistic computers break 
the Turing barrier? In Beckmann, A., Berger, U., Loewe, B., and Tucker, J. V., 
editors, Logical Approaches to Computational Barriers, Second Conference 
on Computability in Europe, CiE 2006, Swansea, UK, July 2006, Proceedings, 
volume 3988 of Lecture Notes in Computer Science, pages 398-412. 
Springer-Verlag, Berlin-Heidelberg. 

Németi, I. and David, Gy. (2006). Relativistic computers and the Turing barrier. 
Applied Mathematics and Computation, 178:118-142. 

Nicholls, P., editor (1982). The science in science fiction. Crescent Books, New 
York. 

Novikov, I. D. (1998). The river of time. Cambridge University Press. 







O’Neill, B. (1995). The geometry of Kerr black holes. A K Peters. 

Pambuccian, V. (2006). Alexandrov-Zeeman type theorems expressed in terms 
of definability. Aequationes Mathematicae. To appear. 

Penrose, R. (2004). The road to reality. A complete guide to the laws of the 
Universe. Jonathan Cape, London. 

Reichenbach, H. (1969). Axiomatization of the theory of relativity. University of 
California Press, Berkeley. Translated by M. Reichenbach. Original German 
edition published in 1924. 

Rindler, W. (2001). Relativity. Special, General and Cosmological. Oxford Uni- 
versity Press. 

Robb, A. A. (1914). A Theory of Time and Space. Cambridge University Press. 
Revised edition, Geometry of Time and Space, published in 1936. 

Sain, I. (1986). Nonstandard dynamic logic. Dissertation for candidate’s degree, 
Hungarian Academy of Sciences, Budapest. In Hungarian. 

Schröter, J. (2007). A new formulation of general relativity, Parts I-III. Adv. 
Theor. Math. Phys., 1:1-70. 

Schutz, J. W. (1997). Independent axioms for Minkowski space-time. 
Longman, London. 

Schwabhäuser, W., Szmielew, W., and Tarski, A. (1983). Metamathematische 
Methoden in der Geometrie. Springer-Verlag, Berlin. Hochschultext, 
viii+482pp. 

Shehtman, V. and Shapirovsky, I. (2003). Chronological future modality in 
Minkowski space-time. In Advances in Modal Logic-2002, pages 437-459. 
King’s College Publications, London. 

Simpson, S. G., editor (2005). Reverse Mathematics 2001. Lecture Notes in 
Logic. Association for Symbolic Logic. pp. x+401. 

Smolin, L. (2001). Three roads to quantum gravity. Basic Books. 

Suppes, P. (1959). Axioms for relativistic kinematics with or without parity. In 
Henkin, L., Tarski, A., and Suppes, P., editors, Symposium on the Axiomatic 
Method with Special Reference to Geometry and Physics, pages 291-307. 
North-Holland. 

Suppes, P. (1968). The desirability of formalization in science. The Journal of 
Philosophy, 27:651-664. 

Suppes, P. (1972). Some open problems in the philosophy of space and time. 
Synthese, 24:298-316. 

Szabó, L. E. (2002). The Problem of Open Future, Determinism in the light of 
relativity and quantum theory. Typotex, Budapest. 

Szabó, L. E. (2006). Empiricist studies on special relativity theory. Book 
manuscript, Budapest. 

Szczerba, L.W. (1970). Independence of Pasch’s axiom. Bull. Acad. Polon. Sci. 
Ser. Sci. Math. Astronom. Phys., 18:491-498. 




Szekeres, G. (1968). Kinematic geometry: an axiomatic system for Minkowski 
space-time. Journal of the Australian Mathematical Society, 8:134-160. 

Szmielew, W. (1974). The role of the Pasch axiom in the foundations of 
Euclidean Geometry. In Proc. of the Tarski Symp. held in Berkeley in 1971, 
pages 123-132. Providence, RI. 

Tarski, A. (1959). What is elementary geometry? In Henkin, L., Tarski, A., 
and Suppes, P., editors, Symposium on the Axiomatic Method with Special 
Reference to Geometry and Physics, pages 16-29. North-Holland. 

Tarski, A. and Givant, S. (1987). A formalization of set theory without variables, 
volume 41 of AMS Colloquium Publications. Providence, RI. 

Taylor, E. F. and Wheeler, J. A. (2000). Exploring Black Holes. Introduction to 
General Relativity. Addison Wesley Longman. 

Thorne, K. (1994). Black holes and time warps. Einstein’s outrageous legacy. 
W. W. Norton and Company. 

van Benthem, J. F. A. K. (1996). Exploring logical dynamics. Studies in Logic, 
Language and Information. CSLI Publications, Stanford. 

van Benthem, J. F. A. K. (1982). The logical study of science. Synthese, 51: 
431-472. 

van Benthem, J. F. A. K. (1983). The logic of time, volume 156 of Synthese 
Library. Reidel Pub. Co., Dordrecht. 

Wald, R. M. (1984). General Relativity. The University of Chicago Press. 

Walker, A. G. (1959). Axioms for Cosmology. In Henkin, L., Tarski, A., and Sup- 
pes, P., editors, Symposium on the Axiomatic Method with Special Reference 
to Geometry and Physics, pages 308-321. North-Holland. 

Winnie, J. A. (1977). The causal theory of space-time. In Earman, J. S., Glymour, 
C. N., and Stachel, J. J., editors, Foundations of space-time Theories, pages 
134-205. University of Minnesota Press. 

Wüthrich, C. (1999). On time machines in Kerr-Newman spacetime. Master’s 
thesis, University of Berne. 


Chapter 12 


DISCRETE SPATIAL MODELS 


Michael B. Smyth 
Imperial College, London & University of Birmingham 


Julian Webster 
Imperial College, London 


Second Reader 


John G. Stell 
University of Leeds 


1. Introduction 


By “discreteness” of a spatial model we generally understand that in any 
bounded neighbourhood, or (bounded) region, there are only finitely many 
elements of the carrier of the model. Thus the model should be (at least) locally 
finite. Another way of putting it is that the bounded regions should not be 
infinitely subdivisible. 

As is well known, Aristotle argued (Physics: II,6; VI,2) that space and time 
intervals are potentially infinitely subdivisible. In 20th century mathematics, a 
similar position was adopted by Brouwer, in opposition to the actual infinities 
of Cantorian set theory. In proposing discrete spatial models, we embrace the 
possibility that space is (locally) neither actually nor potentially infinite, but 
actually finite. 

We mention three streams of motivation for adopting such a point of view. 
The first of these is found in the “tolerance space” tradition, going back to 
Poincaré, 1905. The idea here is that any perceptual (as opposed to mathemati- 
cal or physical) “continuum” must be finite and, moreover, can be structured by 
means of a binary relation, with the aid of which some rudimentary topology 
can be developed. Poincaré proposed a definition (not ultimately successful) of 
dimension for such spaces. Many years later, Zeeman and his student Poston 
gave much fuller, technical accounts of the idea. More details, with references, 
are provided below. 

Both Zeeman and Poston envisaged that the ideas might be applicable not only 
to perceptual but also to physical continua: see, for example, the concluding 
paragraphs of Zeeman, 1962. Around the same time (but independently and 
from an entirely different point of view), R. Penrose proposed certain discrete 
structures, mathematically not unlike tolerance spaces, as suitable building 
blocks for physical continua: “spin networks”. In recent years spin networks 
have been taken up by several authors, along with other discrete structures 
such as “causal sets” (which are posets, ordered by a causality relation) as 
having the potential for quantizing space-time. According to the enthusiasts, 
the situation with regard to the discreteness or atomicity of space and time may 
be compared with that obtaining around a century ago, when physicists were 
at last beginning to accept the atomic theory of matter. Apart from Penrose, 
1971 there do not, however, seem to be any canonical references in this area. 
A popular account which emphasizes the (claimed) discreteness of space and 
time is Smolin, 2001. As a sample of more technical references: Markopoulou 
and Smolin, 1997; Sorkin, 2002. 

We do not attempt to deal with physics in the present work. It may be 
worth mentioning, though, that our theory of dimension (Sec. 6) is based to 
a considerable extent on that of Evako, who was motivated at least in part 
by physics. Also, we draw inspiration from quantum logic in developing our 
theory of regions (Sec. 5.7). 

A third stream of work is more pertinent to our efforts here. This is the 
field known as digital topology. In digital image processing, the space and 
the images (or regions) have to be representable discretely. One may seek to 
develop a topology and geometry which can handle these discrete representa- 
tions directly, without having to embed them in the traditional continua. The 
main approach uses structures which are, in effect, tolerance spaces, in which the 
binary relation is (usually) adjacency of pixels rather than perceptual indistin- 
guishability. A second approach (Khalimsky et al., 1990) uses true topological 
spaces rather than graphs/tolerance spaces. We regard these approaches as 
entirely compatible (see Sec. 2). 
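
To make the notion of a tolerance space concrete, here is a minimal Python sketch. It assumes the usual formalization (a set with a reflexive, symmetric, not necessarily transitive relation); the function names, the threshold `eps` and the choice of 4-adjacency are illustrative assumptions, not taken from this chapter. The first relation models perceptual indistinguishability, the second adjacency of pixels.

```python
from itertools import product

def indistinguishable(points, eps):
    """Tolerance relation: x ~ y iff |x - y| <= eps (eps is a hypothetical threshold)."""
    return {(x, y) for x in points for y in points if abs(x - y) <= eps}

def pixel_adjacency(width, height):
    """Tolerance relation on a pixel grid: equal or 4-adjacent pixels are related."""
    pixels = list(product(range(width), range(height)))
    return {
        (p, q)
        for p in pixels
        for q in pixels
        if abs(p[0] - q[0]) + abs(p[1] - q[1]) <= 1
    }

def is_tolerance(carrier, rel):
    """Check reflexivity on the carrier and symmetry of the relation."""
    reflexive = all((x, x) in rel for x in carrier)
    symmetric = all((y, x) in rel for (x, y) in rel)
    return reflexive and symmetric

def is_transitive(rel):
    """Check transitivity by composing every pair of related pairs."""
    return all((x, w) in rel for (x, y) in rel for (z, w) in rel if y == z)

points = [0, 1, 2, 3]
rel = indistinguishable(points, eps=1)   # 0 ~ 1 and 1 ~ 2, but 0 and 2 are unrelated
grid = pixel_adjacency(3, 2)
```

Both relations are reflexive and symmetric, but the first is not transitive, which is what separates a tolerance relation from an equivalence relation.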

We describe now some of the distinctive features of the work which follows. 
In the “topological” part of the chapter (Sec. 2-6) the spaces are envisaged 
as belonging to the category of Čech closure spaces. Graphs and topological 
spaces alike are objects of this category. Formulating key definitions at the 
level of closure spaces means that we can have a uniform treatment for graphs 
(especially tolerance spaces) and topological spaces. An important example of 
this is the treatment of regular sets. The regular open and regular closed sets 
of topology are generalized to regular interior and closure sets at the level of 
closure spaces. Besides closure spaces we consider (Sec. 4) the more familiar 
closure systems in which the operator, in contradistinction to the Čech closure, 
is idempotent but is not required to distribute over unions. This is done with 
a view to developing some techniques with which to study the collection of 
regular interior sets of a closure space. The important fact emerges that the 
regular sets form a complete ortholattice, but not in general a Boolean algebra 
(because not distributive). In contexts in which we wish to emphasize this 
ortholattice structure (and this is generally the case) we shall, as explained in 
Sec. 4, refer to the regular interior sets as the orthoclosed sets. In the main case 
of interest here, the tolerance space, some of the distinctions made in Sec. 4 are 
superfluous, and we have: 


interior set = regular interior set = orthoclosed set. 


These sets are intended to serve as the regions of our spaces. 
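
The following Python sketch illustrates these claims on a small graph. It rests on assumptions that anticipate, but are not quoted from, the later sections: the closure of a set is taken to be the set of points whose closed neighbourhood meets it, the interior is the dual operator, and a set A is called a regular interior set when interior(closure(A)) = A. All function names are ours.

```python
from itertools import combinations

def neighbourhoods(vertices, edges):
    """Closed neighbourhood N(x): x together with all vertices adjacent to x."""
    nbh = {v: {v} for v in vertices}
    for a, b in edges:
        nbh[a].add(b)
        nbh[b].add(a)
    return nbh

def closure(nbh, subset):
    """Assumed graph closure: all points whose neighbourhood meets the subset."""
    return frozenset(x for x in nbh if nbh[x] & subset)

def interior(nbh, subset):
    """Dual operator: all points whose whole neighbourhood lies in the subset."""
    return frozenset(x for x in nbh if nbh[x] <= subset)

def all_subsets(vertices):
    """Enumerate every subset of the vertex set as a frozenset."""
    items = list(vertices)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)

# Path graph 0 - 1 - 2 - 3, viewed as a tolerance space.
V = [0, 1, 2, 3]
nbh = neighbourhoods(V, [(0, 1), (1, 2), (2, 3)])

once = closure(nbh, frozenset({0}))   # {0, 1}
twice = closure(nbh, once)            # {0, 1, 2}: the closure is not idempotent

interior_sets = {interior(nbh, s) for s in all_subsets(V)}
regular_sets = {s for s in all_subsets(V) if interior(nbh, closure(nbh, s)) == s}
```

On this path the interior sets and the regular interior sets coincide: they are the six sets ∅, {0}, {3}, {0, 1}, {2, 3} and the whole space. Ordered by inclusion these form a hexagon, which is not distributive, in line with the remark above that the regular sets need not form a Boolean algebra.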

The question immediately arises as to whether we can hope to work with a 
region lattice that is not distributive. The three extended examples in Sec. 5 
are intended to assist in answering this question. The first of these examples 
deals with the “logic” in which ortholattices which are not distributive feature 
most prominently, namely quantum logic. In the quantum setting it turns out 
that the lattices in question are actually orthomodular, a condition which can 
be regarded as a weak form of distributivity. In Theorem 12.29 and Proposi- 
tion 12.30 we have tried to summarize the results from the quantum structures 
literature that may have a bearing on our endeavours here. In the next two 
examples of Sec. 5 we consider region connection theory, and especially some 
attempts which have been made to allow for discrete models. (In comparing 
this kind of work with tolerance spaces, the graph relation is now construed as 
connection.) The problem is that in the well known formulations of connection 
theory, whether by Whitehead, Clarke, or the authors of RCC, non-discreteness 
(infinite divisibility) is built in. Thus we are particularly interested in attempts 
by J. Stell and associates to develop a discrete connection theory. Two kinds of 
lattices (dual p-algebras and bi-Heyting algebras) are proposed by these authors 
as capable of supporting a discrete region theory. The bi-Heyting algebra in 
particular is claimed (as suggested originally by Lawvere) to be well suited for 
geometry as it permits a nice notion of boundaries of regions to be introduced. 
But there is a problem with this: no matter what the region, its boundary is 
always 0-dimensional. Thus the proposal seems to be ruled out on dimensional 
grounds. 

In Sec. 6 we develop our theory of boundary and dimension. It builds on 
work of Evako and associates. In the simple case of tolerance spaces, the Evako 
dimension is in effect given by the size of the largest clique. But this means 
that products do not work well: in effect the dimension (strictly speaking, rank) 
of a product space is the product of the dimensions of its factors rather than their sum. Our 
solution involves the idea that some cliques are better than others. (Not all 
cliques can be considered as true “cells” of the space.) Restricting attention to 
the “good” cliques, we get a poset whose length may be taken as the dimension 
of the space. 
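
The problem with products can be checked by brute force on small examples. The sketch below is illustrative only: it assumes that the rank is the size of the largest clique and that the product of two tolerance spaces relates two pairs exactly when they are related in both coordinates; neither definition is quoted from Sec. 6, and all names are ours.

```python
from itertools import combinations, product

def related(edges, a, b):
    """Reflexive, symmetric tolerance relation given by an edge set."""
    return a == b or (a, b) in edges or (b, a) in edges

def largest_clique(vertices, rel):
    """Brute-force size of the largest set of pairwise related points."""
    vertices = list(vertices)
    for size in range(len(vertices), 0, -1):
        for cand in combinations(vertices, size):
            if all(rel(a, b) for a, b in combinations(cand, 2)):
                return size
    return 0

# A triangle (three mutually related points) and a path 0 - 1 - 2.
tri_vertices = ["a", "b", "c"]
tri_edges = {("a", "b"), ("b", "c"), ("a", "c")}
seg_vertices = [0, 1, 2]
seg_edges = {(0, 1), (1, 2)}

def tri_rel(x, y):
    return related(tri_edges, x, y)

def seg_rel(x, y):
    return related(seg_edges, x, y)

# Assumed product: pairs are related iff they are related in both coordinates.
prod_vertices = list(product(tri_vertices, seg_vertices))

def prod_rel(p, q):
    return tri_rel(p[0], q[0]) and seg_rel(p[1], q[1])

tri_rank = largest_clique(tri_vertices, tri_rel)
seg_rank = largest_clique(seg_vertices, seg_rel)
prod_rank = largest_clique(prod_vertices, prod_rel)
```

Here the factors have ranks 3 and 2, and the product has rank 6, which is their product and not their sum.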

In Sec. 7 we begin to go beyond topology, specifically to convex and affine 
structure. It is now assumed that the elements of the tolerance space are them- 
selves cells, rather than structureless points. Formally, a cell is just a finite 
subset of the ground set. Informally, we can think of a cell A as the rela- 
tive interior of the convex hull of A (and the connection, or tolerance, relation 
as overlapping of these open cells). Two features of the material in Sec. 7 
are particularly significant. It turns out that, when the space is realizable (in 
terms of finite subsets of a Euclidean space), the lattice of regions is always 
orthomodular—and thus has “plenty of distributivity” within it, even though the 
lattice as a whole is not distributive. An unsolved problem at present is whether 
this orthomodularity property holds for “cell spaces” in general, whether real- 
izable or not. The significance of orthomodularity is illustrated by showing that 
it leads to a particularly simple definition of triangulation (Sec. 7.4). 

The second feature of the cell space material to be highlighted here is the 
identification, given a few plausible axioms, of such spaces with oriented ma- 
troids. We begin to explain this identification in Sec. 7.3. The proof of the 
equivalence is then given in Sec. 9. This proof goes via a new axiomatization 
of oriented matroids (via “surrounding sets”); thus the main burden of the proof 
is to show that this axiomatization is actually equivalent to the standard ones. 
An important feature of this equivalencing of geometries is that it works with 
spherical versions of the spaces involved. 

Along with sphericity comes a commitment to spaces that are finite, rather 
than just locally finite. (In topological terms we have: compact + locally finite 
=> finite.) This is fully apparent in the remainder of the chapter, in which we 
discuss what is by far the most developed approach to a purely combinatorial 
account of affine and convex structure: matroid theory. 

Sec. 8 is an introduction to matroids and Sec. 9 an introduction to oriented 
matroids. Oriented matroids may be considered as combinatorial Euclidean 
geometries. Any finite subset of $\mathbb{R}^n$ inherits an oriented matroid structure, 
which captures its subspace geometry. Further, any oriented matroid (a purely 
combinatorial structure defined axiomatically with no reference to $\mathbb{R}^n$) can be 
realized as a collection of pseudo-spheres on the surface of an n-sphere. The 
theory illustrates, subject to a reservation to be mentioned shortly, the basic 
approach to space taken in this chapter. 
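
As a small illustration of how a finite point set carries purely combinatorial orientation data, the following Python sketch records, for every triple of points in the plane, whether it is oriented counterclockwise, clockwise, or is collinear. This sign map is commonly called a chirotope, a term not used in the passage above; the oriented matroid axioms themselves are the subject of Sec. 9 and are not reproduced here. The point coordinates and function names are hypothetical.

```python
from itertools import combinations

def orientation(p, q, r):
    """Sign of det [[1, px, py], [1, qx, qy], [1, rx, ry]]:
    +1 counterclockwise, -1 clockwise, 0 collinear."""
    det = (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])
    return (det > 0) - (det < 0)

def chirotope(points):
    """Map each index triple i < j < k to the orientation of its points."""
    return {
        (i, j, k): orientation(points[i], points[j], points[k])
        for i, j, k in combinations(range(len(points)), 3)
    }

# Hypothetical configuration with integer coordinates, so the signs are exact.
points = [(0, 0), (2, 0), (1, 1), (1, 3), (4, 0)]
chi = chirotope(points)
```

The resulting table no longer mentions coordinates: two point sets with the same table are indistinguishable as far as this orientation data is concerned.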

To have a categorical axiomatization (that is, an axiom system with only one 
model) is not at all a desideratum in our approach. Rather what we have is that 
there are, at least potentially, infinitely many oriented matroids, some of which 
contain more structure than others. Category theory provides the language 
in which to discuss this issue more adequately. The class of all finite spaces 
of a given type should form a category, with refinement (i.e. more structure) 
defined in terms of morphisms. Completeness of this category with respect 
to the appropriate (most usually, inverse) limits would be attained by adding 
in the infinite, classical models of the axioms. That such completeness would 
obtain is what we have, in a special case, called the “Correspondence Principle” 
(Sec. 2). It is perhaps worth mentioning that such a category of spaces is being 
proposed for its theoretical significance, rather than directly as a foundation 
for spatial computation. Programs are thought of as being written in terms of 
the spatial models themselves, the objects of the category. Although practical 
computational issues are not a major concern in this chapter, we illustrate the 
point with a short discussion of Knuth’s work on oriented matroids (Sec. 10). 

Now we have to admit that “the” category of spaces is (known and) avail- 
able only in certain cases. For topology, such categories and constructions 
are certainly available; see Sec. 1 for references to our previous work in this 
area. For geometry, particularly as formulated in terms of oriented matroids, 
the main stumbling block at present is an almost complete lack of morphisms. 
(This is the “reservation” mentioned earlier.) In Sec. 10 we suggest that the 
way forward might be to now try to go beyond synthetic geometry and develop 
a finite algebraic theory of space. It seems anyway a natural progression from 
topology to geometry to algebra in seeing how far the combinatorial method 
of spatial representation can be pushed. The models in this theory are very 
likely to be spherical, which is why in Sec. 9 spherical oriented matroids are 
discussed more than flat ones. Basic to combinatorial sphericity is the notion of 
an involution (the involute of a point on the surface of a sphere is its antipode), 
and the natural geometric interpretation of a set together with an involution is 
a crosspolytope (rather than a simplex, which is the natural geometric interpre- 
tation of a set). The vertex-edge graph of the n-crosspolytope is the complete 
n-partite graph discussed in Sec. 1. 

One might expect that symmetry is fundamental in any spherical formulation 
of space, and Sec. 10 concludes with an outline of recent work by Gelfand and 
others on Coxeter matroids, which is a generalization of matroids entirely in 
terms of Coxeter groups. 


2. Preliminaries; correspondence principle 


Following some basic definitions concerning graphs, this section introduces 
what we term the Correspondence Principle. This is essentially the idea that 
graph theory provides discrete counterparts for key topological notions. As 
a sample of the evidence for the validity of this principle, we present the 
graph-theoretic counterparts of two significant topological/metric space notions, 
namely the contractible space and the hyperconvex metric space. These 
counterparts are: the dismantlable graph and the Helly graph. Definitions and 
examples of these counterparts are provided. We believe that both concepts are 
of importance in discrete spatial modelling, although for reasons of space we 
shall not develop this theme in this chapter. The class of dismantlable graphs 
will make a brief appearance in the discussion of surfaces in Sec. 6. The Helly 
concept as such will not be needed later on; however, the graph which serves 
us as the discrete counterpart of the n-cube, and which has a substantial role in 
what follows, is actually a characteristic Helly graph. (For the sense in which 
the discrete cube characterizes the class of Helly graphs, see Observation 2 
preceding the statement of the Correspondence Principle below.) 

A (di)graph (G, R) is for us simply a binary relation R on the set G (of 
vertices). Thus, our graphs do not admit multiple edges. A loop is present at a 
vertex x, or not, according to whether $xRx$ or not. (We do not employ the conventional 
term “simple graph”, as it blurs the distinction between reflexive and 
irreflexive relations.) We almost always want the relation R to be symmetric, 
and this is generally intended by the use of the term graph. Moreover we are 
principally interested in two cases: 


1 The relation is reflexive. Use of the symbol ~ for the relation indicates 
this case. 


2 The relation is irreflexive (no vertex is related to itself). The symbol $\perp$ 
will indicate this case. 


Given a graph (G, ~), we sometimes use the notation co-G for the irreflexive 
graph with vertex set G and relation the complement of ~; likewise if we start 
with a graph $(G, \perp)$. 

A (graph) morphism is a relation-preserving map (on vertices). The product 
of graphs is for us the category product, i.e. $(G, R) \times (G', R')$ is 
$(G \times G', R \times R')$, where $(x, x')\,(R \times R')\,(y, y')$ holds iff $xRy$ and $x'R'y'$. (Many other notions 
of product are used by graph theorists.) We denote the m-path, that is, the graph 
with m + 1 vertices $v_0, v_1, \ldots, v_m$ and with $v_i R v_j$ iff $|i - j| \le 1$, by $I_m$. Our 
reason for departing from the usual notation here is that the paths serve for us 
as “discrete (unit) intervals”. In particular, we consider $I_m^n$ (that is, the n-fold 
product of the path $I_m$) as a digital n-cube. 
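
As an illustrative sketch (not part of the original text), the path $I_m$ and the category product can be built directly from the definitions just given. The function names `path` and `graph_product`, and the encoding of a graph as a pair (vertex list, set of related pairs), are assumptions made for this example only.

```python
from itertools import product


def path(m):
    """Reflexive m-path I_m on vertices 0..m: i ~ j iff |i - j| <= 1."""
    verts = list(range(m + 1))
    rel = {(i, j) for i in verts for j in verts if abs(i - j) <= 1}
    return verts, rel


def graph_product(g, h):
    """Category product: (x, x') ~ (y, y') iff x ~ y and x' ~ y'."""
    (gv, gr), (hv, hr) = g, h
    verts = list(product(gv, hv))
    rel = {(a, b) for a in verts for b in verts
           if (a[0], b[0]) in gr and (a[1], b[1]) in hr}
    return verts, rel


# I_2^2: a digital square with 3 x 3 vertices.
grid_verts, grid_rel = graph_product(path(2), path(2))
```

In this product, diagonal neighbours such as (0, 0) and (1, 1) are adjacent, so the digital square carries what image processing calls 8-adjacency rather than 4-adjacency.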

A second useful operation on graphs $G$, $G'$ (whose vertex sets are assumed 
disjoint), sometimes called the sum, is defined as follows: $G * G'$ has vertex 
set $G \cup G'$. Every vertex of $G$ is taken as adjacent to every vertex of $G'$. 
Additionally, two vertices belonging to the same summand ($G$ or $G'$) are 
adjacent in $G * G'$ exactly if they are adjacent in that summand. 

We sometimes let the natural number n denote the (reflexive) graph with n 
vertices and no edges other than the self-loops. With this notation, the complete 
k-partite graph with k components, having $n_1, \ldots, n_k$ vertices respectively, can 
be expressed as $n_1 * \cdots * n_k$. The notation $K(n_1, \ldots, n_k)$, or some variant 
thereof, is commonly used for this graph. The case where each $n_i = 2$ will be 
particularly important later. 
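
The sum operation and the expression of a complete k-partite graph as an iterated sum can be sketched as follows. This is a minimal illustration under assumed names (`discrete`, `graph_sum`, `complete_multipartite`); vertices are tagged with the index of their summand so that the vertex sets are disjoint, as the definition requires.

```python
def discrete(n, tag):
    """The reflexive graph 'n': n vertices with self-loops only.
    The tag keeps the vertex sets of different summands disjoint."""
    verts = [(tag, i) for i in range(n)]
    return verts, {(v, v) for v in verts}


def graph_sum(g, h):
    """Sum G * G': keep both relations, and join every vertex of G
    to every vertex of G' (in both directions)."""
    (gv, gr), (hv, hr) = g, h
    cross = {(a, b) for a in gv for b in hv}
    return gv + hv, gr | hr | cross | {(b, a) for (a, b) in cross}


def complete_multipartite(sizes):
    """n_1 * ... * n_k, i.e. K(n_1, ..., n_k) with self-loops."""
    graph = discrete(sizes[0], 0)
    for tag, n in enumerate(sizes[1:], start=1):
        graph = graph_sum(graph, discrete(n, tag))
    return graph


# Each n_i = 2 with k = 3: the vertex-edge graph of the octahedron.
octa_verts, octa_rel = complete_multipartite([2, 2, 2])
```

Every vertex of the resulting graph is adjacent to all vertices except the other member of its own part, which plays the role of its antipode.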

A clique of a graph (G, R) is a set C of vertices such that, for every pair 
of distinct elements x, y of C, $xRy$. A simplicial complex is a collection $\mathfrak{S}$ of 
finite subsets of a set S, including each singleton set, such that 


$$A \subseteq B,\ B \in \mathfrak{S} \;\Rightarrow\; A \in \mathfrak{S}.$$


The cliques of any (locally finite) graph provide an example, indeed the main 
example that we are interested in. The simplicial complex $\mathfrak{S}$ (over S) will be 
called graph-like if the simplexes of $\mathfrak{S}$ coincide with the cliques of some graph 
(S, R). Here we can specify R to be the relation given by: 


$$xRy \iff \exists A \in \mathfrak{S}.\ x \in A \wedge y \in A.$$


For any reflexive graph G we have the associated metric $d_G$, where $d_G(x, y)$ 
is the length of the shortest path (measured by the number of edges) from x to 
y. For any vertex $x \in G$ the k-ball $B(x, k)$ is the set $\{y \mid d_G(x, y) \le k\}$. 

In any digraph (G, R) we take the neighbourhood N(v) of a vertex to be the 
set $\{x \mid vRx\}$. If G is a reflexive graph, this is the 1-ball $B(v, 1)$. Sometimes 
the punctured neighbourhood $N_0(v) = \{x \mid vRx\} \setminus \{v\}$ is required. 
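
The graph metric and the k-balls can be computed by breadth-first search. The sketch below assumes a connected reflexive graph stored as a dictionary mapping each vertex to its neighbourhood N(v) (which contains v itself); the function names are illustrative, not taken from the text.

```python
from collections import deque


def distances_from(adj, x):
    """Breadth-first search: d_G(x, y) for every y reachable from x."""
    dist = {x: 0}
    queue = deque([x])
    while queue:
        v = queue.popleft()
        for w in adj[v]:
            if w not in dist:
                dist[w] = dist[v] + 1
                queue.append(w)
    return dist


def ball(adj, x, k):
    """B(x, k) = {y | d_G(x, y) <= k}."""
    return {y for y, d in distances_from(adj, x).items() if d <= k}


def punctured_neighbourhood(adj, v):
    """N_0(v) = N(v) minus {v}."""
    return adj[v] - {v}


# Reflexive 6-cycle: each vertex is adjacent to itself and its two neighbours.
c6 = {i: {(i - 1) % 6, i, (i + 1) % 6} for i in range(6)}
```

For the reflexive 6-cycle, `ball(c6, 0, 1)` coincides with the neighbourhood `c6[0]`, as the remark on 1-balls states.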

If $G'$ is a subset of the set of vertices of the digraph (G, R), the induced 
subgraph on $G'$ is simply the restriction $(G', R')$ of G to $G'$: that is, $vR'w$ in 
$G'$ iff $vRw$ in G. If $G'$ is an induced subgraph of G, and there is in addition 
a surjective morphism from G onto $G'$, we say that $G'$ is a retract of G (and 
that the surjective morphism is a retraction from G to $G'$). Simple examples 
are provided by the wheels and cycles to be mentioned in a moment. 

In keeping with our view of graphs as spatial models, we shall sometimes 
take the liberty of referring to reflexive graphs as “tolerance spaces”, and to 
irreflexive graphs as “orthogonality spaces”. There is in fact ample precedent 
for this kind of terminology. Poincaré, 1905 suggested that the “perceptual 
continuum” is, or should be, structured by means of a binary relation of in- 
discernibility, which is reflexive and symmetric, but not in general transitive. 
Having lain fallow for many years, the idea (along with the terminology of 
“tolerance spaces”) was reintroduced by Zeeman around 1960: Zeeman, 1962. 
Under the name “fuzzy geometry” it was comprehensively developed, as a kind 
of alternative topology, by Poston in his thesis (Poston, 1971). For a concise 
treatment of tolerance spaces, and of their category Tol (the morphisms be- 
ing the usual graph morphisms) see Sossinsky, 1986. Our own contributions 
to it are found in Smyth, 1995; Smyth, 1997. Another recent contribution is 
Georgatos, 2003. 
The name “orthogonality space” for the irreflexive graph, on the other hand, 
has a precedent in discussions of quantum logic (a topic which we consider 
briefly in Sec. 5.1 below). 

We illustrate the topological and metric aspects of graphs by an informal 
discussion of two (related) concepts. Almost nothing in the remaining sections 
depends on these examples, and we shall make use of some (standard) general 
terminology without troubling to define it. 

The first concept is that of a contractible (also dismantlable) graph. The 
definition can be given in a way that reads exactly as in topology: A graph G 
is contractible if the identity function on (the vertices of) G is homotopic to a 
constant function. Several remarks are in order: 


1 Homotopy for graphs can be treated in an entirely discrete fashion. Instead 
of the continuum [0,1] one may use the discrete intervals (paths) 
$I_n$: see for example Poston, 1971. 


2 For finite reflexive graphs, an elementary definition, in recursive style, is 
available: G is dismantlable if it has a single vertex, or if there exists a vertex 
v of G such that, for some vertex u distinct from v, $N(v) \subseteq N(u)$, and $G - \{v\}$ 
is dismantlable. (A prime example of a “vertex dismantling” definition.) 


3 Dismantlable non-reflexive graphs have recently been applied in modelling 
physical systems exhibiting “hard constraints”: see Brightwell, 2000. 


Simple examples of dismantlable and non-dismantlable graphs are provided by 
the wheels $W_n$ and the cycles $C_n$ ($n \ge 3$). Notice that $W_n = 1 * C_n$. The 
wheels and $C_3$ are dismantlable; the remaining cycles are non-dismantlable. 
As an illustration of previously defined terms, we also observe: for $n \ge 4$, $C_n$ 
is an induced subgraph but not a retract of $W_n$, while $I_2$ is a retract of every 
such wheel and cycle. 


Figure 12.1. (a) A cycle; (b) the wheel $W_5$. 


If we define a cone to be any (reflexive) graph of the form $1 * G$, then we 
evidently have that every cone is dismantlable. In fact, it is easy to see that if a 
graph H is expressible as a sum $G * G'$ and at least one of the summands $G$, $G'$ 
is dismantlable, then H is dismantlable. 
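
The recursive "vertex dismantling" definition lends itself to a direct check on small graphs. The following is a sketch under two stated assumptions: the one-vertex graph is taken as the base case, and dominated vertices are removed greedily, relying on the standard fact that for finite reflexive graphs the order of removal does not affect the outcome. The helper names `cycle`, `wheel` and `is_dismantlable` are hypothetical.

```python
def cycle(n):
    """Reflexive cycle C_n as a dict: vertex -> neighbourhood N(v)."""
    return {i: {(i - 1) % n, i, (i + 1) % n} for i in range(n)}


def wheel(n):
    """W_n = 1 * C_n: a hub adjacent to every vertex of the cycle."""
    g = cycle(n)
    for i in range(n):
        g[i].add("hub")
    g["hub"] = set(range(n)) | {"hub"}
    return g


def is_dismantlable(adj):
    """Repeatedly delete a vertex v with N(v) contained in N(u), u != v."""
    adj = {v: set(ns) for v, ns in adj.items()}  # work on a copy
    while len(adj) > 1:
        dominated = next((v for v in adj for u in adj
                          if u != v and adj[v] <= adj[u]), None)
        if dominated is None:
            return False
        del adj[dominated]
        for ns in adj.values():
            ns.discard(dominated)
    return True
```

On these inputs the check agrees with the statements above: `is_dismantlable(cycle(3))` and `is_dismantlable(wheel(5))` hold, while `is_dismantlable(cycle(4))` does not.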

The second concept has a metric aspect. A graph G is said to be a Helly 
graph if the set of balls of G has the Helly property: every collection of pairwise 
intersecting balls has a non-empty intersection. Some observations follow: 


1 Every (finite) Helly graph is dismantlable: Quilliot, 1983; Bandelt and 
Pesch, 1989. 


2 Helly graphs derive much of their significance from the following charac- 
terization: they are the injective objects in the category Tol (with respect 
to isometric embeddings). A more concrete characterization is: a graph 
is Helly if and only if it is a retract of a product of paths. 


3 Helly graphs have their counterpart in classical analysis: the hypercon- 
vex metric spaces. The details of this correspondence, together with a 
common generalization of the two concepts (namely, the hyperconvex 
semi-metric space), have been set out in Smyth and Tsaur, 2002. 


We have been led by these, and many other, considerations to propose the 
following: 


CORRESPONDENCE PRINCIPLE: The central ideas of topology have discrete 
counterparts in graph theory. Moreover, the former arise by a limiting process 
from the latter. 


“Ideas” here is meant to cover both definitions and theorems. For the second 
part of the Principle to make sense, we have to think of graphs and topological 
spaces as existing in the same category. There are several interesting possible 
choices for such a category. The best-known of these will make an appearance 
in the next section. 

We should emphasize that, in the present chapter, we shall not be studying 
the limiting processes (especially inverse limits) needed to fully justify the 
Correspondence Principle. We have written extensively about such matters 
in previous work (Webster, 1997; Smyth and Webster, 2002; Smyth, 1995; 
Smyth, 1997). Here we shall illustrate, rather, the first part of the C.P. with a 
very simple example. The idea of the fixed point(s) of a function is certainly 
central in topology. What is the discrete counterpart? It turns out that we need 
to work with “almost fixed points” (or fixed points up to tolerance): 


DEFINITION 12.1 Let H be a graph. An almost fixed point of an endomorphism 
$h : H \to H$ is a vertex p such that $h(p) \sim p$. H is said to have the 
almost fixed point property (AFPP) if every endomorphism of H has an almost 
fixed point. 


EXAMPLE 12.2 The 4-cycle does not have the AFPP (because of the mapping 
which sends each vertex to the diagonally opposite vertex), while the wheel has 
the AFPP. 
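
For very small graphs the AFPP can be verified by brute force, enumerating all endomorphisms. The sketch below does this for the reflexive 4-cycle and for a wheel; the choice of the wheel $W_4$, the dictionary encoding of graphs, and the function names are assumptions made for illustration.

```python
from itertools import product


def endomorphisms(adj):
    """Yield every relation-preserving self-map of a graph as a dict."""
    verts = list(adj)
    for images in product(verts, repeat=len(verts)):
        h = dict(zip(verts, images))
        if all(h[y] in adj[h[x]] for x in verts for y in adj[x]):
            yield h


def has_afpp(adj):
    """True iff every endomorphism h has a vertex p with h(p) ~ p."""
    return all(any(h[p] in adj[p] for p in adj) for h in endomorphisms(adj))


# Reflexive 4-cycle and the wheel W_4 built over it.
c4 = {i: {(i - 1) % 4, i, (i + 1) % 4} for i in range(4)}
w4 = {i: c4[i] | {"hub"} for i in range(4)}
w4["hub"] = set(range(4)) | {"hub"}

# The map sending each vertex to the diagonally opposite one.
antipodal = {i: (i + 2) % 4 for i in range(4)}
```

The enumeration is exponential in the number of vertices, so this is only a sanity check for toy examples, not a practical decision procedure.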


That the wheel has the AFPP is a special case of the following result: 


THEOREM 12.3 Every dismantlable (reflexive) graph has the AFPP. 


This result has been rediscovered a number of times. The first statement and 
proof of it seems to be Pultr, 1963. Independently, Poston, 1971 provided 
an elaborate treatment in terms of algebraic topology. A special case was 
considered by Rosenfeld, 1986. Generalizations involving multifunctions were 
studied in Tsaur and Smyth, 2001. 

Notice that the AFPP is not hereditary (that is, it is not inherited by induced 
subgraphs). To see this, we need only look at the wheel and the cycle. Retracts, 
as well as products, are much better at preserving “topological” properties than 
are arbitrary induced subgraphs. In particular we have: 


PROPOSITION 12.4 


1 Suppose that H is a retract of G by $r : G \to H$, $e : H \to G$, and that G 
has the AFPP. Then H has the AFPP. 


2 If $G_1$, $G_2$ have the AFPP, then so does $G_1 \times G_2$. 


Proof 


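1 Assume that G has the AFPP. Suppose $h : H \to H$. Choose an almost 
fixed point p of the map $e \circ h \circ r : G \to G$. Then, since $p \sim e \circ h \circ r(p)$ 
and r is a morphism, we have $r(p) \sim r(e \circ h \circ r(p)) = (r \circ e)(h \circ r(p)) = h(r(p))$, 
using $r \circ e = \mathrm{id}_H$. Thus r(p) is an almost fixed point of h. 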


2 Left to the reader. 
QED 


A class of graphs closed under products and retracts is known as a variety 
of graphs. Thus the graphs having the AFPP constitute a variety. It is also true 
that the classes of dismantlable and of Helly graphs are varieties. 

More broadly, the category-theoretic aspect of graphs is of great importance, 
though neglected in most hitherto existing graph theory. This situation is beginning 
to be remedied, with the appearance of works such as Hell and Nešetřil, 
2004. In terms of our view of graphs as discrete counterparts of topological 
spaces, graph morphisms correspond to continuous functions. (A precise for- 
mulation of this will be given in the following section.) Our brief treatment of 
dismantlable and Helly graphs in this section was intended to be illustrative of 
the significance of morphisms, as well as of the topological aspect of graphs. 

The lack of any systematic treatment of morphisms is typical of the entire 
field (of discrete spatial models). Unfortunately we shall in this chapter be able 
to do little to remedy this situation. There is as yet, for example, no satisfactory 
category of oriented matroids. Questions of this kind constitute a substantial 
prospective area of research. 


3. Čech closure spaces 


As a convenient general setting within which we can study and compare 
spatial models of various kinds, whether discrete or continuous, we may take 
the formalism of closure spaces. A closure space is a set S equipped with an 
operator (the closure, Cl) acting on subsets of S. Closure spaces come in two 
main flavours. Precise details will be provided shortly (in this section and the 
following one) but a preliminary indication may be helpful. In the first flavour, 
we can think of Cl(S) as the set of elements close to the set S, in a suitable 
sense. The characteristic axiom here is: 


$$\mathrm{Cl}(A \cup B) = \mathrm{Cl}(A) \cup \mathrm{Cl}(B).$$


(An element is close to $A \cup B$ if and only if it is close to A or close to B.) In 
the second flavour we typically have that Cl(S) is the set of elements directly 
or indirectly dependent on S. Here we invariably have idempotency: 


$$\mathrm{Cl}(\mathrm{Cl}(S)) = \mathrm{Cl}(S),$$


together with (often) some more specific axiom reflecting the type of depen- 
dency involved. 

The less familiar “distributive” (but non-idempotent) closure spaces are dis- 
cussed in this section. The idempotent spaces appear in the next section. To 
avoid confusion, we shall refer to the idempotent spaces as closure systems. 

Čech, 1966, demonstrated in great detail that much of general topology goes 
through without the assumption that the operation of closure is idempotent. It 
seems that he was motivated to do this by some examples in functional analysis. 
But it turns out that this generalization is exactly what we need if we seek to 
extend topology to “digital” spaces. 


DEFINITION 12.5 A (Čech) closure space, (X, Cl), is a set X equipped with 
a set operator Cl satisfying: 


1 $A \subseteq \mathrm{Cl}(A)$, 
2 $\mathrm{Cl}(\emptyset) = \emptyset$, 
3 $\mathrm{Cl}(A \cup B) = \mathrm{Cl}(A) \cup \mathrm{Cl}(B)$. 


Closely associated with this notion of closure, we have those of interior and 
neighbourhood. Just as in ordinary topology, these three are interdefinable (at 
least if we allow classical logic). Thus, in a closure space (X, Cl), we define 
the interior operation by: 


$$\mathrm{Int}(A) = X - \mathrm{Cl}(X - A);$$

and we say that the subset B is a neighbourhood of A if 

$$A \subseteq \mathrm{Int}(B).$$

For the specialization preorder $\le_X$ of a closure space X we have: 

$$x \le_X y \iff (\forall B \subseteq X.\ x \in \mathrm{Int}(B) \Rightarrow y \in \mathrm{Int}(B)).$$


In words: every neighbourhood of x is a neighbourhood of y. Equivalently 
(check!) we can say: 


$$x \le_X y \iff \mathrm{Cl}(x) \subseteq \mathrm{Cl}(y);$$


or even more simply: $x \in \mathrm{Cl}(y)$. Given this, we naturally express $T_0$-separation 
by: 

$$X \text{ is } T_0 \iff (x \equiv_X y \Rightarrow x = y).$$


(“Identity of Indiscernibles.”) Note that with this definition of $T_0$-separation 
we depart significantly from Čech, 1966. The details, together with our reasons 
for changing Čech’s definition, are given in Smyth, 1995. 

Continuity of functions is characterized via closure, interior, or neighbour- 
hoods just as usual. (However, the definition via open sets or closed sets is not 
appropriate here.) Thus: the function $f : X \to Y$ is continuous if, for any 
$A \subseteq X$, $f(\mathrm{Cl}(A)) \subseteq \mathrm{Cl}(f(A))$. 


EXAMPLE 12.6 Any reflexive digraph (G, R) is a closure space, on taking 
the closure of a set of vertices to be given by: 


$$\mathrm{Cl}(A) = \{x \mid \exists y \in A.\ xRy\}.$$


The interior of a set B is then given by 

$$\mathrm{Int}(B) = \{x \mid \forall y.\ xRy \Rightarrow y \in B\}.$$


A neighbourhood of a vertex v is any set of vertices which contains all the 
R-successors of v. It is easy to check that a mapping of digraphs is a graph 
morphism (relation-preserving map) if and only if it is continuous in the sense 
of closure spaces. 
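
The claims of this example can be checked mechanically on a small graph. The sketch below, using assumed helper names and a relation stored as a set of pairs, computes closure and interior for the reflexive path 0 - 1 - 2, tests the three axioms of Definition 12.5 over all subsets, and compares the morphism and continuity conditions over all self-maps.

```python
from itertools import combinations, product


def closure(rel, a):
    """Cl(A) = {x | there is y in A with x R y}."""
    return {x for (x, y) in rel if y in a}


def interior(verts, rel, b):
    """Int(B), computed as X - Cl(X - B)."""
    return set(verts) - closure(rel, set(verts) - set(b))


def subsets(verts):
    vs = list(verts)
    return [set(c) for r in range(len(vs) + 1) for c in combinations(vs, r)]


def is_cech_closure(verts, rel):
    """Check the three closure-space axioms on every subset."""
    subs = subsets(verts)
    return (closure(rel, set()) == set()
            and all(a <= closure(rel, a) for a in subs)
            and all(closure(rel, a | b) == closure(rel, a) | closure(rel, b)
                    for a in subs for b in subs))


def is_morphism(rel, f):
    return all((f[x], f[y]) in rel for (x, y) in rel)


def is_continuous(verts, rel, f):
    """f(Cl(A)) is contained in Cl(f(A)) for every subset A."""
    return all({f[x] for x in closure(rel, a)} <= closure(rel, {f[x] for x in a})
               for a in subsets(verts))


# Reflexive, symmetric path 0 - 1 - 2.
V = [0, 1, 2]
R = {(i, j) for i in V for j in V if abs(i - j) <= 1}
maps = [dict(zip(V, images)) for images in product(V, repeat=len(V))]
agree = all(is_morphism(R, f) == is_continuous(V, R, f) for f in maps)
```

Here `closure(R, {0})` is `{0, 1}` while the closure of that set is all of `V`, which shows concretely that the closure of a graph need not be idempotent.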


REMARK 12.7 In case R is transitive (that is, the digraph is a pre-order), 
the closure is idempotent, and we have the usual Alexandroff topology of the 
pre-ordered set: that is, the topology obtained by taking the upper sets in the 
pre-order as open (equivalently, by taking the lower sets as closed). 


REMARK 12.8 It can easily be shown that a closure space (X, Cl) is a graph 
(more precisely, is the closure space derived from a digraph as in the preceding 
example) if and only if each point of X has a smallest neighbourhood. 


In thus describing digraphs as closure spaces, we had to require reflexivity 
of the graphs on account of Axiom 1 of closure spaces. Much of the theory 
of closure spaces can still be carried through if we drop Axiom 1 (so that 
arbitrary digraphs are captured). We shall generally assume our (di)graphs to 
be reflexive, although this is in many cases unnecessary from the mathematical 
point of view. 


4. Closure systems 


4.1 Definitions; first examples 


DEFINITION 12.9 A closure system (X, K) is a set X together with a set 
operator K satisfying: 


1 $A \subseteq K(A)$, 
2 $A \subseteq B \Rightarrow K(A) \subseteq K(B)$, 
3 $K(K(A)) = K(A)$. 


REMARK 12.10 As Martin and Pollard, 1996 point out, the three conditions 
of Definition 12.9 can be replaced by the single condition: 


$$A \subseteq K(B) \iff K(A) \subseteq K(B).$$

The closure K is called algebraic if in addition we have: 

$$K(A) = \bigcup \{K(B) \mid B \text{ is a finite subset of } A\}.$$
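
As a small check of Remark 12.10, the sketch below compares the three axioms with the single condition on one closure system and shows the single condition failing for a non-idempotent operator. The example space (the integers 0 to 4 with the interval hull) and all names are assumptions chosen purely for illustration.

```python
from itertools import combinations

X = range(5)
subs = [set(c) for r in range(len(X) + 1) for c in combinations(X, r)]


def hull(a):
    """K(A): the integer interval spanned by A (empty for empty A)."""
    return set(range(min(a), max(a) + 1)) if a else set()


def one_step(a):
    """A non-idempotent operator: add the immediate neighbours of each point."""
    return {y for x in a for y in (x - 1, x, x + 1) if y in X}


def satisfies_three_axioms(op):
    return (all(a <= op(a) for a in subs)
            and all(op(a) <= op(b) for a in subs for b in subs if a <= b)
            and all(op(op(a)) == op(a) for a in subs))


def satisfies_single_condition(op):
    """A is contained in K(B) iff K(A) is contained in K(B)."""
    return all((a <= op(b)) == (op(a) <= op(b)) for a in subs for b in subs)
```

The operator `one_step` is extensive and monotone but not idempotent, and it fails the single condition as well, for instance with A = {1} and B = {0}.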


Closure systems are of course ubiquitous, and at this generality one would 
expect that they have little specifically geometrical content. Yet (abstract) con- 
vexity structures as studied particularly by van de Vel, 1993, which we shall 
consider in Sec. 7.1, are in effect just algebraic closure systems. As an abstrac- 
tion of affine structure we have the matroid: a species of closure system which 
will be considered later in some detail. 

A striking difference between closure systems and Čech closure spaces is 
that the former always have enough closed sets (sets A such that A = K(A)) to 
determine the space completely. Indeed, as is well-known, we can alternatively 
define a closure system to be a set X together with a collection C of (“closed”) 
subsets satisfying: 


• the intersection of any set of closed sets is closed. 

A generalization of this situation will be discussed in a moment. 

As we shall see in Sec. 7, abstract convexity structures are not entirely suited 
to our purpose: they are not well adapted to the discrete situation. Here one 
could think of incorporating graph structure into the convexity: 


DEFINITION 12.11 A graph convexity is an algebraic closure system defined 
over the vertices of a graph G, such that each closed set induces a connected 
subgraph. 


Numerous such “convexities” have been studied by graph theorists (Duchet and 
Meyniel, 1983; Duchet, 1988); for example, a set A of vertices of a connected 
graph G is geodesic convex if, for each pair u, v of vertices of A, every vertex lying 
on a shortest path between u and v is also in A. Again, in any Helly graph we 
have the “neighbourhood convexity”, consisting of those sets of vertices which 
are intersections of balls (Tsaur and Smyth, 2004). On the whole, however, 
graph structure of itself, augmented with abstract convexity, does not sustain 
geometrically adequate notions of discrete convexity. More elaborate structure 
is needed: see Sec. 7, 9 below. 
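
Geodesic convexity is easy to test on a small connected graph: a vertex w lies on a shortest path between u and v exactly when $d_G(u, w) + d_G(w, v) = d_G(u, v)$. The sketch below uses this criterion; the dictionary encoding and function names are assumptions for the example.

```python
from collections import deque


def distances_from(adj, x):
    """Breadth-first search distances from x in a connected graph."""
    dist = {x: 0}
    queue = deque([x])
    while queue:
        v = queue.popleft()
        for w in adj[v]:
            if w not in dist:
                dist[w] = dist[v] + 1
                queue.append(w)
    return dist


def is_geodesically_convex(adj, a):
    """True iff every vertex on a shortest u-v path, for u, v in A, lies in A."""
    dist = {u: distances_from(adj, u) for u in adj}
    return all(w in a
               for u in a for v in a for w in adj
               if dist[u][w] + dist[w][v] == dist[u][v])


# Reflexive 6-cycle.
c6 = {i: {(i - 1) % 6, i, (i + 1) % 6} for i in range(6)}
```

In the 6-cycle the arc {0, 1, 2} is geodesic convex, but {0, 1, 2, 3} is not, because the antipodal vertices 0 and 3 are also joined by a shortest path through 4 and 5.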

Returning to closures in general, we note that the collection C of closed 
subsets of a closure system (X, K) is a complete lattice, since it possesses 
arbitrary meets. We may say that it is a complete lattice of subsets of X. What 
this means is that the ordering is the subset order, and that C is a poset possessing 
(arbitrary) joins and meets. The lattice operations, however, are not required to 
agree with those of P(X) (although, in the case of a closure system, the meets 
of course agree). 

Less familiar than this is the fact that we can relax the axioms of a closure 
system, and still obtain a complete lattice of “closed” sets. 


DEFINITION 12.12 A weak closure system is a set X together with an oper- 
ator K which is monotonic and idempotent. 


PROPOSITION 12.13 In any weak closure system (X, K), the closed sets (that 
is, fixed points of K) form a complete lattice under set inclusion. 


Proof Let $(C_i)_{i \in I}$ be a family of closed sets. Define 


$$W = K\Bigl(\bigcup_{i \in I} C_i\Bigr).$$


Then we have: 
1 W is closed. 
2 $C_i \subseteq W$ for each i, since $C_i = K(C_i) \subseteq W$. 
3 $W \subseteq Z$ for any closed upper bound Z of the family $(C_i)$; indeed 
$W \subseteq K(Z) = Z$. Thus W is the join of the family $(C_i)$. 


QED 


We shall see a substantial example of this in a moment (regular sets). 


4.2 Lattice of regular sets of a closure space. 


For our main example of (weak) closure systems, we begin by recalling the 
notion of regular open (or closed) sets in topology. These sets have often been 
used for modeling spatial logics such as RCC. We observe that the notion can 
be generalized to (Čech) closure spaces: 


DEFINITION 12.14 Let (X, Cl) be a closure space, and $A \subseteq X$ an interior 
set (that is, A = Int(B) for some $B \subseteq X$). Then A is a regular interior set if 
A = Int(Cl(A)). Regular closure sets are defined similarly. 


It is well-known (see for example Birkhoff, 1948) that, in any topological 
space, the regular open sets (likewise, regular closed sets) form a complete 
Boolean algebra. This is by no means the case in arbitrary Čech closure spaces. 
In order to obtain a satisfactory algebra of regular sets, we shall restrict the 
closure spaces considered via: 


DEFINITION 12.15 A closure space (X, Cl) is weakly regular if, for any interior 
set $A \subseteq X$, the closure Cl(A) is a neighbourhood of A. 


Every topological space is trivially weakly regular. With regard to non-topological 
closure spaces, we have the following easy result: 


PROPOSITION 12.16 Every tolerance space (G,~) is, as a closure space, 
weakly regular. 


The term “weakly regular” (we would have preferred “semi-regular”, but 
that already has another sense: Engelking, 1989) has been chosen because, as 
we shall see in a moment, the closure spaces satisfying this condition have a 
rich structure of regular sets. It may be objected that the term is ill-chosen, as 
there is no special connection with regular topological spaces. Note, however, 
the following: 


DEFINITION 12.17 (ČECH, 1966) A closure space (X, Cl) is regular if, for 
every point x and neighbourhood U, there is a neighbourhood V of x such that 
$\mathrm{Cl}(V) \subseteq U$. 


PROPOSITION 12.18 Every regular closure space is weakly regular. 


It is easy to see that weak regularity can be expressed in terms of the 
interior/closure operator, without explicit mention of points, by: 
$\mathrm{Int}\,A \subseteq \mathrm{Int} \circ \mathrm{Cl} \circ \mathrm{Int}\,A$. 

For the definition of weak regularity to be meaningful, and for the following 
proposition to hold, we actually only need that Cl is a monotonic operator (with 
Int its dual). However for our intended application, namely Theorem 12.20 
(see part 4 of the theorem), we require a Čech closure space, so we shall for 
simplicity work in that context. Abbreviate the operators $\mathrm{Int} \circ \mathrm{Cl}$, $\mathrm{Cl} \circ \mathrm{Int}$ by 
J, K respectively. 


PROPOSITION 12.19 Consider the following properties of a given closure 
space (X, Cl): 


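1 $\mathrm{Int}\,A \subseteq \mathrm{Int}\,\mathrm{Cl}\,\mathrm{Int}(A)$ (i.e. X is weakly regular); 
2 $\mathrm{Cl}\,\mathrm{Int}\,\mathrm{Cl}(A) \subseteq \mathrm{Cl}\,A$; 
3 The operator K is idempotent; 
4 The operator J is idempotent. 

Then: (1) $\Leftrightarrow$ (2), (3) $\Leftrightarrow$ (4), and (1) $\Rightarrow$ (3). 


Proof Assume (1). By taking complements ($\neg$) we obtain: 


$$\neg\,\mathrm{Int}\,\neg\neg\,\mathrm{Cl}\,\neg\neg\,\mathrm{Int}\,\neg(\neg A) \subseteq \neg\,\mathrm{Int}\,\neg(\neg A).$$


That is, 

$$\mathrm{Cl}\,\mathrm{Int}\,\mathrm{Cl}(\neg A) \subseteq \mathrm{Cl}(\neg A).$$


Since A is arbitrary, we deduce (2). (2) $\Rightarrow$ (1) is similar. A similar manipulation 
with complements shows that (3) $\Leftrightarrow$ (4). 

Now assume (1) (or, equivalently, (2)). We obtain $J \circ J \le J$ by applying 
Int to each side of (2). On the other hand, we get $J \le J \circ J$ by substituting 
Cl(B) for A in (1). QED 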


From Proposition 12.19 we see that, in any weakly regular space, the operators 
$J = \mathrm{Int} \circ \mathrm{Cl}$ and $K = \mathrm{Cl} \circ \mathrm{Int}$ are weak closure operators (that is, each 
is monotonic and idempotent, cf. Definition 12.12). In the following theorem 
we also make use of the operator $*$, defined by: $A^* = \mathrm{Int}(X - A)$. Trivially, 
we have in any space that $J(A) = A^{**}$; and if A is regular interior (that is, 
$A = A^{**}$), then $A^*$ is also regular interior. 


THEOREM 12.20 Let R be the collection of regular interior sets of the weakly 
regular space (X, Cl). We view $(R, \subseteq)$ as a poset, and $*$ (see the preceding 
remarks) as an operator $* : R \to R$. Then we have: 

1 R is a complete lattice; 
2 $A \le B \Rightarrow B^* \le A^*$; 
3 $A^{**} = A$; 
4 $A \vee A^* = X$ and $A \wedge A^* = \emptyset$. 


Proof 
1 By Proposition 12.13 applied to the weak closure system (X, J). 
2 By monotonicity of Int. 
3 A is a regular interior set. 
4 $\mathrm{Cl}(A \cup A^*) = \mathrm{Cl}(A) \cup \mathrm{Cl}(A^*) = X$. Hence X is the only regular set 
containing both A and $A^*$ as subsets. 
QED 


The clauses (2)-(4) of the preceding Theorem assert that * is an orthocom- 
plement for the bounded poset R. Together with (1), the statement is that R is 
a complete ortholattice. 


EXAMPLE 12.21 Let (G, ~) be a reflexive graph. As usual, we regard G as a 
closure space. It is not difficult to see that in this closure space, every closure set 
(that is, union of discs, or 1-neighbourhoods) is regular. Dually, every interior 
set is regular. Note also that, if $(G, \perp)$ is the complement (orthogonality) graph, 
we can write $A^{\perp}$ for $\mathrm{Int}(G - A)$, so that we have the expression $A^{\perp\perp}$ for J(A) 
($= \mathrm{Int} \circ \mathrm{Cl}(A)$). Also, $A \subseteq A^{\perp\perp}$; it is convenient to refer to $A^{\perp\perp}$ as the 
orthoclosure of A. The notation OL(G) will sometimes be used for the lattice 
of orthoclosed subsets of G. 
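
The orthoclosure and the clauses of Theorem 12.20 can be checked exhaustively on a small tolerance space. The sketch below uses the reflexive path 0 - 1 - 2 - 3, stored as a dictionary of neighbourhoods; the function names and the choice of graph are assumptions made for illustration.

```python
from itertools import combinations


def closure(adj, a):
    """Cl(A): all vertices adjacent to some vertex of A."""
    return {x for x in adj if adj[x] & a}


def interior(adj, b):
    """Int(B): all vertices whose whole 1-neighbourhood lies in B."""
    return {x for x in adj if adj[x] <= b}


def star(adj, a):
    """A* = Int(X - A), i.e. the orthocomplement of A in the graph."""
    return interior(adj, set(adj) - a)


def orthoclosure(adj, a):
    """J(A) = Int(Cl(A)), the orthoclosure of A."""
    return interior(adj, closure(adj, a))


# Reflexive path 0 - 1 - 2 - 3.
p3 = {i: {j for j in range(4) if abs(i - j) <= 1} for i in range(4)}
subs = [set(c) for r in range(5) for c in combinations(range(4), r)]
regular = [a for a in subs if orthoclosure(p3, a) == a]
```

For example, `orthoclosure(p3, {1})` adds the vertex 0, since every neighbour of 0 is already adjacent to 1, whereas vertex 2 is not added because its neighbour 3 is not adjacent to 1.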


It may happen (as in Sec. 5.1) that an irreflexive graph $(G, \perp)$ is the primary 
object of study. In that case we still take the orthoclosed subsets to be those of 
the form $J(A) = A^{\perp\perp}$, while mention of Int, Cl, etc., will be taken to refer to 
the reflexive graph co-G. 

These orthoclosed (alias regular interior) sets are extremely important for 
our subsequent development. As already mentioned in the Introduction, they 
provide our notion of region: see especially Sec. 7 for the details. In calculating 
the orthoclosure in a given (G, ~), a good way to think about this operation is 
as follows. To extend the set $A \subseteq G$ to $A^{\perp\perp}$, we add to A every vertex a such 
that each vertex adjacent to a is already adjacent to some vertex of A. In more 
geometrical language, the criterion is that $a \in A^{\perp\perp}$ if the 1-neighbourhood of 
a is contained in the 1-neighbourhood of A. 

Most of the preceding material on regular interior sets applies, mutatis mutandis, 
to regular closure sets. In fact, since in any closure space X a set A 
is regular interior iff X — A is regular closure, the inverse of the complete 
ortholattice R (Theorem 12.20) is isomorphic to the complete ortholattice of 
regular closure sets. We note that J. L. Bell (Bell, 1986), in his approach to 
quantum logic, has worked with the lattice of (regular) closure sets of a tol- 
erance space (or proximity space in Bell’s terminology). We shall take up the 
topic of quantum logic in the next section. We shall work there with the ortho- 
closure as defined above. This is more standard than the (Bell’s) approach in 
terms of closure sets, and is also more convenient for our intended geometrical 
applications. 


5. Extended examples 


This section is mainly concerned with the modelling of various physical and 
spatial logics by lattices of subsets of a closure space (usually a graph). We 
pursue this theme via three extended examples. 

The first of our three examples is concerned with developments of (so-called) 
quantum logic. The reader may wonder why so much attention is given to a 
subject so far removed from our main topic. In defence we would mention that 
the developments we are concerned with involve little that is specific to quantum 
theory. Rather, what is involved is a general logic of “tests”, or “properties”, the 
underlying structures of which (we shall argue) are of considerable significance 
for spatial reasoning. 


5.1 Quantum structures 


As is well known, Birkhoff and von Neumann, 1936 argued that the 
strangeness of quantum mechanics derived, in part, from the fact that its “logic” 
was non-classical: specifically, the distributive law failed. Less well known 
is the extensive development which these ideas have enjoyed from the 1950s 
onwards. We are concerned here with one of the main strands of this work, the 
“empirical logic” (logic of experiment) of D. J. Foulis and C. H. Randall. (Rather 
than cite a huge number of original papers, we mention here a couple of survey 
papers and a useful textbook: Cohen, 1989; Foulis, 1999; Coecke et al., 2000). 

The departure point of the Foulis-Randall theory is an extremely simple 
definition: 


DEFINITION 12.22 A test space is a collection A of pairwise incomparable 
non-empty sets. Elements of A are called tests. Elements of tests are called 
outcomes, and the outcome space is $X = \bigcup A$. The space is said to be 
classical if it has only one test. Finally, an event is a subset of a test. 

In the literature, variations on a thought experiment (due to Foulis) involving 
a “firefly box” are often used as illustrations of non-classical test spaces. The 
basic firefly “experiment” goes as follows: 





Figure 12.2. Firefly box. 


EXAMPLE 12.23 A rectangular box has two translucent windows, one on the 
front and one on a (specified) side; the other four sides of the box are opaque. 
At any given moment, the firefly’s light is either on or off. This, we assume, can 
be detected by looking (directly) at either the front or the side. Moreover, by 
looking directly at the front window when the light is on, one can tell whether 
the firefly is in the left (l) or right (r) half of the box. Likewise, by looking 
directly at the side window when the light is on, one can tell whether the firefly 
is in the front (f) or back (b) half of the box. When the light is off, nothing (n) 
is seen, whether we look at the front or the side of the box. (It is assumed that 
the box has no partitions or baffles behind which the insect can hide, so that, if 
nothing is seen when looking at the front, then nothing would have been seen 
had one looked at the side, and vice versa.) 

With this set-up, then, we have two “experimental procedures” F and S. 
Procedure F is conducted by looking directly at the front of the box and recording 
l, r, or n according to what, if anything, is seen. In procedure S one looks directly 
at the side of the box and records, as the case may be, f, b, or n. 

Abstracting from this description, we have a test space consisting of the two 
tests {l,r,n} and {f,b,n}. The outcome space is {l,r,n, f, b}. 
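
The abstraction just described can be sketched in a few lines of Python. This is only an illustration under our own encoding: tests are represented as frozensets of one-letter outcome labels, and the function names are hypothetical rather than taken from the Foulis-Randell literature.

```python
from itertools import combinations

# The firefly-box test space of Example 12.23: two tests sharing the outcome "n".
FIREFLY = [frozenset("lrn"), frozenset("fbn")]


def is_test_space(tests):
    """Check Definition 12.22: non-empty, pairwise incomparable sets."""
    if any(len(t) == 0 for t in tests):
        return False
    return all(not (s <= t or t <= s) for s, t in combinations(tests, 2))


def outcome_space(tests):
    """The outcome space X is the union of all tests."""
    return frozenset().union(*tests)


def events(tests):
    """All events, i.e. all subsets of some test (including the empty event)."""
    found = set()
    for t in tests:
        items = sorted(t)
        for k in range(len(items) + 1):
            found.update(frozenset(c) for c in combinations(items, k))
    return found


def is_classical(tests):
    """A test space is classical when it has exactly one test."""
    return len(tests) == 1
```

With this encoding, {l, r} is an event of the firefly box, whereas {l, f} is not, since no single test contains both outcomes.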


As a somewhat less whimsical example we have: 


EXAMPLE 12.24 Quantum test space. A is the collection of all orthonormal 
bases of a Hilbert space H. Here, the outcome space X is the unit sphere of H. 


The third type of example we consider here is given as follows: 


EXAMPLE 12.25 Partition manual. A test space in which the tests form a 
partition (not just a cover) is called a partition manual. The spin manual is the 
special case of this in which each test has cardinality two. This name is, in 
part, a reference to the case of the “spin-1/2” particle, typically an electron, 
which, for each angle of measurement, has the two possible outcomes: spin up, 
or spin down. 


Clearly, the notion of a test space is very general: just an irredundant cover 
of the set X. (Sometimes the definition is weakened further, to an arbitrary 
cover of X: Wilce, 2004.) We shall consider in a moment certain conditions 
that serve to make the notion more restrictive. 

Any test space (X, A) gives rise to an orthogonality graph $(X, \perp)$ via: 


$x \perp y \iff x, y$ are distinct elements of some test in A. 


This in turn gives a closure system (see Sec. 4), with closure operation given 
by: 

$A \mapsto J(A) = A^{\perp\perp}$. 


It is easy to check that in the Hilbert space example this simple definition does 
indeed give, as closed sets, the closed (linear) subspaces. In the spin manual, 
or any partition manual, closure is the identity on proper subsets of tests, while 
every other set has X as its closure. 
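
The claim about partition manuals can be checked mechanically on a small case. The sketch below assumes the usual reading of $A^\perp$ as the set of outcomes orthogonal to every element of A (we take this to agree with Sec. 4), and uses hypothetical outcome labels for a spin manual with three tests.

```python
from itertools import combinations


def orthogonality(tests):
    """Map each outcome to the set of outcomes orthogonal to it."""
    X = frozenset().union(*tests)
    perp = {x: set() for x in X}
    for t in tests:
        for x, y in combinations(t, 2):
            perp[x].add(y)
            perp[y].add(x)
    return perp


def perp_of(A, perp):
    """A^perp: the outcomes orthogonal to every element of A."""
    result = set(perp)          # the empty set has all of X as its perp
    for a in A:
        result &= perp[a]
    return frozenset(result)


def closure(A, perp):
    """J(A) = A^{perp perp}."""
    return perp_of(perp_of(A, perp), perp)


def spin_manual(n):
    """A partition manual with n two-outcome tests (labels are hypothetical)."""
    return [frozenset({("up", i), ("down", i)}) for i in range(n)]
```

Running `closure` over every subset of the outcome space of `spin_manual(3)` returns the subset itself exactly when it is a proper subset of a test, and the whole outcome space otherwise, in line with the statement above.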

A family $\{A_i\}$ of events of the test space (X, A) is said to be compatible 
if its members are all contained in a common test. Disjoint compatible events 
are said to be orthogonal. Notation: $A \perp B$. (Notice that, so far, we are not 
entitled to identify this notion with orthogonality as applied to subsets of the 
graph $(X, \perp)$.) If $A \perp B$ and $A \cup B$ is actually a test, A and B are called 
(orthogonal) complements. Notation: $A \mathrel{\mathrm{oc}} B$. Finally, the events A, B are said 
to be perspective (written $A \sim B$) if they possess a common complement. 


DEFINITION 12.26 The test space (X, A) is said to be algebraic if, for any 
pair of perspective events A, B, any orthogonal complement of A is also an 
orthogonal complement of B. 


The idea of this definition is that perspective events can be considered as, in 
a sense, logically equivalent. For example, in the firefly experiment, we have 
that the events {l,r} and {f,b} are perspective. Each of these events occurs 
(under the appropriate test) if and only if the firefly’s light is on, this being 
conveyed by the fact that their common complement is the event {n}. Thus, 
in a coherent experimental set-up, events which share (locally) an orthogonal 
complement should have the same complements everywhere: the test space 
should be algebraic. 
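
For finite test spaces, perspectivity and algebraicity can be tested by brute force. The following is a sketch under our own encoding (tests as frozensets); the "triangle" test space is a hypothetical non-example introduced here for contrast, not one discussed in the text.

```python
from itertools import combinations


def events(tests):
    """All subsets of all tests."""
    found = set()
    for t in tests:
        for k in range(len(t) + 1):
            found.update(frozenset(c) for c in combinations(sorted(t), k))
    return found


def complements(A, tests):
    """All events B with A oc B: B is disjoint from A and A | B is a test."""
    return {t - A for t in tests if A <= t}


def perspective(A, B, tests):
    """A ~ B: the two events share at least one orthogonal complement."""
    return bool(complements(A, tests) & complements(B, tests))


def is_algebraic(tests):
    """Definition 12.26: perspective events have exactly the same complements."""
    evs = events(tests)
    for A, B in combinations(evs, 2):
        if perspective(A, B, tests) and complements(A, tests) != complements(B, tests):
            return False
    return True


FIREFLY = [frozenset("lrn"), frozenset("fbn")]
# Hypothetical non-example: three two-outcome tests arranged in a triangle.
TRIANGLE = [frozenset("ab"), frozenset("bc"), frozenset("ca")]
```

In the triangle, {b} and {c} share the complement {a}, but {c} is a complement of {b} and not of itself, so algebraicity fails; the firefly box passes the check.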

Notice that any two tests of a test space are perspective (they have the empty 
set as complement). It is not difficult to show that, in the “logic of experimental 
propositions” of an algebraic test space, any test will appear as True (the top 
element of the lattice, assuming that we can construct an appropriate lattice). 

Mathematically, a test space is nothing but a simplicial complex, where the 
events are the simplices and the tests are the maximal simplices. As such, a 
test space (X, A) is graph-like (Sec. 2) if and only if every subset of X whose 
elements are pairwise orthogonal is an event. The assumption that this holds 
is (a strong form of) what is called orthocoherence in the quantum structures 
literature. The assumption was often made, more-or-less explicitly, in the older 
literature (say, from the 1950s to the 1970s), but is generally avoided today. 
However that may be, it is easy to see that the examples of test spaces given 
above (firefly, Hilbert space, partition manual) are all graph-like. Notice in 
particular that the spin manual having n tests is represented as the complete 
n-partite graph K(2,2,...,2)—a very interesting graph to which we shall 
return more than once in the sequel. Moreover the example related to region 
geometry, with which we shall be concerned later, is graph-like. We shall adopt 
this assumption in the remainder of our discussion of test spaces and quantum 
logic. Thus, a test space is henceforth taken to be an orthogonality graph, in 
which the events are the cliques, and the tests are the maximal cliques. 

In studying a test space $(G, \perp)$, we apply our previous work (see 
Theorem 12.20) on the ortholattice OL(G) of orthoclosed (that is, regular interior) 
sets of G. The closed sets we are concerned with now are those which are 
closures of cliques. These in general form a proper subset of OL(G). For 
example, in the graph co-$C_5$, any non-orthogonal pair of vertices is closed, but 
is not the closure of a clique. In fact, we shall need to consider orthoposets that 
are not necessarily lattices. An orthoposet is a poset P satisfying conditions 
(2)-(4) of Theorem 12.20; that is, P has a greatest element 1 and a least 
element 0, and is equipped with an order-reversing, involutory unary operation $*$ 
such that, for any element x, the join $x \vee x^*$ and the meet $x \wedge x^*$ exist and are 
equal to 1 and 0 respectively. A desirable property of our test space G is that the 
poset EC (“event-closures”) of closures of cliques is an orthoposet, and indeed 
a suborthoposet of R (= OL(G)). (Thus co-$C_5$ does not give a well-behaved test 
space; on the other hand, co-$C_6$ is satisfactory on this count.) 

Recall now the idea that perspective events may be considered logically 
equivalent. An obvious problem with this is that we have not shown that 
perspectivity is an equivalence relation. This property will need to hold if we 
are to have a satisfactory “logic” of events, and a good way to ensure it is to 
ask that two events be perspective if and only if they have the same closure. If 
this holds, we will be able to identify propositions with event-closures, rather 
than having to deal with equivalence-classes of events. One might guess 
(correctly, as we shall see) that this feature is closely related to whether the 
test space is algebraic. 

We come now to an extremely important notion: orthomodularity (of a poset, 
particularly of a lattice). This can be formulated in many equivalent ways, and 
we shall use a formulation which is not usually taken as the definition, but is 
convenient for our purposes. Note first that we say that two elements x, y of 
an orthoposet $(P, \le, {}^*)$ are orthogonal if $x \le y^*$. (This relation is, of course, 
symmetric.) 


DEFINITION 12.27 Let $(P, \le, 0, 1, {}^*)$ be an orthoposet. We say that P is 
orthomodular provided that, whenever x, y are orthogonal, we have: 


$x \vee y = 1 \implies y = x^*$. 

Given the element x of the orthoposet P we may refer, somewhat ambiguously, 
to an element y orthogonal to x whose join with x is (defined and) equal to 
1 as “an orthocomplement” of x. Then orthomodularity is the condition that 
orthocomplements are unique. 

Lastly (in preparation for our main result) we define a property of graphs 
which may appear somewhat technical, but is in fact of considerable geometric 
and logical significance: 


DEFINITION 12.28 An orthogonality graph $(G, \perp)$ is said to be Dacey if, for 
any maximal clique C and vertices x, y such that $C \subseteq x^\perp \cup y^\perp$, we have $x \perp y$. 


For a detailed study of Dacey graphs, see Sumner, 1974. The Dacey graphs 
encountered in applications, other than in quantum logic, are usually hereditarily 
Dacey (meaning that every induced subgraph is Dacey). For a discussion, with 
several characterizations, of hereditarily Dacey graphs, see Brandstädt et al., 
1999. 
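
For small graphs the Dacey condition can be tested directly from Definition 12.28. The sketch below is a brute-force check, not an efficient algorithm; the adjacency-map encoding and the function names are our own assumptions.

```python
from itertools import combinations


def graph(edges):
    """Adjacency map of an undirected, loop-free orthogonality graph."""
    adj = {}
    for x, y in edges:
        adj.setdefault(x, set()).add(y)
        adj.setdefault(y, set()).add(x)
    return adj


def maximal_cliques(adj):
    """Brute-force enumeration; fine for the tiny graphs used here."""
    verts = sorted(adj)
    cliques = [
        frozenset(c)
        for k in range(1, len(verts) + 1)
        for c in combinations(verts, k)
        if all(v in adj[u] for u, v in combinations(c, 2))
    ]
    return [c for c in cliques if not any(c < d for d in cliques)]


def is_dacey(adj):
    """Definition 12.28, checked over all maximal cliques and vertex pairs."""
    for C in maximal_cliques(adj):
        for x in adj:
            for y in adj:
                # C lies inside x^perp united with y^perp, yet x, y not orthogonal
                if C <= adj[x] | adj[y] and y not in adj[x]:
                    return False
    return True
```

On the orthogonality graph of the firefly box the check succeeds. On a path with four vertices a, b, c, d it fails: the maximal clique {b, c} is covered by the neighbourhoods of a and d, which are not orthogonal to each other.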

We can now state: 


THEOREM 12.29 Let $(G, \perp)$ be an orthogonality graph, R its lattice of 
orthoclosed subsets, and P (= EC) the poset of event-closures (i.e. closures of 
cliques) in G. Then the following are equivalent: 


1 G is Dacey. 


2 Let A, B be cliques such that $A \mathrel{\mathrm{oc}} B$. If K = J(A) (i.e. $A^{\perp\perp}$), then 
$K^\perp = J(B)$. 


3 P is a sub-orthoposet of R, and each element K of P is the closure of 
every clique which is maximal in K. 


4 P is an orthomodular sub-orthoposet of R. 


5 Two cliques are perspective if and only if they have the same closure. 


6 As a test space, G is algebraic. 


Proof (1) $\Rightarrow$ (2). Under the stated conditions on A, B, K, we evidently have 
$J(B) \subseteq K^\perp$. Suppose that $y \in K^\perp$ (so that $y \perp A$). Let x be an arbitrary 
vertex orthogonal to B. By the Dacey condition for G, we have $x \perp y$. This 
shows that $y \in J(B)$. 

(2) $\Rightarrow$ (3). Assume (2). Suppose that K = J(A), A a clique. Extend A to 
a maximal clique $A \cup B$, so that $A \mathrel{\mathrm{oc}} B$. Then by (2), $K^\perp = J(B)$. (Thus P is 
a sub-orthoposet.) Now let $A'$ be any clique maximal in K. It is easily checked 
that $A' \mathrel{\mathrm{oc}} B$. Since $K^\perp = J(B)$, it follows by (2) that $K = K^{\perp\perp} = J(A')$. 

(3) $\Rightarrow$ (4). Assume (3). We have to show that P is orthomodular. Take 
$K \in P$, where K = J(A), A a clique. An element L of P orthogonal to K 
has the form J(B), where $B \perp A$. Moreover, if $K \vee L = G$, we must have 
$B \mathrel{\mathrm{oc}} A$ (since, for any x orthogonal to $A \cup B$, we would have $x \notin J(A \cup B)$). 
Thus if L is an orthocomplement of K, we indeed have $L = K^\perp$. 

(4) $\Rightarrow$ (5). Notice that two cliques having the same closure are necessarily 
perspective (without the need to assume (4)). For if J(A) = J(B), and $A \mathrel{\mathrm{oc}} C$, 
then we know that $B \perp C$ and also that the clique $B \cup C$ is maximal (since a 
vertex x orthogonal to $B \cup C$ would be orthogonal to $J(B) \cup C$, and therefore 
to $A \cup C$). 

Now assume (4). Suppose that we have (perspective) cliques A, B, with 
$A \mathrel{\mathrm{oc}} C$ and $B \mathrel{\mathrm{oc}} C$. Consider J(C). Since P is a sub-orthoposet, the 
orthocomplement (in OL(G)) K of J(C) is an element of P. Now J(A) is orthogonal 
to J(C) and $J(A) \vee J(C) = G$; so by orthomodularity, J(A) = K. Likewise, 
J(B) = K. 

(5) $\Rightarrow$ (6). Assume that, whenever two cliques are perspective, they have 
the same closure. Let A, B be cliques with $A \mathrel{\mathrm{oc}} C$ and $B \mathrel{\mathrm{oc}} C$. If also $A \mathrel{\mathrm{oc}} D$, 
then $J(B) (= J(A)) \perp D$. As before, the clique $B \cup D$ is maximal; thus 
$B \mathrel{\mathrm{oc}} D$. 

(6) $\Rightarrow$ (1). Suppose that (as a test space) G is algebraic. Let A, B be 
cliques, with $A \cup B$ maximal, and x, y vertices such that $x \perp B$ and $y \perp A$. 
Choose a clique $B'$ such that $A \mathrel{\mathrm{oc}} \{y\} \cup B'$, and likewise a clique $A'$ such that 
$A' \cup \{x\} \mathrel{\mathrm{oc}} B$. Thus we have $A \mathrel{\mathrm{oc}} B$, and also $A \mathrel{\mathrm{oc}} \{y\} \cup B'$. But $B \mathrel{\mathrm{oc}} A' \cup \{x\}$, 
and so by algebraicity we get $A' \cup \{x\} \mathrel{\mathrm{oc}} \{y\} \cup B'$. In particular, $x \perp y$. QED 


A “quantum logic” is sometimes defined to be an orthomodular poset 
(Kalmbach, 1983; Pták and Pulmannová, 1991). More often, perhaps, the 
structure is required to be an orthomodular lattice. The examples mentioned earlier 
in this subsection all yield orthomodular lattices rather than just posets. Several 
further refinements and generalizations of these structures are currently studied 
(Coecke et al., 2000; Dvurečenskij and Pulmannová, 2000). We do not here 
comment on these developments or their possible experimental justification. 
But we shall find that the orthomodular posets and lattices are of considerable 
significance for our geometric enquiry (Sec. 7). 

We conclude this subsection with a characterization, due to Foulis and 
Randell, 1971, of “complete orthomodular spaces”, that is, of graphs $(G, \perp)$ 
such that the lattice of orthoclosed subsets of G is orthomodular: 


PROPOSITION 12.30 G is a complete orthomodular space if and only if every 
closed set K of vertices is the closure of every maximal clique contained in K. 


Proof If the stated condition holds, then R (= OL(G)) coincides with P (= EC); 
hence by Theorem 12.29 R is orthomodular. 

Suppose that R is orthomodular. Let $K \in R$, and let A be a maximal clique 
in K. Then $J(A) \perp K^\perp$, and moreover $J(A) \vee K^\perp = G$ (since clearly there 
cannot be a vertex orthogonal to both $K^\perp$ and J(A)). Hence K = J(A). QED 
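
Proposition 12.30 can be illustrated on small graphs by computing OL(G) exhaustively. The sketch below is ours: it assumes $A^\perp$ is the set of common neighbours of A, and it tests orthomodularity through the standard lattice form of the law (if $x \le y$ then $y = x \vee (y \wedge x^\perp)$) rather than through the formulation of Definition 12.27.

```python
from itertools import combinations


def graph(edges):
    adj = {}
    for x, y in edges:
        adj.setdefault(x, set()).add(y)
        adj.setdefault(y, set()).add(x)
    return adj


def perp(A, adj):
    """Common neighbours of all vertices in A (all of G when A is empty)."""
    result = set(adj)
    for a in A:
        result &= adj[a]
    return frozenset(result)


def J(A, adj):
    return perp(perp(A, adj), adj)


def subsets(S):
    items = sorted(S)
    return [frozenset(c) for k in range(len(items) + 1) for c in combinations(items, k)]


def closed_sets(adj):
    return {A for A in subsets(adj) if J(A, adj) == A}


def is_clique(A, adj):
    return all(v in adj[u] for u, v in combinations(A, 2))


def every_maximal_clique_generates(adj):
    """The condition of Proposition 12.30, checked for every closed set K."""
    for K in closed_sets(adj):
        cliques = [A for A in subsets(K) if is_clique(A, adj)]
        maximal = [A for A in cliques if not any(A < B for B in cliques)]
        if any(J(A, adj) != K for A in maximal):
            return False
    return True


def is_orthomodular(adj):
    """Direct check of: x <= y implies y = x v (y ^ x^perp) in OL(G)."""
    closed = closed_sets(adj)
    return all(
        y == J(x | (y & perp(x, adj)), adj)
        for x in closed for y in closed if x <= y
    )
```

The two predicates agree on the examples we tried: both hold for the firefly graph, and both fail for the 5-cycle, where the closed set {0, 2} contains only singleton cliques.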


5.2 Region Connection Calculus 


Connection theory, also known as mereotopology, is an approach to topology 
(and possibly geometry) in which regions rather than points are taken as the basic 
entities. Besides the set of regions, the main primitive is a two-place relation of 
“connection” between regions. A treatment of space(-time) on the basis of these 
primitives was proposed by Whitehead and others, on philosophical grounds, 
in the early twentieth century. 

Connection theory has enjoyed a revival in recent years, especially as it has 
become popular as a framework for spatial reasoning in computer science. It 
is, indeed, an active area of research, and quite a large literature has developed. 
Our justification for treating it as a mere “example” is that, in its main lines of 
development, connection theory implies that space is continuous (or infinitely 
divisible). Only a few attempts have been made to adapt the theory so that it 
can have non-trivial discrete models. Following a brief review of the general 
theory, we shall consider one such proposal, due to J. Stell. 

Articles by B. L. Clarke (Clarke, 1981; Clarke, 1985) are credited with 
sparking the contemporary interest in connection theory. With R a non-empty 
set (of non-null “regions”) and C a binary relation (“connection”) we have, 
according to Clarke, the following axioms: 

A1 $xCx$; 

A2 $xCy \implies yCx$; 

A3 $C(x) = C(y) \implies x = y$. 

Here, C(x) means $\{y \in R \mid xCy\}$. Thus, in our terminology, A3 amounts to 
$T_0$-separation. Given a non-empty subset X of R, a region x is said to be the 
fusion of X provided that, for every region y, $yCx$ if and only if $yCz$ for some 
$z \in X$; that is, provided that 


$C(x) = \bigcup \{C(z) \mid z \in X\}$. 

Then the next axiom is: 

A4 Every non-empty set of regions possesses a fusion. 

From A3 we see that the fusion of the non-empty set X, say FX, is uniquely 
defined. Moreover (as in Sec. 3 above) the relation $\le$ defined by: 


$x \le y \iff C(x) \subseteq C(y)$ 


is a partial order. 

One aim of connection theory is to provide a definition of “point” as a suitable 
set of regions. Clarke’s definition has it that a point is a pairwise connected 
set of regions, having some aspects of a prime filter. We shall not go into the 
details of this definition, but in terms of it Clarke states his fifth axiom: 

A5 $xCy \implies$ there exists a point P such that $\{x, y\} \subseteq P$. 


Now Biacino and Gerla (Biacino and Gerla, 1991) have shown that a model 
of Clarke’s axiom system A1–A5 is, after adjunction of the null region 0, nothing 
but a complete Boolean algebra (under the ordering $\le$), in which we have: 


join is fusion; 

complement is given by $x' = F\{z \mid \neg zCx\}$; 

connection is overlap: $xCy$ iff $\exists z.\ z \le x \wedge z \le y$ (written $xOy$); 

points are prime filters. 
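
The Boolean picture just described can be made concrete in the simplest finite case. The sketch below is only an illustration on a hypothetical three-atom universe: regions are the non-empty subsets, connection is taken to be overlap, and fusion and complement are then computed from the sets C(x) exactly as in the definitions above.

```python
from itertools import combinations

U = frozenset(range(3))   # hypothetical three-atom universe
REGIONS = [frozenset(c) for k in range(1, len(U) + 1) for c in combinations(sorted(U), k)]


def connected(x, y):
    """Connection read as overlap: the regions share a non-null part."""
    return bool(x & y)


def C(x):
    """C(x): the set of regions connected with x."""
    return frozenset(y for y in REGIONS if connected(x, y))


def fusion(X):
    """Return the region x with C(x) equal to the union of C(z), z in X."""
    target = frozenset().union(*(C(z) for z in X))
    matches = [x for x in REGIONS if C(x) == target]
    assert len(matches) == 1          # unique by A3
    return matches[0]


def complement(x):
    """x' = F{z | not zCx}; defined only when x is not the top region."""
    return fusion([z for z in REGIONS if not connected(z, x)])
```

In this model fusion coincides with set union, the order induced by inclusion of the sets C(x) coincides with set inclusion, and the complement is the set-theoretic one.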





Clearly, such a theory is unsuited to our needs. Simple graph models cannot 
be presented as Boolean algebras (without additional structure). By a “simple 
graph model” we mean a model in which the regions are subsets of some 
graph G, such that G (perhaps up to $T_0$-equivalence) can be recovered from the 
algebra. Even worse follows when Clarke’s sixth axiom (i.e. Axiom A2.1’ of 
Clarke, 1985) is added: as Biacino and Gerla show, the addition of this axiom 
is equivalent to requiring the Boolean algebra to be atomless, thus ruling out 
any kind of discrete model. 


Despite these unfortunate features, our review of Clarke’s theory has not 
been a waste of time. As we shall see later, the restriction of this theory to the 
Axioms A1–A4 can be very interesting for us. 

RCC (Region Connection Calculus: Randell et al., 1992; Cohn et al., 1997) 
is inspired by Clarke’s theory, but does away with the second order aspect 
(Axiom A3) and the reference to points (A5). As a purely first order theory of 
regions, it can claim to have advantages both methodological and practical (as 
a basis for automated reasoning). 

We shall not here consider the original RCC axioms, but rather an essentially 
equivalent algebraic formulation due to Stell, 2000: 


DEFINITION 12.31 Let A be a Boolean algebra. Take R (“regions”) to be the 
set of non-null elements of A; and take $R^-$ (the proper regions) to be $R - \{\top\}$. 
We say that (A; C), where C is a binary (“connection”) relation on A, is a 
(Boolean) connection algebra provided: 


(B1) C is symmetric, and its restriction to R is reflexive; 

(B2) Every proper region is connected with its complement; 

(B3) For non-null x, y, z: $C(x, y \vee z) \iff C(x, y)$ or $C(x, z)$; 

(B4) $\forall x \in R^-\ \exists y \in R^-\ \neg xCy$. 

The original RCC lacks the null element 0 (or $\bot$). This has had to be added to 
obtain a Boolean algebra, but the axioms B1–B4 are silent as to the connection 
properties of $\bot$. Stell has pointed out (private communication) that this can 
be remedied by removing the restriction to non-null regions in Axiom B3. 
Assuming this slight modification, we get that $\bot$ is not connected with any 
region. In the following remarks we assume that all regions mentioned are 
non-null. An easy deduction from B1 and B3 is that 


$xOz \implies xCz$. 


Now if x, y are regions such that $\neg(y \le x)$, then $x'$ overlaps y in the region 
$y - x$, and so $x' C y$. By contraposition we get: 


(12.1) $\neg(x' C y) \implies y \le x$. 


Let x be a proper region. By B2 we have $xCx'$. By B4 there exists a region 
y not connected with $x'$, and by (12.1) any such y must be strictly less than x 
(that is, $y \le x$ and $y \ne x$). Repeating the step, we get an infinite decreasing 
sequence of regions starting from x. 
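
The argument can be confirmed by exhaustive search in small finite Boolean algebras. The sketch below rests on two assumptions of ours: B3 is read as an equivalence, and the algebra is the power set of a finite set of atoms. Under these assumptions B1 and B3 force C to be determined by a reflexive, symmetric relation on the atoms, so it suffices to enumerate those relations and test B2 and B4.

```python
from itertools import combinations, product


def connection_models(n_atoms):
    """All connection relations satisfying B1-B4 on the power set of n_atoms atoms."""
    atoms = list(range(n_atoms))
    top = frozenset(atoms)
    regions = [frozenset(c) for k in range(1, n_atoms + 1) for c in combinations(atoms, k)]
    proper = [x for x in regions if x != top]
    pairs = list(combinations(atoms, 2))
    models = []
    for bits in product([False, True], repeat=len(pairs)):
        # A reflexive, symmetric relation on the atoms ...
        linked = {frozenset(p) for p, b in zip(pairs, bits) if b}

        def C(x, y):
            # ... extended to all regions in the only way B1 and B3 allow.
            return any(a == b or frozenset((a, b)) in linked for a in x for b in y)

        b2 = all(C(x, top - x) for x in proper)
        b4 = all(any(not C(x, y) for y in proper) for x in proper)
        if b2 and b4:
            models.append(linked)
    return models
```

For two, three and four atoms the search returns no models. With a single atom there are no proper regions at all, so B2 and B4 hold vacuously; this degenerate case is consistent with the text, whose argument starts from a proper region.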

Clearly, the fact that the lattice of regions is a Boolean algebra has been used 
heavily in this argument for the infinite divisibility of regions. Perhaps, then, a 
way to modify RCC so as to admit discrete models may be found by relaxing 
the assumption that the regions form a Boolean algebra. 

This indeed is the approach of Roy and Stell, 2002. As a first step, observe 
that the RCC axioms not only preclude discrete models, but any models (even if 
infinite) in which there occur atomic regions. But the argument (using B4), that 
a region x cannot be atomic, would fail if it were possible for the complement 
of x to be $\top$. For this possibility to be achieved, we may think about replacing 
the Boolean complement with the dual pseudo-complement: 


DEFINITION 12.32 Let L be a lattice with top element $\top$. If $a \in L$, an element 
$a'$ of L is the dual pseudo-complement of a provided that $a'$ is the least element 
of L whose join with a is $\top$; that is 


$\forall z \in L\ (a \vee z = \top \iff a' \le z)$. 

L will be called a dual p-algebra if it is distributive, and the dual 
pseudo-complement $a'$ exists for every $a \in L$. 


EXAMPLE 12.33 (A) The closed sets of any topological space X form a 
dual p-algebra. (The dual pseudo-complement of the closed set P is Cl(X − P).) 

(B) Let A be the dual p-algebra of closed sets of X, and define connection 
to mean overlap. Then, trivially, axioms B1 and B3 are satisfied. Moreover, 
B2 holds if X is connected (in fact, B2 is equivalent to the connectedness of 
X). Finally, B4 is satisfied in certain cases, for example if X is $\mathbb{R}$. 
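
Part (A) of the example can be checked exhaustively on a finite space. The sketch below uses a hypothetical five-point poset (the vertices and edges of a path), takes the lower sets as the closed sets, and computes the dual pseudo-complement as Cl(X − P); the names are our own.

```python
from itertools import combinations

# Hypothetical finite space: a poset given by its strict "below" relation.
# The closed sets are taken to be the lower sets (an Alexandroff topology).
POINTS = ["v1", "v2", "v3", "e12", "e23"]
BELOW = {"e12": {"v1", "v2"}, "e23": {"v2", "v3"}}   # end-points lie below edges


def down(S):
    """Closure of S: S together with everything below its elements.

    One step suffices here because the poset has no chain of three elements.
    """
    return frozenset(S) | frozenset(p for s in S for p in BELOW.get(s, ()))


def closed_sets():
    subsets = (frozenset(c) for k in range(len(POINTS) + 1) for c in combinations(POINTS, k))
    return [S for S in subsets if down(S) == S]


def dual_pseudo_complement(P):
    """Cl(X - P), as in Example 12.33(A)."""
    return down(frozenset(POINTS) - P)
```

For every closed set a in this space, `dual_pseudo_complement(a)` is closed and satisfies the defining equivalence of Definition 12.32. Since the underlying path is connected, every proper region also overlaps its dual pseudo-complement, as B2 requires.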


With an example like $\mathbb{R}$ we have a dual p-algebra which satisfies all the 
axioms B1–B4, and in which every region (closed set) is a join of atomic 
regions. If, with Roy and Stell, we want the model as a whole to be finite, B4 is 
problematic. For this reason, Roy and Stell drop B4, arriving at the notion of a 
connection algebra as a dual p-algebra satisfying B1–B3. Their main example 
is that of a cellular complex, viewed as a poset with Alexandroff topology (that 
is, with the lower sets taken as the closed sets, and thus as the regions). 

This approach is not taken very far by Roy and Stell, and it is not clear 
whether such a weak version of RCC is really viable. Moreover, the reduction 
of connection to overlap, as in these closed set models, seems to be contrary to 
the spirit of connection theory. 

Yet the idea of relaxing the Boolean algebra requirement is surely worth 
pursuing. After all, in the (by a long way) most developed theory of “regions”, 
namely frame/locale theory, we have a Heyting algebra rather than a Boolean 
algebra. If we give up the attempt to present our region theory as a modified 
RCC, it is possible to describe a rich algebra of regions, which is non-trivial 
even in the case of finite “spaces” (specifically, graphs). This was shown by 
Stell and Worboys, 1997, in work which we briefly discuss in our third (and 
last) main Example. 


5.3 Co-Heyting algebras 


Heyting algebras are very well known as structures with which to develop 
point-free (“region-based”) topology. Also interesting for geometry, although 
much more rarely discussed, are the dual Heyting (or co-Heyting) algebras. 
Lawvere, 1991 argued for the geometric significance of co-Heyting algebras, 
and the argument has recently been taken up by a number of authors, notably 
Stell and Worboys, 1997 (in connection with region geometry), and Pagliani, 
1998 (in the context of rough sets theory). We begin with the formal definition. 


DEFINITION 12.34 Let L be a bounded distributive lattice. If there is defined 
in L a binary operation (“implication”) $\to : L \times L \to L$ such 
that, for all $x, y, z \in L$, 


$x \le y \to z \iff x \wedge y \le z$, 

we say that $(L, \to)$ is a Heyting algebra. Dually, if in L we have a binary 
operation (“subtraction”) $\setminus$, satisfying: 


$y \setminus z \le x \iff y \le x \vee z$, 


then $(L, \setminus)$ is a co-Heyting algebra. If L has both an implication $\to$ and a 
subtraction $\setminus$, $(L, \to, \setminus)$ is a bi-Heyting algebra. 


REMARK 12.35 Evidently, L can have at most one implication (resp. 
subtraction) defined on it. Every Boolean algebra is bi-Heyting, since Boolean 
implication ($\neg x \vee y$) is both an implication and a subtraction in the sense of 
Definition 12.34. Indeed, a bi-Heyting algebra $(L, \to, \setminus)$ is a Boolean algebra 
iff $\to$ coincides with $\setminus$. 


If we think of the elements of L as regions then, informally, $y \to z$ is 
the largest region (if it exists) whose intersection with y is contained in z. 
The standard example is that in which L is taken to be the collection of open 
sets of a topological space. Dually, $y \setminus z$ is the smallest region whose union 
with z contains y. One readily guesses (and verifies) that the closed sets of a 
topological space constitute a co-Heyting algebra. 

We shall here be concerned with certain discrete spatial models, rather than 
general topological spaces. Moreover, for the existing applications we do not 
actually need the full binary operations, but only the corresponding (unary) 
complements. That is, we make use just of the negation $\neg : L \to L$, defined 
by: 


$\neg x = x \to 0$, 


or (in the co-Heyting case) of the supplement $\sim : L \to L$: 

$\sim x = 1 \setminus x$. 


An advantage claimed for the co-Heyting (or bi-Heyting) approach is the 
simple algebraic treatment of boundaries which it permits. Let us see how this 
works in the important graph example (Lawvere, 1991; Reyes and Zolfaghari, 
1996; Stell and Worboys, 1997). In this context, it is usual to consider a graph 
G to be a pair (V, E), where V, E are (disjoint) sets of vertices and edges, 
respectively, and each edge has either one or two vertices (thus multi-edges, as 
well as loops, are allowed). A graph $G' = (V', E')$ is a subgraph of G provided 
that $E' \subseteq E$, $V' \subseteq V$, and each edge $e \in E'$ has the same vertices in $G'$ as it 
has in G. Let, now, L be the collection of subgraphs of a graph G. Clearly, L is 
a bounded distributive lattice. But complements are not so straightforward. If 
$(V', E')$ is a subgraph of G, the “pairwise” complement $(V - V', E - E')$ is not 
necessarily a subgraph: an edge e belonging to $E - E'$ may have a vertex lying 
in $V'$. This is a symptom of the fact that L is not in general a Boolean algebra. 
It is, however, a bi-Heyting algebra. (This may be verified directly; it is also an 
immediate consequence of Remark 3 to follow.) The subgraph $(V', E')$ thus 
has a supplement. This is the smallest subgraph whose union with $(V', E')$ is 
G. That is, $\sim(V', E')$ is the set $E - E'$ of edges, together with their end-points. 


Figure 12.4. Supplement, $\sim R$. 

Figure 12.5. Boundary. 


This leads to the suggestion that the boundary of a “region” (subgraph) of a 
graph may be defined in the same way as is done in topology. The boundary 
of a closed subset C of a space X is by definition the intersection of C with 
the closure of X − C. In terms of co-Heyting algebra, this is $C \wedge \sim C$. The 
suggested definition is thus: the boundary of a graph region R is $R \wedge \sim R$. 
Thus, in Fig. 12.5, we get the vertices b, c, d (but no edges between distinct 
vertices). 
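
The supplement and boundary of a subgraph are easy to compute explicitly. The sketch below uses a hypothetical path graph (not the graph of Fig. 12.5) and our own encoding of a subgraph as a pair of frozensets. Note one assumption: vertices of G lying outside the region are always placed in the supplement, which also covers isolated vertices that are not end-points of any edge.

```python
# A graph is a pair (V, E): E maps each edge name to its set of end-points.
# Hypothetical example: a path a - b - c - d with edges named by their ends.
V = frozenset("abcd")
E = {"ab": frozenset("ab"), "bc": frozenset("bc"), "cd": frozenset("cd")}


def is_subgraph(Vs, Es):
    """Vertices and edges come from G, and every edge keeps its end-points."""
    return Vs <= V and Es <= set(E) and all(E[e] <= Vs for e in Es)


def join(R1, R2):
    return (R1[0] | R2[0], R1[1] | R2[1])


def meet(R1, R2):
    return (R1[0] & R2[0], R1[1] & R2[1])


def supplement(R):
    """~R: the smallest subgraph whose union with R is the whole graph."""
    Vs, Es = R
    edges = frozenset(E) - Es
    verts = (V - Vs).union(*(E[e] for e in edges))
    return (verts, edges)


def boundary(R):
    """The boundary of a region R, defined as R meet ~R."""
    return meet(R, supplement(R))
```

For the region consisting of the edge ab with its end-points, the supplement is the sub-path b - c - d and the boundary is the single vertex b, with no edges.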

Remark 1 By this definition, the boundary of a graph region is always “totally 
disconnected” (it contains no edges). 

Remark 2 Reyes and Zolfaghari, 1996, work with digraphs. However it is 
easy to see that the orientation plays no part in the treatment of boundaries (or 
in the bi-Heyting structure generally). Nothing is lost by confining attention 
(as in Stell and Worboys, 1997) to undirected graphs. 

Remark 3 It is easy to see that the co-Heyting algebra of subgraphs is a special 
case of the closed sets algebra. Given a graph G = (V, E), let $X = V \cup E$. 
Then the subgraphs of G (more precisely, the subsets $V' \cup E'$ corresponding 
to subgraphs $(V', E')$) are the closed sets of a topology on X. Spelling this 
out a little: order the set X by taking $x \le y$ if x = y or x is a vertex 
(end-point) of the edge y. Then the closed sets (in effect, the subgraphs) are simply 
the downward-closed subsets of X. Of course, we could equally well take 
these subsets as the open sets of a topology on X, which “explains” why the 
subgraphs of G give us a bi-Heyting algebra, not just a co-Heyting algebra. 

The notions boundary and dimension are closely related. We expect that, in 
general, a k-dimensional region will have a (k − 1)-dimensional boundary. From 
this point of view the preceding treatment of graph boundaries is somewhat 
disappointing: boundaries are always (on any reasonable definition of 
dimension) 0-dimensional. In topology, graphs are usually treated as one-dimensional 
“complexes”, so it could be said that this result is only to be expected. But if we 
are hoping that graphs can provide us with reasonably general “discrete spatial 
models”, a different treatment will be needed. This we attempt to provide in 
the next section. 

Stell and Worboys, 1997, provide a second type of example of bi-Heyting 
algebras, involving pairs of sets. Specifically, let L be the collection of pairs 
(S, T) of subsets of a ground set A, such that $S \subseteq T$. L is made into a 
distributive lattice by defining the lattice operations componentwise from the 
corresponding operations over the subsets of A. For operations that are not 
monotonic in every argument more care is needed, due to the constraint which 
has to be respected. But it can be checked that the following formula defines 
subtraction in L: 


$(S_1, T_1) \setminus (S_2, T_2) = (S_1 \cap (A \setminus S_2),\ (S_1 \cap (A \setminus S_2)) \cup (T_1 \cap (A \setminus T_2)))$. 


The dual formula gives implication. 

One possible interpretation of this scheme has to do with vagueness: A pair 
(S, T) is thought of as a description of a region with indeterminate boundaries, 
where S consists of elements which are certainly in the region, while T has 
elements which may be in the region. This example appears to be of quite 
a different character than the previous one involving graphs. Yet it also is 
reducible to closed sets algebra. To see this (with some details left to the 
reader), observe that pairs of subsets of A may be identified with subsets of 
$A \times 2$ ($= \{(a, i) \mid a \in A, i \in \{0, 1\}\}$). Next: determine a partial order $\le$ on 
$A \times 2$ such that the pairs $(S, T) \in L$ are thereby identified with the lower sets 
of $(A \times 2, \le)$. In this way we get a “closed sets” algebra of the pairs. 
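
That subtraction on pairs is the componentwise difference, enlarged in the second component so that the constraint $S \subseteq T$ is respected, can be verified by brute force. The sketch below checks the defining adjunction of Definition 12.34 over a hypothetical three-element ground set; the function names are ours.

```python
from itertools import combinations

A = frozenset("abc")   # hypothetical ground set


def subsets(S):
    items = sorted(S)
    return [frozenset(c) for k in range(len(items) + 1) for c in combinations(items, k)]


# L: all pairs (S, T) of subsets of A with S contained in T.
L = [(S, T) for T in subsets(A) for S in subsets(T)]


def leq(p, q):
    """Componentwise inclusion."""
    return p[0] <= q[0] and p[1] <= q[1]


def join(p, q):
    """Componentwise union."""
    return (p[0] | q[0], p[1] | q[1])


def subtract(p, q):
    """(S1, T1) \\ (S2, T2): componentwise difference, repaired so that S <= T."""
    (S1, T1), (S2, T2) = p, q
    lower = S1 - S2
    return (lower, lower | (T1 - T2))
```

Checking `leq(subtract(p, q), r) == leq(p, join(r, q))` for all triples of pairs confirms that `subtract` is the co-Heyting subtraction on this small lattice.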


The three extended examples in this section are intended to set the scene for 
our own positive proposals in the remaining sections. Note in particular the 
aspect of distributivity of the lattices involved. Quantum logic generally has 
non-distributive ortholattices. Studies in spatial reasoning on the other hand 
almost invariably assume distributivity, and often assume that a Boolean algebra 
is given. These distributivity assumptions are, we believe, at the source of 
difficulties which researchers in spatial reasoning have experienced in handling 
such notions as boundary and complement (especially when “discrete” models 
are intended). It is in this context that it can be beneficial to look at the work 
that has been done in quantum structures. 


6. (Boundary and) dimension 


As our next major topic, we consider dimension in discrete spaces. 
Dimension and boundary are closely related ideas. One of the main intuitions about 
dimension is that the dimension of a space should be one greater than that of 
the boundaries of portions of the space. The definition of “boundary” which we 
have seen in the previous section gives the result that all boundaries are totally 
disconnected. On this approach, every space will be assigned a dimension ≤ 1. 

On the other hand, a too-literal adaptation of the topological boundary con- 
cept will (as we shall see) result in an implausibly high dimension for simple 
discrete spaces. Even the careful approach of Evako, 1994, Evako et al., 1996, 
which is the closest to that which we shall adopt here, results in uncomfortably 
“high” dimension for our basic spatial models. 

We begin by reviewing some of the main desiderata for a theory of dimension. 
Just as in topology (Hurewicz and Wallman, 1948), we shall expect: 


1 The dimension function (dim) is a “topological” invariant. 
2 The dimension of $I_m^n$ (our discrete version of the n-cube) should be n. 


3 Monotonicity. If X is a subspace (induced subgraph) of Y, dim(X) ≤ 
dim(Y). 


Moreover we continue to adhere, as far as possible, to our Correspondence 
Principle. Specifically, we aim to provide a formulation that works well at least 
for tolerance spaces, and possibly for larger classes of graphs, and is “close 
enough” to standard topology. Still more convincing, no doubt, would be to 
provide an account at the level of closure spaces: a common generalization 
of topological and tolerance space definitions. As to whether this is possible, 
however, we shall have to leave for later research to determine. 

Recall the definition of “small inductive dimension”, dim, in topology: 


DEFINITION 12.36 If X is the empty space, dim(X) = −1. A non-empty 
space X has dimension ≤ n provided that dim(x) ≤ n for every x ∈ X, where 
dim(x) ≤ n means that x has arbitrarily small neighbourhoods with boundary 
of dimension ≤ n − 1. 


This is easy to calculate in spaces having Alexandroff topology, as we only have
to consider the (unique) smallest neighbourhood N(x) of x when computing
dim(x).


EXAMPLE 12.37 Let X be a finite poset with the Alexandroff (upper sets)
topology. Then dim(X) = h, where h is the length of (the longest chain in) X.
Indeed, suppose that x is a maximal point of X. Then the least neighbourhood
of x is {x}, and its boundary Cl({x}) − Int({x}) is the set of points strictly
below x, which has length h − 1. Moreover, every set of the form ↑y, y ∈ X,
clearly has a boundary with length ≤ h − 1. An inductive argument yields the
result, dim(X) = h.
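
The calculation in Example 12.37 can be tried out mechanically on small posets. The following is only a sketch under our own conventions: the function name `small_inductive_dim` and the encoding of the order as a predicate `leq` are illustrative assumptions, not notation from the text. It applies Definition 12.36 directly, using the least neighbourhood ↑x and taking closures as down-sets.

```python
def small_inductive_dim(points, leq):
    """Small inductive dimension of a finite poset with the upper-set topology.

    points: iterable of hashable elements; leq(a, b): the partial order a <= b.
    (Both names are illustrative assumptions.)
    """
    points = frozenset(points)
    if not points:
        return -1                      # the empty space has dimension -1
    best = -1
    for x in points:
        # Least neighbourhood of x: the up-set of x, which is open.
        nbhd = frozenset(y for y in points if leq(x, y))
        # Closed sets are down-sets, so Cl(N) is the down-set generated by N.
        closure = frozenset(y for y in points if any(leq(y, z) for z in nbhd))
        boundary = closure - nbhd      # Int(N) = N because N is open
        # The subspace topology on the boundary is again the upper-set
        # topology of the induced order, so we may recurse.
        best = max(best, small_inductive_dim(boundary, leq) + 1)
    return best


chain = small_inductive_dim(range(3), lambda a, b: a <= b)          # 0 < 1 < 2
antichain = small_inductive_dim(range(3), lambda a, b: a == b)      # no relations
divisors = small_inductive_dim([1, 2, 3, 6], lambda a, b: b % a == 0)
```

On these three posets the computed value agrees with the length of the longest chain, as the Example asserts. The recursion is exponential and is meant only for very small examples.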


Suppose that we try to apply this formulation, unmodified, to a tolerance 
space G. Of course, we view G as a closure space, and Cl, Int are interpreted 
accordingly. First, if G is a clique, then it is assigned the dimension 0 (the 
boundary of every neighbourhood is empty). Next, if G consists of some cliques 
joined in a row, as for example G’ in Fig. 12.6, then we get dimension 1 (the 
boundary of each N (x) is either a clique or empty). If the cliques are arranged 
in a “corner”, however, as in G” (Fig. 12.7) we get dimension 2, since some 
of the vertices of G” have neighbourhoods with boundary G’. The purpose of 
this series of little examples is to lead up to the case G = $I_m^2$ (m > 2). In fact,
we evidently get dim(G) = 3 in this case (instead of 2 as we intend).












































Figure 12.6. G′: dim = 1.
Figure 12.7. G″: dim = 2.


It is (it seems) easy to see why we are getting too high a dimension: as 
against the “boundaries” we encountered in the previous section (which were 
too thin), we now have boundaries which are too thick. A possible solution 
lies to hand: the approach taken in Evako et al., 1996. According to this, we 
ignore the topological definition of “boundary”, and simply decide that the only 
boundary set that we need to consider, for a vertex v ∈ G, is N(v) − {v}. The
digital n-sphere (Evako, 1994) provides a pleasing example. As presented by
Evako, this is the complete (n + 1)-partite graph K(2,2,...,2). Dually, it may be
described as the result of deleting the edges between opposite pairs of vertices 
of the cube 77. (These figures are combinatorially dual in the sense in which 
the regular octahedron and cube are combinatorially dual: vertices and faces
are interchanged.) 

This approach also, however, is not without problems. The dimension of the
clique $K_n$ now evaluates to n − 1. As a result, we still get that dim($I_m^2$) = 3.
Higher powers of $I_m$ give an even worse result: since $I_m^n$ contains $2^n$-cliques,
it has dimension at least $2^n - 1$.

We shall find that a slight modification of the definition of Evako et al., 
1996, will give us all that we require, while leaving undisturbed their main 
examples (the digital n-spheres, and surfaces generally). A first difficulty that
we should address concerns the formula N(v) − {v}. This seems to be so
far removed from the topological boundary formula that it must be forbidden
by the Correspondence Principle. However, the discrepancy is not necessarily
as great as it seems. It must be remembered that the closure space definition
(for the required boundary) need not be syntactically the same as the standard
topological definition, provided it reduces to the latter when the closure is
idempotent. The topic is too speculative for us to go into details here, but there
is evidence that at the closure space level the definition takes (in part) the form
Cl(Int(N)) − Int(N), where N is the chosen neighbourhood of v. This is in
fact the topological boundary, for open N, whilst it also corresponds closely to
the formula N(v) − {v} in the case of a tolerance space.

Although we shall retain the formula N(v) − {v}, we shall modify the
condition which relates the dimension at a point to the dimension of the
corresponding boundary set. The principle here is that the mere duplication
of a point should not change the dimension of a space. More precisely, if
x, x′ ∈ X, with x ≡ x′ in the specialization order of X, then we should have
dim(X) = dim(X − {x′}). For example, with X as the graph (tolerance
space) G of Fig. 12.8,


Figure 12.8. G.
Figure 12.9. G′ = G \ x′.

we get:
dim(G) = dim(G′) = 1.


We thus arrive at the following definition. The notation here is that dim(G)
is the dimension of G, while $\dim_G(x)$ is the dimension at the vertex x of G.
Also we denote the punctured neighbourhood N(x) − {x} by $N_0(x)$.


DEFINITION 12.38 If G is empty, dim(G) = −1. If G is non-empty, dim(G) =
sup{$\dim_G(x)$ | x ∈ V(G)}. For x ∈ V(G), we put:

$$\dim_G(x) = \begin{cases} \dim(N_0(x)), & \text{if } \exists y \in N_0(x).\ N(x) \subseteq N(y) \\ \dim(N_0(x)) + 1, & \text{otherwise.} \end{cases}$$


REMARK 12.39 The first alternative in the expression for $\dim_G(x)$ in the
preceding definition is usually expressed by saying that x is a dominated vertex.
It means that, in the subspace N(x), there is a vertex y (∈ $N_0(x)$) such that
$x \equiv_{N(x)} y$.


REMARK 12.40 We shall henceforth assume that our tolerance spaces are 
finite dimensional (although this restriction can be avoided fairly easily if 
desired). 
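
Definition 12.38 is directly executable on finite graphs. The sketch below is an illustration under our own assumptions: a tolerance space is encoded as a dictionary `adj` from each vertex to the set of its other neighbours (reflexivity is left implicit), and the names `dim`, `induced`, `clique` and `king_grid` are hypothetical helpers, not notation from the text. The flag `always_add_one=True` drops the dominated-vertex alternative, which is our reading of the recursion of Evako et al., 1996 as described above; `king_grid(3)` is our encoding of $I_3^2$, with two grid points related when they differ by at most 1 in each coordinate.

```python
def induced(adj, verts):
    """Subgraph induced on `verts` (adj maps vertex -> set of other neighbours)."""
    verts = frozenset(verts)
    return {v: adj[v] & verts for v in verts}


def dim(adj, always_add_one=False):
    """Dimension in the sense of Definition 12.38 (or without the
    dominated-vertex case when always_add_one is True)."""
    if not adj:
        return -1
    best = -1
    for x, nbrs in adj.items():
        d = dim(induced(adj, nbrs), always_add_one)    # dim(N0(x))
        closed_x = nbrs | {x}                           # N(x)
        # x is dominated if some y in N0(x) satisfies N(x) ⊆ N(y).
        dominated = any(closed_x <= (adj[y] | {y}) for y in nbrs)
        if always_add_one or not dominated:
            d += 1
        best = max(best, d)
    return best


def clique(n):
    return {i: {j for j in range(n) if j != i} for i in range(n)}


def king_grid(m):
    pts = [(i, j) for i in range(m) for j in range(m)]
    return {p: {q for q in pts if q != p
                and abs(p[0] - q[0]) <= 1 and abs(p[1] - q[1]) <= 1}
            for p in pts}


results = {
    "K4": (dim(clique(4)), dim(clique(4), always_add_one=True)),
    "I_3^2": (dim(king_grid(3)), dim(king_grid(3), always_add_one=True)),
}
```

For the clique on four vertices and for the 3 × 3 grid, the first component of each pair is the value under Definition 12.38 and the second the value without the dominated-vertex case, so the two approaches can be compared side by side.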


In particular, if G is a clique, then all vertices of G are equivalent, and it 
follows that dim(G) = 0. The dimension of cliques is particularly significant. 
It is easy to see, indeed, that under the definition of Evako et al., 1996, the 
dimension of G is the dimension of its largest clique, that is, c(G) − 1. Working
instead with Definition 12.38, do we get any simple relation with the size of 
cliques? Let us consider those non-empty cliques of G which are intersections 
of (one or more) maximal cliques. As we shall see in a moment, there are 
grounds for considering these intersection cliques as the cells (or faces) of G. 
We shall denote the poset of these cells, ordered by inclusion, as cell(G). Recall 
that the level of an element x of a poset P (of bounded chain-length) is the length
of a longest chain below x, and that the length of P is the level of an element
of greatest level, that is, the length of a longest chain in P. We take the length
of the empty poset to be −1.


THEOREM 12.41 For any non-empty tolerance space G, 


dim(G) = length(cell(G)). 


Proof We prove by induction on |V(G)| that, at each point x of G, $\dim_G(x)$
is the length of the poset $\mathrm{cell}_x(G)$ of those cells which contain x. This is
obviously true if |V(G)| = 1. Suppose that v ∈ V(G), with $N_0(v)$ non-empty.
There are two cases. In the first case, v is dominated by v′ ∈ $N_0(v)$. If C
is any clique contained in $N_0(v)$, then C ∪ {v′} is also a clique contained in
$N_0(v)$. Thus, given any chain K of cells in $N_0(v)$, we have a chain of cells
containing v′, of the same length as K, in $N_0(v)$. Since v ≡ v′ in N(v), we have
$\mathrm{cell}_v(N(v)) = \mathrm{cell}_{v'}(N_0(v))$. By the induction hypothesis we can conclude
that

$$\dim_G(v) = \dim(N_0(v)) = \mathrm{length}(\mathrm{cell}_{v'}(N_0(v))) = \mathrm{length}(\mathrm{cell}_v(N(v))) = \mathrm{length}(\mathrm{cell}_v(G))$$

(since a clique containing v is necessarily a subset of N(v)).

In the second case we have, for each v′ ∈ $N_0(v)$, a v″ ∈ $N_0(v)$ such that
¬(v″ ~ v′). Any maximal clique containing v, v″ thus excludes v′. It follows
that {v} can be expressed as an intersection of maximal cliques; that is, {v} is
a cell. Let (invoking the induction hypothesis) u be a vertex in $N_0(v)$ such that
$\dim(N_0(v)) = \mathrm{length}(\mathrm{cell}_u(N_0(v)))$. Further, let K be a maximal chain in
$\mathrm{cell}_u(N_0(v))$. By extending each cell C ∈ K to C ∪ {v}, and taking also the cell
{v}, we obtain a (maximal) chain K′ in $\mathrm{cell}_v(N(v))$, of length length(K) + 1.
We conclude that, in this case,

$$\dim_G(v) = \dim(N_0(v)) + 1 = \mathrm{length}(\mathrm{cell}_v(N(v))) = \mathrm{length}(\mathrm{cell}_v(G)).$$


QED 
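
The right-hand side of Theorem 12.41 can also be computed by brute force on small graphs. The following sketch makes the same encoding assumption as before (a dictionary from vertices to sets of other neighbours); `maximal_cliques`, `cells` and `poset_length` are hypothetical helper names. It enumerates the maximal cliques, closes them under non-empty intersection to obtain cell(G), and measures the longest chain under inclusion.

```python
from itertools import combinations


def maximal_cliques(adj):
    """All cliques that cannot be extended (brute force, small graphs only)."""
    verts = list(adj)
    cliques = [frozenset(c)
               for r in range(1, len(verts) + 1)
               for c in combinations(verts, r)
               if all(b in adj[a] for a, b in combinations(c, 2))]
    return [c for c in cliques if not any(c < d for d in cliques)]


def cells(adj):
    """Non-empty intersections of one or more maximal cliques."""
    maxes = maximal_cliques(adj)
    found = set(maxes)
    frontier = set(maxes)
    while frontier:
        frontier = {c & m for c in frontier for m in maxes} - found - {frozenset()}
        found |= frontier
    return found


def poset_length(poset):
    """Length of a longest chain under strict inclusion; -1 for the empty poset."""
    level = {}
    for c in sorted(poset, key=len):            # proper subsets come first
        level[c] = max((level[d] + 1 for d in poset if d < c), default=0)
    return max(level.values(), default=-1)


def clique(n):
    return {i: {j for j in range(n) if j != i} for i in range(n)}


def path(n):
    return {i: {j for j in (i - 1, i + 1) if 0 <= j < n} for i in range(n)}


def king_grid(m):
    pts = [(i, j) for i in range(m) for j in range(m)]
    return {p: {q for q in pts if q != p
                and abs(p[0] - q[0]) <= 1 and abs(p[1] - q[1]) <= 1}
            for p in pts}


lengths = {name: poset_length(cells(g))
           for name, g in [("K4", clique(4)), ("P3", path(3)), ("I_3^2", king_grid(3))]}
```

In the 3 × 3 grid the maximal cliques are the four 2 × 2 blocks, and a longest chain of cells runs from the centre point through a shared pair to a block, which illustrates why the cells behave like the small squares and their faces.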


We have already remarked that in many important cases our definition gives 
the same result as that of Evako et al., 1996. In particular this is the case for 
the closed surfaces as defined in that work: 


DEFINITION 12.42 A 0-surface is a disconnected graph (tolerance space)
with exactly two vertices. For n > 0, an n-surface is a non-empty connected
graph G such that, for each vertex v of G, $N_0(v)$ is an (n − 1)-surface.


In terms of topology, the surfaces defined here are closed surfaces, as opposed 
to surfaces with boundary. It is an easy exercise to show that every n-surface 
has dimension n at each of its points (vertices). 

The “simplest” n-dimensional surface is the digital n-sphere
(= n-crosspolytope as in Sec. 9 below; cf. also the “spin manual”, Sec. 5.1): the
complete (n + 1)-partite graph K(2,2,...,2). We denote the digital n-sphere
by $DS^n$. One way to make precise the claim about the simplicity of the sphere
is given by the following:


PROPOSITION 12.43 Let S be any n-surface. Then there exists a surjective
morphism from S onto $DS^n$.


Proof By induction on n. If n = 0, we have S = $DS^0$. Suppose then
that n = k > 0, and assume for the induction that every (k − 1)-surface can
be mapped onto $DS^{k-1}$. Given the k-surface S, let p, q be arbitrarily chosen
points of S, $DS^k$, respectively. The (punctured) neighbourhood $N_0(p)$ of p in
S is a (k − 1)-surface, and thus we have a surjection $e_{k-1} : N_0(p) \to DS^{k-1}$.
We identify $DS^{k-1}$ with the neighbourhood of q in $DS^k$. Denote by −q the
remaining point of $DS^k$ (antipodal to q). Then the required surjection
$e_k : S \to DS^k$ is defined by:

$$e_k(x) = \begin{cases} q, & \text{if } x = p \\ e_{k-1}(x), & \text{if } x \in N_0(p) \\ -q, & \text{otherwise.} \end{cases}$$

This is a morphism, as p is not adjacent to any element of S \ N(p). QED
In the next few Propositions we indicate some initial steps in the discrete
theory of surfaces and dimension, focussing on decomposition results. 


PROPOSITION 12.44 Suppose that G = A * B is a surface. Then each
summand A, B is either empty or a surface.


Proof By induction on |G|. The assertion is immediate for |G| ≤ 2. Suppose
then that G′ = A * B is a surface, with |G′| = n > 2, and assume (for
the induction) that the assertion holds for all G with |G| < n. If one of the
summands, say A, is empty, then there is nothing to prove, since in that case B
is isomorphic with G′, and is thus a surface. So assume that both summands
are non-empty, and choose a vertex from either one of them, say a ∈ A. Then
the (punctured) neighbourhood of a in G′ is $N_0(a)$ * B, where $N_0(a)$ is the
neighbourhood of a in A. Since G′ is a surface and |G′| > 2, $N_0(a)$ * B is a
surface. Hence (by the induction hypothesis) B is a surface. QED


COROLLARY 12.45 No cone (i.e. graph of the form 1 * G) is a surface.


PROPOSITION 12.46 No surface is dismantlable. 


Proof In fact no tolerance space with a dominated vertex can be a surface.
For suppose that v is a dominated vertex of the space G; say v is dominated by
u ∈ $N_0(v)$. Then u is connected to every vertex of $N_0(v)$, so that $N_0(v)$ can
be written as u * ($N_0(v)$ \ u). By the preceding Corollary, $N_0(v)$, and hence
G, is not a surface. QED


The converse of Proposition 12.44 also holds. In fact we have the following 
(= Theorem 5 of Evako et al., 1996): 


PROPOSITION 12.47 If G is an n-surface and H is an m-surface, then G * H
is an (n + m + 1)-surface.
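
Definition 12.42, Corollary 12.45 and Proposition 12.47 lend themselves to a quick mechanical check on small graphs. The sketch below is illustrative only: the dictionary encoding of graphs and the names `surface_dim`, `join`, `cycle` and `two_points` are our own assumptions. `surface_dim` returns n when the graph is an n-surface and `None` otherwise; `join` forms the disjoint union of two graphs together with every edge between them.

```python
def induced(adj, verts):
    verts = frozenset(verts)
    return {v: adj[v] & verts for v in verts}


def is_connected(adj):
    start = next(iter(adj))
    seen, stack = {start}, [start]
    while stack:
        for w in adj[stack.pop()]:
            if w not in seen:
                seen.add(w)
                stack.append(w)
    return len(seen) == len(adj)


def surface_dim(adj):
    """Return n if adj is an n-surface (Definition 12.42), else None."""
    if len(adj) == 2 and all(not nbrs for nbrs in adj.values()):
        return 0                                   # two non-adjacent vertices
    if not adj or not is_connected(adj):
        return None
    # Every punctured neighbourhood must be an (n - 1)-surface for the same n.
    dims = {surface_dim(induced(adj, nbrs)) for nbrs in adj.values()}
    if len(dims) != 1 or None in dims:
        return None
    return dims.pop() + 1


def join(g, h):
    """Join G * H: disjoint union plus all edges between G and H."""
    gv = {("g", v) for v in g}
    hv = {("h", v) for v in h}
    out = {("g", v): {("g", w) for w in g[v]} | hv for v in g}
    out.update({("h", v): {("h", w) for w in h[v]} | gv for v in h})
    return out


def cycle(k):
    return {i: {(i - 1) % k, (i + 1) % k} for i in range(k)}


two_points = {0: set(), 1: set()}          # the 0-surface
one_point = {0: set()}                     # the graph written 1 in the text

octahedron = join(cycle(4), two_points)    # a 1-surface joined with a 0-surface
cone = join(one_point, cycle(4))
```

Joining the 4-cycle with the 0-surface gives the complete 3-partite graph K(2,2,2), and the checker reports it as a 2-surface, in line with n + m + 1 = 1 + 0 + 1. The cone over the 4-cycle is rejected, as Corollary 12.45 predicts.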


As already mentioned, an important case in which we differ from Evako
et al., 1996 is the “digital n-cube” $I^n$. It follows from the Product Theorem,
to be given in a moment, that this receives the “correct” dimension. The same
case serves to explain our terminology of “cells”: by inspection we see that, at
least in the interior of the digital cube, the cells are indeed the small cubes and
their faces. Polytopes provide a further source of illuminating examples. In
fact, given a polytope Π, we derive a graph G(Π) by making each facet of Π
into a maximal clique (that is, for each facet F, an edge is created for each pair
of vertices of F). The graph G(Π) and its cells thus constitute a combinatorial
version of Π.
THEOREM 12.48 (PRODUCT THEOREM) For any non-empty tolerance
spaces G, H,
dim(G × H) = dim(G) + dim(H).


Proof We note first that a subspace S ⊆ G × H is a maximal clique iff S
is of the form A × B, where A, B are maximal cliques in G, H respectively.
Then, F is a face (of G × H) contained in such S iff F is A′ × B′, where
A′, B′ are faces of G, H contained in A, B respectively. Finally, note that if K
is a maximal chain of cells in G × H, with greatest element S, each link of K
(connecting a (k + 1)-cell in K to a k-cell contained in it) corresponds to a link
in either A or B. In particular this applies if K is a chain of maximal length
(below S); thus
level(S) = level(A) + level(B).


Applying this to an S of greatest level, we have the result. QED
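
The Product Theorem can be spot-checked numerically. In the sketch below, which again uses our own assumed encoding and helper names (`product`, `dim`, `path`, `cycle`, `clique`), two pairs are related in G × H when their first components are equal or adjacent in G and their second components are equal or adjacent in H. This is our reading of the product of tolerance spaces, with reflexivity left implicit in the adjacency dictionaries.

```python
def induced(adj, verts):
    verts = frozenset(verts)
    return {v: adj[v] & verts for v in verts}


def dim(adj):
    """Dimension as in Definition 12.38 (brute-force recursion)."""
    if not adj:
        return -1
    best = -1
    for x, nbrs in adj.items():
        d = dim(induced(adj, nbrs))
        dominated = any((nbrs | {x}) <= (adj[y] | {y}) for y in nbrs)
        best = max(best, d if dominated else d + 1)
    return best


def product(g, h):
    """Product of tolerance spaces: componentwise 'equal or adjacent'."""
    def near(adj, u, v):
        return u == v or v in adj[u]
    verts = [(a, b) for a in g for b in h]
    return {(a, b): {(c, d) for (c, d) in verts
                     if (c, d) != (a, b) and near(g, a, c) and near(h, b, d)}
            for (a, b) in verts}


def path(n):
    return {i: {j for j in (i - 1, i + 1) if 0 <= j < n} for i in range(n)}


def cycle(k):
    return {i: {(i - 1) % k, (i + 1) % k} for i in range(k)}


def clique(n):
    return {i: {j for j in range(n) if j != i} for i in range(n)}


pairs = [(path(3), path(3)), (path(3), clique(2)), (cycle(4), path(3))]
checks = [(dim(product(g, h)), dim(g) + dim(h)) for g, h in pairs]
```

Each entry of `checks` pairs the dimension of a product with the sum of the dimensions of its factors. The first case is the 3 × 3 digital square, which is where the unmodified definitions discussed earlier gave too large a value.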


Note. In taking account of Evako’s approach, we have worked mainly with 
the systematic presentation in Evako et al., 1996. There are however indications 
in Evako, 1994, especially as seen in the Examples there, that, in the case of 
tolerance spaces, something closer to our Definition 12.38 was intended by 
Evako. 

In the definition of closed surfaces (Definition 12.42), comparison with
classical manifolds would suggest that, for each vertex v, $N_0(v)$ should be required
to be an (n − 1)-sphere, rather than just an (n − 1)-surface. Indeed,
something of this sort is generally done, both by Evako and by other authors. An
accurate treatment of this theme would, however, require that we carefully 
consider what is to count as “a” digital n-sphere, rather than “the” (minimal) 
digital n-sphere discussed above. This in turn involves the question of the 
(homotopy-)equivalence of digital spaces: an extremely important topic, but 
one that is too complex to enter into here. 


7. Discrete region geometry
7.1 Abstract convexity


To help motivate the approach which we take here to region geometry (and 
to convexity in particular), we begin this section with a brief discussion of the 
abstract convexity spaces which were mentioned in Sec. 4. In this context, 
an algebraic closure system will be renamed a convexity structure, the closed 
sets now being called convex. It is often convenient to impose the normalizing
conditions that the empty set and all singletons are convex; let us assume that
at least the first of these conditions is imposed. As regards notation, we shall
for the moment use Conv for the closure (i.e. convex hull) operation. (Later,
a square bracket notation will be used.) 
On this very slender basis one can define and study a range of notions
important in convexity and polytope theory. Take an open interval to be any set
of the form Conv{a, b} \ {a, b}. Notation: (a, b).


DEFINITION 12.49 Let C be a convex set of the structure (X, Conv). Then
an interior point of C is any point p ∈ C such that

∀x ∈ C \ {p} ∃y ∈ C. p ∈ (x, y).

Further, a face of C is any convex subset F of C such that:

∀x, y ∈ C. (x, y) ∩ F ≠ ∅ ⇒ x, y ∈ F.


Recall that in real analysis, the relative interior of a convex set C ⊆ $\mathbb{R}^n$,
relint(C), is defined as follows. First construct the affine hull ⟨C⟩ of C in
$\mathbb{R}^n$. Endow ⟨C⟩ with the subspace topology derived from $\mathbb{R}^n$. Then take
relint(C) to be the topological interior of C in ⟨C⟩. It happens that the
abstract convexity approach enables one to define the relative interior, and other 
useful notions, with far less conceptual apparatus than is needed classically. In 
particular we have the following proposition, whose proof is left to the reader 
as an exercise: 


PROPOSITION 12.50 Let C ⊆ $\mathbb{R}^n$ be convex. Then p ∈ relint(C) if and
only if p is an interior point of C in the sense of Definition 12.49. Also, if P is
a polytope in $\mathbb{R}^n$, Definition 12.49 identifies the faces of P in the usual sense.


It is immediate from the definition that any intersection of faces of a convex
set C (in an arbitrary convexity structure) is again a face of C; thus we have a
complete lattice of faces. A special case of this is the familiar face lattice of a
polytope (in $\mathbb{R}^n$).

Any face of C other than C itself is called a proper face. Concerning proper
faces and the set $\mathcal{I}(C)$ of interior points of C, we have the following very easy
result:


PROPOSITION 12.51 Every proper face of the convex set C is disjoint from
$\mathcal{I}(C)$.


Proof Let p be any element of the proper face F. Choose a point x ∈ C \ F.
Then there does not exist any point y ∈ C with p ∈ (x, y) (for if such y exists,
then by definition of a face, x, y ∈ F). Thus p ∉ $\mathcal{I}(C)$. QED


One might hope for a stronger result here, namely that every point of C belongs
either to $\mathcal{I}(C)$ or to a proper face (but not both). This, however, does not hold
in an arbitrary convexity structure. To ensure that results such as this hold, one
would need to impose additional axioms on the convexity structure involved;
see in particular (Coppel, 1998; van de Vel, 1993). We do not follow this path
here, as the additional axioms include density properties which conflict with
our desire for discrete models.

Given a convexity structure (X, Conv) and a subset Y ⊆ X, we have the
evident substructure convexity on Y, where the convex sets in Y are the sets
C ∩ Y, where C is convex in X. This gives us a plentiful supply of discrete
convex structures.


EXAMPLE 12.52 Let a, b, c be the vertices of a triangle in $\mathbb{R}^2$. Let p, q, r be the
midpoints of the sides bc, ..., and o the centroid, that is, the intersection
of ap, bq, cr. Take the set T as {a, b, c, p, q, r, o}, and let (T, Conv) be the
convexity inherited from $\mathbb{R}^2$. We readily check that the sole interior point of T
is o, and that the faces of T are as expected. However, if the point p is deleted,
giving (T \ p, Conv), the point o is no longer an interior point (nor does it
belong to any proper face).


For a more extreme illustration of the difficulties arising from not having 
“enough” points in a discrete model, consider the following: 


EXAMPLE 12.53 Let P be a polytope in some Euclidean space $\mathbb{R}^n$, and let
T be the set of vertices of P. Consider the substructure (T, Conv), with the
inherited convexity. According to Definition 12.49, we have to regard every
subset of T as a face of the polytope T.
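
Examples 12.52 and 12.53 can be checked by direct enumeration. The sketch below is an illustration under our own assumptions: the particular integer coordinates, the choice of q and r as the midpoints of ca and ab, and all function names are ours. It implements Definition 12.49 for a finite planar point set with the convexity inherited from the plane, using exact integer arithmetic.

```python
from itertools import combinations


def cross(o, a, b):
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def strictly_between(p, x, y):
    """p lies in the open interval (x, y) = Conv{x, y} \\ {x, y}."""
    if p == x or p == y or cross(x, y, p) != 0:
        return False
    return (min(x[0], y[0]) <= p[0] <= max(x[0], y[0])
            and min(x[1], y[1]) <= p[1] <= max(x[1], y[1]))


def in_hull(p, pts):
    """p is in the planar convex hull of pts (point, segment or triangle test)."""
    pts = list(pts)
    if p in pts or any(strictly_between(p, x, y) for x, y in combinations(pts, 2)):
        return True
    for a, b, c in combinations(pts, 3):
        if cross(a, b, c) == 0:
            continue                     # collinear triples are covered by segments
        signs = (cross(a, b, p), cross(b, c, p), cross(c, a, p))
        if not (min(signs) < 0 < max(signs)):
            return True
    return False


def is_convex(F, T):
    """F is convex in the substructure on T: Conv(F) ∩ T = F."""
    return all(p in F for p in T if in_hull(p, F))


def interior_points(C):
    return {p for p in C
            if all(any(strictly_between(p, x, y) for y in C) for x in C if x != p)}


def faces(C):
    C = list(C)
    out = []
    for r in range(len(C) + 1):
        for F in map(frozenset, combinations(C, r)):
            if is_convex(F, C) and all(
                    x in F and y in F for x, y in combinations(C, 2)
                    if any(strictly_between(pt, x, y) for pt in F)):
                out.append(F)
    return out


# Assumed coordinates for Example 12.52.
a, b, c = (0, 0), (6, 0), (0, 6)
p, q, r, o = (3, 3), (0, 3), (3, 0), (2, 2)
T = frozenset([a, b, c, p, q, r, o])
# Assumed rectangle for Example 12.53 (vertices only).
RECT = frozenset([(0, 0), (4, 0), (4, 2), (0, 2)])
```

With these coordinates, `interior_points(T)` picks out the centroid alone, and deleting p removes even that. For the four vertices of the rectangle, `faces(RECT)` returns every subset, including the two diagonals, which is the difficulty the Example points to.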


A face of a convex set is by definition a convex subset F such that no interval 
can “cross” F. Taking the polytope P of Example 12.53 to be a rectangle abcd, 
ac fails to be a face as it is crossed by the other diagonal bd. The question arises 
as to whether, in going to the discrete substructure in which we retain only the 
four vertices, there is some sense in which we can assert that the diagonals cross, 
even though they no longer have a point in common. (Intuitively, it appears 
to be so.) Our solution will in effect be to treat the notion, that two figures
(especially polytopes) meet in their interiors, as a primitive (indeed, the main
primitive) of our geometric theory. In terms of our work in previous sections,
this is a connection relation which gives rise to various structures involving 
closure spaces and systems, as discussed above. 


7.2 Cell Geometry Axioms 


For our version of region geometry, then, we suppose that regions are built 
from “basic regions” thought of as cells (= polytopes). Moreover a connection 
relation, denoted ~, is assumed to be defined over the cells, and thence over
the regions. Formally, we take as a starting point the following definition: 
DEFINITION 12.54 A cell geometry is a pair ($\mathcal{P}_{\mathrm{Fin}}(X)$, ~), where ~ is a
reflexive, symmetric relation defined on the non-empty finite subsets of a
ground-set X, satisfying:

(Add) A ~ B, A′ ~ B′ ⇒ A ∪ A′ ~ B ∪ B′.

Instances of the relation ~ will sometimes be called quasi-equations.


Remark. We require only additivity, rather than monotonicity (A ~ B ⊆ B′
⇒ A ~ B′), as we want to allow for the interpretation that A ~ B
means that the cells A, B meet in their relative interiors. For example, if B
is (interpreted as) a closed interval [xy] in R, with x < y, then we have
{x} ~ {x} while {x} ≁ [xy], so that monotonicity fails.
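
A one-dimensional toy model makes the Remark concrete. In the sketch below (our own illustrative construction, with hypothetical names `connected`, `cells` and `additive`), the ground set is four points on a line, a cell is any non-empty subset, and two cells are connected when the relative interiors of their hulls overlap: a single point is its own relative interior, and a cell with distinct minimum and maximum has the open interval between them.

```python
from itertools import combinations

POINTS = (0, 1, 2, 3)


def cells(points):
    """All non-empty finite subsets of the ground set."""
    pts = sorted(points)
    return [frozenset(c) for r in range(1, len(pts) + 1)
            for c in combinations(pts, r)]


def connected(a, b):
    """A ~ B: the relative interiors of the hulls of A and B overlap."""
    lo_a, hi_a, lo_b, hi_b = min(a), max(a), min(b), max(b)
    if lo_a == hi_a and lo_b == hi_b:          # two single points
        return lo_a == lo_b
    if lo_a == hi_a:                           # point against open interval
        return lo_b < lo_a < hi_b
    if lo_b == hi_b:
        return lo_a < lo_b < hi_a
    return max(lo_a, lo_b) < min(hi_a, hi_b)   # two open intervals


def additive(all_cells):
    """(Add): A ~ B and A' ~ B' imply A ∪ A' ~ B ∪ B'."""
    pairs = [(A, B) for A in all_cells for B in all_cells if connected(A, B)]
    return all(connected(A | A2, B | B2) for A, B in pairs for A2, B2 in pairs)


ALL = cells(POINTS)
reflexive = all(connected(A, A) for A in ALL)
symmetric = all(connected(A, B) == connected(B, A) for A in ALL for B in ALL)
add_holds = additive(ALL)
# Monotonicity would require {0} ~ {0, 1}, since {0} ~ {0} and {0} ⊆ {0, 1}.
monotone_counterexample = connected({0}, {0}) and not connected({0}, {0, 1})
```

The final line reproduces the counterexample of the Remark: an endpoint is connected with itself but not with the interval it bounds.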

Given a cell geometry (X, ~) we define a region to be an (ortho)closed set 
of cells. The cell geometry is here being viewed as a tolerance space. A simple 
example may illustrate the idea. Let X be some subset of intersection points 
(vertices) of a rectangular grid in R?, such as the set of nine vertices named in 
Fig. 12.10. In this case we have $2^9 - 1$ “cells”, according to Definition 12.54.
We adopt our standard interpretation of ~, namely that the connection of two 
cells means the overlapping of their interiors. (N.B. In this context, “interior” 
is always taken in the sense of Definition 12.49, or equivalently the relative 
interior of conventional convex analysis.) In denoting cells, we shall often 
omit set brackets and commas. Thus we have, for example, ag ~ bfh, and 
c & bdh. 

Let S be the set of “cells” 


ab, abcd, b, bc, efh, e, bi; 


see Fig. 12.10. 





Figure 12.10. S: ab; abcd; b; bc; efh; e; bi. 


In terms of the Euclidean model, S determines the region ⋃S, and any cell C
can be considered as a part of this region if it is covered (with respect to interiors)
by ⋃S. In terms of our formal cell geometry, however, we have to calculate
the orthoclosure J(S) (in the notation of Sec. 4). Recall that J(S) = $S^{\perp\perp}$, and
that we also have


A ∈ J(S) ⇔ N(A) ⊆ N(S).


Thus the region J(S) determined by S is:
S ∪ {ac, bd, abc, bcd, cda, dab, ai}.


Notice, e.g., that the covering of the cell ai by {ab, b, bi}, and hence by S, is
captured by the fact that an arbitrary cell C is connected with ai only if it is
connected with at least one of ab, b, bi. Also, although a necessary condition
for A ∈ J(S) is that A ~ B for some B ∈ S, this condition is not sufficient.
For example, eg ∉ J(S) since (although eg ~ efh) we have fgh ~ eg but not
fgh ~ efh.


We define the sum of regions (= join as in Prenowitz’ system of geometry 
(Prenowitz and Jantosciak, 1979); cf. also Webster, 1995, for a concise account 
of Prenowitz’ join geometry) by: 


R + S = J({A ∪ B | A ∈ R, B ∈ S}).


This is, in effect, the region which lies between (any parts of) R and S. This 
operation provides us, in particular, with a simple definition of convexity: 


DEFINITION 12.55 The region R is convex if R + R ⊆ R.


Rather than pursue the theory of convex regions, which involves some sub- 
tleties, we shall in a moment consider a notion of convex point-set, which is a 
little easier to develop (and to compare with existing work). 


Note on terminology. We prefer not to use Prenowitz’ term “join” for the 
operation +, since we do also need the join operation in the lattice of regions 
(and this operation is emphatically not the same as +). On the other hand 
there is a case for calling it “product”, especially in view of the quantale aspect 
of our structures (see the remarks at the end of Sec. 7.3). We finally opt for 
“sum”, however, for greater compatibility with the standard notation of oriented 
matroids, which will be important later on. If one is thinking of A, B as subsets 
of a Euclidean space, or more generally of a vector space over an ordered field, 
A+ B resembles an average of the two sets rather than their vector sum. Better 
still, it may be thought of as the set of weighted sums of elements of A, B, 
respectively, where the weights are positive scalars with arithmetic sum 1. 

We now introduce the first of two axioms concerning points. (Incidentally, 
it is because of the need for these axioms, or something close to them, that it 
is difficult to envisage an entirely “point-free” geometry.) Geometrically, the 
effect of this axiom is to ensure that connection of cells means connection of 
their relative interiors. Note that we often present finite sets via lists; thus,
“A, p” in the following denotes A U {p}. Moreover we have the convention 
that letters A, B,... usually denote finite sets of points, while p, q,... denote 
singletons. 


G1 If A, p ~ B, p then A ~ B or A, p ~ B or A ~ B, p.


Formally, the axiom asserts that an instance of the connection relation, in 
which a point p occurs on both sides, can hold only because a “simpler” instance, 
in which p occurs on at most one side, holds. A diagram illustrating the three 
cases (in the conclusion of the Axiom) follows (Figs. 12.11-12.13). 
Figure 12.11. A ~ B.
Figure 12.12. A, p ~ B.
Figure 12.13. A ~ B, p.


The special case where B (say) is empty is allowed:
(12.2) A, p ~ p ⇒ A ~ p.


A second axiom concerning points says, in its simplest form, that if a point
is common to two regions, then the regions are connected (overlap ⇒
connectedness). The full form of the axiom provides the general context in which the
occurrence of a point may be eliminated:


G2 p, A ~ B; p, A′ ~ B′ ⇒ B, A′ ~ A, B′.


The diagram (Fig. 12.14) should make it plain that this is a form of Pasch’s 
Axiom, some form of which (as was discovered towards the end of the XIXth 
century) is essential for ordered geometry. 
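
Both point axioms can be tested exhaustively in a small model. The sketch below is our own illustration, not part of the formal development: it uses four collinear points with connection read as overlap of relative interiors, and reads the axioms as G1: A, p ~ B, p implies A ~ B or A, p ~ B or A ~ B, p, and G2: p, A ~ B together with p, A′ ~ B′ implies B, A′ ~ A, B′. In G2 we let A and A′ range over finite sets not containing p (possibly empty). The helper names are hypothetical.

```python
from itertools import combinations

POINTS = (0, 1, 2, 3)


def subsets(points, min_size):
    pts = sorted(points)
    return [frozenset(c) for r in range(min_size, len(pts) + 1)
            for c in combinations(pts, r)]


def connected(a, b):
    """A ~ B: relative interiors of the hulls overlap (points on a line)."""
    lo_a, hi_a, lo_b, hi_b = min(a), max(a), min(b), max(b)
    if lo_a == hi_a and lo_b == hi_b:
        return lo_a == lo_b
    if lo_a == hi_a:
        return lo_b < lo_a < hi_b
    if lo_b == hi_b:
        return lo_a < lo_b < hi_a
    return max(lo_a, lo_b) < min(hi_a, hi_b)


def g1_holds(points):
    cells = subsets(points, 1)
    for p in points:
        P = frozenset([p])
        for A in cells:
            for B in cells:
                if connected(A | P, B | P) and not (
                        connected(A, B) or connected(A | P, B)
                        or connected(A, B | P)):
                    return False
    return True


def g2_holds(points):
    cells = subsets(points, 1)
    for p in points:
        P = frozenset([p])
        rest = subsets([x for x in points if x != p], 0)   # A, A' may be empty
        for A in rest:
            for B in cells:
                if not connected(A | P, B):
                    continue
                for A2 in rest:
                    for B2 in cells:
                        if connected(A2 | P, B2) and not connected(B | A2, A | B2):
                            return False
    return True


g1_ok = g1_holds(POINTS)
g2_ok = g2_holds(POINTS)
```

A check of this kind shows only that the axioms are satisfied in one intended model; it says nothing about their independence or about other models.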

The preceding axioms on points are useful in establishing a basic property 
of convexity, as we shall now see. 
Figure 12.14. Ax. G2 (Pasch).


DEFINITION 12.56 A set X of points is convex if, for any point x and cell A:
x ~ A ⊆ X ⇒ x ∈ X.


The collection C of convex sets is obviously closed under intersection.
Moreover the closure system involved is evidently algebraic (finitary), and the empty
set is convex; so we do indeed have a convexity structure, in the sense of abstract
convexity theory. The convex hull operator shall (henceforth) be denoted [·].





PROPOSITION 12.57 For any set X ⊆ G, [X] = {x | ∃A ⊆ X. x ~ A}.





Proof Denote by Y the set {x | ∃A ⊆ X. x ~ A}. Clearly we have X ⊆
Y ⊆ [X]. It remains to show that Y is convex. Suppose that p ~ $s_1 s_2 \ldots s_k$,
where each $s_i$ belongs to Y, that is, is connected with a subset of X. Then by
Axiom G2 we can eliminate $s_1$, obtaining: p ~ $t_1 \ldots t_j s_2 \ldots s_k$. It is clear
that we can in this way successively eliminate $s_1, \ldots, s_k$, obtaining a
quasi-equation of the form p ~ $t_1 \ldots t_n$ ⊆ X, so that p ∈ Y. QED


We are now in a position to revisit the notions considered in the previous subsec- 
tion: the faces and interior of a convex set. In the context of discrete cell-based 
geometry, it is natural to replace the intervals which feature in Definition 12.49 
by cells. We thus arrive at: 


DEFINITION 12.58 Let (X, ~) be a cell geometry, with C ⊆ X convex. Then
an interior point of C is any point p ∈ C such that

∀x ∈ C ∃A ⊆ C. p ~ A, x.

Further, a face of C is any convex subset F of C such that:

∀A ⊆ C. A ~ F ⇒ A ⊆ F.

In order to proceed, we shall temporarily make use of the following:


PROPOSITION 12.59 If p ~ A (⊆ X) and q ~ B (⊆ A), then p ~ A, q.

This trivial-looking assertion is somewhat troublesome to prove directly from
our axioms. Later it will turn out that it is an immediate consequence of an 
alternate axiomatization to be considered in Sec. 9 (vector axioms for oriented 
matroids). Hence we omit the proof for now. We have: 


THEOREM 12.60 Let ($\mathcal{P}_{\mathrm{Fin}}(X)$, ~) be a cell geometry satisfying Axioms
G1, G2. Let C be any convex subset of X, p a point of C, and F a subset
of C. The following are equivalent:


1 F is the least face of C which contains p.
2 F is a face of C and p is an interior point of F.
3 F = ⋃{A ⊆ C | p ~ A}.


Proof Abbreviate ⋃{A ⊆ C | p ~ A} by E. Let B be any finite subset of
E. Choose B′ such that p ~ B′ and B ⊆ B′. Then if a is any point such that
a ~ B, we have by the preceding proposition:


p ~ a, B′.


Thus a ∈ E, showing that E is convex.
Next, suppose that q ∈ E, and that B ⊆ C is such that q ~ B. For some
finite A′ we have p ~ q, A′, and so by Axiom G2:


p ~ B, A′.


Thus B ⊆ E, showing that E is a face. Also, it is immediate from the definition
of a face that E is a subset of any face that contains p. We conclude that E is
the least face containing p, that is, (1) ⇔ (3).

Finally, that p is an interior point of the face F is equivalent to the assertion
that every A such that p ~ A is a subset of F. Hence (2) ⇔ (3). QED


COROLLARY 12.61 Every point p of a convex set C is either an interior point 
of C or else belongs to a proper face of C (but not both). 
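
Theorem 12.60 and Corollary 12.61 can be observed in the same one-dimensional toy model. The sketch below is illustrative and rests on several assumptions of ours: four collinear points with connection as overlap of relative interiors, the face condition of Definition 12.58 read pointwise (q ~ A with q ∈ F and A ⊆ C forces A ⊆ F, which is how the proof above uses it), and hypothetical function names throughout.

```python
from itertools import combinations


def subsets(points, min_size=0):
    pts = sorted(points)
    return [frozenset(c) for r in range(min_size, len(pts) + 1)
            for c in combinations(pts, r)]


def connected(a, b):
    """A ~ B: relative interiors of the hulls overlap (points on a line)."""
    lo_a, hi_a, lo_b, hi_b = min(a), max(a), min(b), max(b)
    if lo_a == hi_a and lo_b == hi_b:
        return lo_a == lo_b
    if lo_a == hi_a:
        return lo_b < lo_a < hi_b
    if lo_b == hi_b:
        return lo_a < lo_b < hi_a
    return max(lo_a, lo_b) < min(hi_a, hi_b)


def is_convex(S, X):
    """Definition 12.56: x ~ A with A a cell inside S forces x into S."""
    return all(x in S for x in X for A in subsets(S, 1) if connected({x}, A))


def interior_points(C):
    """Definition 12.58: every x in C has some A ⊆ C with p ~ A, x."""
    return {p for p in C
            if all(any(connected({p}, A | {x}) for A in subsets(C)) for x in C)}


def is_face(F, C, X):
    """Pointwise reading of the face condition (an assumption, see lead-in)."""
    return is_convex(F, X) and all(
        A <= F for q in F for A in subsets(C, 1) if connected({q}, A))


def least_face_formula(p, C):
    """Clause (3) of Theorem 12.60: the union of all cells A ⊆ C with p ~ A."""
    out = set()
    for A in subsets(C, 1):
        if connected({p}, A):
            out |= A
    return frozenset(out)


X = frozenset([0, 1, 2, 3])
C = X                                        # the whole line segment is convex
FACES = [F for F in subsets(C) if is_face(F, C, X)]
INTERIOR = interior_points(C)
```

In this model the faces of C are the empty set, the two end points and C itself, and the two middle points are interior. Every point is therefore either interior or in a proper face, but not both, as the Corollary states.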


7.3 Region Geometries and Oriented Matroids 


In the remainder of the section, we examine some connections between our 
version of region geometry and two well-established areas of research. The first 
area we consider is that of oriented matroids. It turns out that the connection 
between cell geometries (developed in a certain way) and oriented matroids is 
very close indeed: so much so that detailed work in cell geometry is to some 
extent redundant, as covered by matroid theory. 
To arrive at the suggested development of cell geometry (thereby bringing
matroids into the picture), we may consider the algebraic structure induced 
by our “sum” of regions. So far we have not even shown that this sum is 
associative (it is evidently commutative). This will shortly be remedied. Let 
us for the moment consider the possible introduction of a neutral element and 
(quasi-)inverses into our putative semigroup of regions. It is easy enough to 
propose to extend the set of points with a special “zero point” e, with the 
stipulation: 

A, e ~ B ⇔ A ~ B.


The only problem with this is that we have not, so far, allowed our quasi- 
equations to have one side empty, so that we are left without a meaning for: 


(12.3) e ~ A.


We can find models in which such formulas have a meaning by looking at 
spherical geometry. “Ordinary” points lie on the surface of a sphere S, while e 
may be thought of as the centre of S. Then (12.3) has, as usual, the meaning that 
e lies in the relative interior of A. Clearly, this includes the case that A consists 
of two antipodal points. This brings us to the interpretation of “inverses” (or 
involutes): we consider —p to be the point antipodal to p. The rule for the 
inverse is that any point may be moved to the other side of a quasi-equation, 
with change of “sign”: 


p, A ~ B ⇒ A ~ B, −p.


Of course, we take −(−p) = p, and −e = e. It is easy to justify this rule by
vector calculations, if desired (see below, Sec. 9). 

The use of “signed” geometry often permits topics to be treated in a simpler 
and more uniform manner than is possible in “flat” geometry. In part this is a 
result of the following: 


LEMMA 12.62 Let S be a set of cells, and suppose that A ∈ J(S). Then, for
any cells U, V, if U ~ A ∪ V then ∃W ∈ S. U ~ W, V.


Proof If U ~ A ∪ V, then U, −V ~ A; since A ∈ J(S), U, −V ~ W for
some W ∈ S. This gives the result. QED


Illustrating the use of this, we have: 


PROPOSITION 12.63 Sum of regions is associative. 


Proof Let R, S, T be regions, and U a cell. By definition, U ∈ (R + S) + T
iff every cell W connected with U is connected with V ∪ C for some V ∈
(R + S), C ∈ T. By the preceding Lemma, this condition amounts to the
assertion that every such W is connected with some cell of the form A ∪ B ∪ C,
where A ∈ R, B ∈ S, C ∈ T. The analysis of “U ∈ R + (S + T)” is similar.


QED 





This pattern of argument could be replicated in flat geometry only if we were, 
in effect, to build the above Lemma into the definition of “region”. That is, we 
would have to define a region to be a set S of cells such that, if A is a cell such 
that, for any cells U, V 


A, U ~ V ⟹ W, U ~ V for some W ∈ S,


then A ∈ S. Spherical geometry, on the other hand, lets us retain the simple
view that a region is an orthoclosed subset of the connection graph. 

We now come to a very significant feature of the involutory (or “spherical”) 
calculus. The freedom we have, via the involution, to move terms to one side of 
a quasi-equation means that, analogously with classical sequent logic, we have 
the option of formulating a geometry as a calculus of sets of points, rather than 
of pairs of sets. The sets are called “surrounding sets”, as they may be thought 
of as surrounding the centre e of the sphere. This will be spelled out in detail 
in the second part of the chapter. We write −A for {−a | a ∈ A}. A quasi-equation
A ~ B then becomes the statement that A, −B is a surrounding set.
Reflexivity of ~ thus states that A, −A is always a surrounding set. Symmetry
of ~ says that whenever A ∪ −B is surrounding, then so is B ∪ −A (which is
−(A ∪ −B)). Put more simply: if C is surrounding, then so is −C. These are
precisely Axioms S1,S2 for surrounding sets (Sec. 9). The additivity condition 
of Definition 12.54 translates into the statement that the union of surrounding 
sets is surrounding: that is, Axiom S3. Next, the cell geometry axiom G2 says 
that if p, A, −B and p, A′, −B′ are surrounding, then so is B, A′, −A, −B′.
If we write S, T for p, A, −B and p, A′, −B′, respectively, and apply negation
(inverse) to the result set, this becomes:


S, T surrounding ⟹ (S \ p) ∪ (−T \ −p) surrounding.


That is, we have Axiom S4. We leave to the reader the task of translating cell 
geometry axiom G1, and checking that it corresponds to S5 of Sec. 9. 


Remarks. (1) As an alternative to the “spherical” interpretation of the invo- 
lutory geometry, we may choose to consider this calculus as a purely formal, 
conservative extension of the “flat” calculus. 

(2) It is not difficult to show that the lattice of regions, taken together with
the sum (now as “product”), gives us a quantale. A similar observation was
made in Smyth, 2000. The setting we have now is somewhat different from 
that of Smyth, 2000, but it is significant that we still get a quantale of regions. 

These remarks will be elaborated on another occasion.


7.4 The Ortholattice of Regions


Before concluding this brief account of region theory, it will be instructive 
to look at what regions amount to, concretely, in the Euclidean case. Can we 
identify regions with some particularly simple arrangements of cells in, say, 
the case of a “cell geometry” generated by a finite set of points in R²? In case
of possible confusion, it should be emphasized that a cell geometry is by no
means the same thing as a cell complex, in R² or otherwise. In a cell complex
the cells, as well as covering the space, must be pairwise unconnected (and also 
have their faces matched properly). Let us speak of a cell partition in the case 
that the cells form a pairwise orthogonal cover (regardless of whether the faces 
match up); see Fig. 12.15. 














Figure 12.15. A cell partition. 


One use of cell partitions is to provide us with a useful substitute for the notion
that connected cells (or regions) should have points in common. Namely, we can
require that, for any given cell partition Π, connected cells should be connected
with a common cell of Π. Let us call this the “cell partition principle”. This
obviously holds in the Euclidean case. How about cell geometries in general?
Notice that, in a geometry (X, ~), cell partitions can be characterized very
simply: they are just maximal cliques in the orthogonality graph of X. (In
particular, the sum plays no part.) Then we have the following characterization
in terms of Dacey graphs (Definition 12.28):


PROPOSITION 12.64 A geometry ℒ satisfies the cell partition principle if and
only if the orthogonality graph of ℒ is Dacey.


Proof Suppose that Π is a cell partition, that is, a maximal clique in (ℒ, ⊥).
Assume that A ~ B. Then A, B fail to be connected with a common cell of Π
iff the sets A⊥, B⊥ form a partition of Π, which cannot happen if the Dacey
condition holds. The partition principle is in effect the contrapositive of the
Dacey condition. QED


Having seen the geometrical significance of the Dacey property, it is natural 
to ask about the stronger condition that we studied in Sec. 4.2, namely that 
OL(G) (in the present context, the lattice of regions) is orthomodular. Suppose 
that we have a finite (or at worst, locally finite) set N of points in Euclidean
space, which for definiteness we take as R² (nothing essential depends on this
restriction). We may consider the elements of the geometry, in other words
the vertices of the connection graph (ℒ, ~), to be the open cells with vertices
belonging to N. As a point set, on the other hand, our space X is the convex
hull of N. We seek a concrete description of the regions. Note first that any
pairwise disjoint collection K of cells determines a region. More precisely, the
set S of cells covered by K is a region. For suppose that B is a cell not covered
by K. Let p be a point such that p ∈ B − |K|, where |K| is the set ⋃K of
points belonging to cells of K. Suppose without loss of generality that the set
of vertices of B is a subset of |K| (see Fig. 12.16).





Figure 12.16. C meets the segment B but not the region K. 


Then we can find two faces, say F, F′, of cells in K such that the cells
pF, pF′ do not meet K. Using vertices of F, F′, we obtain a cell (such as C in
the diagram) which meets B but does not meet K. This shows that B ∉ J(K).

We observe that any region of the form J(K) (K pairwise disjoint) has as 
orthocomplement a region of the same kind. For we can easily “fill up” the set 
X − |K| with pairwise disjoint cells (in effect extending K to a cell partition
of X). Indeed it is not difficult to see, via an argument similar to that in the
preceding paragraph, that we can fill up the set X − |J(K)| (the union of the set
of cells that are disjoint from every cell of K) with pairwise disjoint cells, for 
every set K of cells. We conclude that any region is representable as a disjoint 
union of cells. 

A collection of pairwise disjoint cells is, of course, the same as a clique in 
the orthogonality space (ℒ, ⊥). Thus we conclude from the preceding theorem,
together with Theorem 12.29 and Proposition 12.30 (Sec. 5.1), that the lattice 
of regions is orthomodular. 

The conclusion we have just reached depends on the assumption that we 
have a geometry realized in Euclidean space. Now we have to confess that 
we do not know whether it holds for geometries in general. It is hoped that
further investigation will resolve this question. If it does not hold in general, we
would be inclined to restrict attention to those geometries whose region lattice 
is orthomodular, so important and useful is this property. 

To illustrate our claim about the significance of orthomodularity, we may 
look at triangulations. Notice that if we restrict attention to the regions which 
can be obtained as joins of cells belonging to a clique K of (ℒ, ⊥), we obtain
a Boolean algebra (isomorphic to P(K)). Under what conditions do we have 
a maximal Boolean algebra of this kind? Clearly, the clique K should be 
maximal. But this is not enough. We must require that the cells of K cannot 
themselves be decomposed into disjoint unions of smaller cells. Every cell 
can be decomposed into simplexes; so the cells must surely be simplexes. 
Moreover these simplexes must be atomic: concretely, a simplex C is atomic 
if no points of X lie in any face of C (other than the vertices of C). Now 
a decomposition of the space into non-overlapping atomic simplexes is what is
usually called a triangulation. At the same time, a maximal Boolean subalgebra 
of an orthomodular lattice is generally called a block. Thus we conclude that 
central concepts of computational geometry and of orthomodular theory are 
essentially the same: 


triangulation = block. 


Unfortunately we must leave this topic here. 

As we have seen, several authors have drawn attention to the difficulties in- 
volved in the attempt to develop a discrete region theory. To the previously cited 
authors we should add Pratt and Lemon, 1997, where it is argued that, under 
quite minimal conditions, a discrete theory of polygonal regions is impossible. 
We avoid the problems and difficulties indicated by these authors, because we 
abandon distributivity of the lattice of regions. But do we thereby give up too 
much? Can we really work with a non-distributive “logic” of regions? 

Our answer is that, although the overall lattice is non-distributive, there may 
be, as we have seen, ample distributivity in particular contexts. Orthomodu- 
lar lattices are very rich structures. Such a lattice contains, in general, many 
Boolean algebras. (The patching together of orthomodular lattices and posets
from Boolean algebras is a major theme in orthomodular theory; see Kalmbach,
1983.) Also it should not be forgotten that the regions are organized, not merely
into a lattice, but into a quantale. As with any quantale, we have thereby the 
distributivity of the lattice join over the product. 


8. Matroids 


The rest of this chapter is a more conventional exposition of finite geometry, 
particularly finite Euclidean geometry, by which here is meant the theory of 
oriented matroids. The treatment of oriented matroids here is unconventional 
in that more attention is given to spherical rather than flat structures. The reason
for this, as also discussed in the previous section, is that finite spherical models
of space are more naturally algebraic than finite flat ones. Sphericity is one 
of the main themes in this chapter, and possible lines of future research in the 
development of finite algebraic models of space will be discussed in the final 
section. 

Oriented matroids are matroids with additional structure, so we begin with 
the basics of matroid theory. Of course only a tiny fraction of the theory can be 
discussed here, and for a far more comprehensive introduction to the subject 
we refer to Oxley, 1992b. 

Matroids admit many axiomatizations and the following are some of the most 
important. Let E be a finite set: 


Independent set axioms A matroid on E is a set of its subsets, called
independent sets, satisfying:


(I1) ∅ is independent;
(I2) all subsets of an independent set are independent;


(I3) if I, J are independent and |I| < |J| then there exists some x ∈ J\I such
that I ∪ x is independent.


The submatroid on any X ⊆ E is the set of all independent subsets of X,
which clearly is again a matroid. A set is dependent if it is not independent. 
A basis is any maximal independent set, that is, any independent set that is 
not a proper subset of any other independent set. A basis of X is a basis of 
its submatroid. Clearly, the independent sets are precisely those sets that are 
subsets of bases, in terms of which the above axioms may then be translated. 
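
As a hedged illustration, the independent set axioms can be checked by brute force on small examples. This is only a sketch: the function names `is_independence_system` and `bases`, and the two example families `U23` and `bad`, are assumptions introduced here and do not come from the text.

```python
from itertools import combinations

def powerset(ground):
    """All subsets of `ground`, as frozensets."""
    items = list(ground)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def is_independence_system(family):
    """Check axioms (I1)-(I3) for a family of frozensets."""
    family = set(family)
    if frozenset() not in family:                        # (I1)
        return False
    for s in family:                                     # (I2)
        if any(sub not in family for sub in powerset(s)):
            return False
    for i in family:                                     # (I3)
        for j in family:
            if len(i) < len(j) and not any(i | {x} in family for x in j - i):
                return False
    return True

def bases(family):
    """Maximal members of the family (no proper superset in the family)."""
    family = set(family)
    return {s for s in family if not any(s < t for t in family)}

# Hypothetical example: all subsets of {a, b, c} with at most two elements.
U23 = {s for s in powerset("abc") if len(s) <= 2}

# Hypothetical non-example: {a} and {b, c} are both maximal, so (I3) fails.
bad = {frozenset(), frozenset("a"), frozenset("b"), frozenset("c"),
       frozenset("bc")}

print(is_independence_system(U23), is_independence_system(bad))
print(sorted("".join(sorted(b)) for b in bases(U23)))
```

In the first family every basis has two elements, in line with the remark that independent sets are exactly the subsets of bases.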


Basis axioms A matroid on E is a set of its subsets, called bases, satisfying:
(B1) there is at least one basis;


(B2) if B, C are bases and x ∈ B\C, there exists some y ∈ C\B such that
(B\x) ∪ y is a basis.


Proving that the independent set axioms imply the basis axioms is simple. 
Proving the converse is tricky, and involves, as with proving equivalence with 
some of the axioms below, hitting upon the right induction argument; see Oxley, 
1992b for the full proofs of axiom equivalence. 

A circuit is any minimal dependent set, that is, any dependent set each of 
whose proper subsets is independent. The independent sets are then precisely 
those sets that do not contain a circuit. 


Circuit axioms A matroid on E is a set of its subsets, called circuits, satisfying:
(C1) all circuits are non-empty;
(C2) no circuit is a proper subset of any other circuit;
(C3) if B ≠ C are circuits and x ∈ B ∩ C, then (B ∪ C)\x contains a circuit.
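
The passage from independent sets to circuits, and the circuit axioms themselves, can be sketched in code as follows. The helper names and the example (subsets of a four-element set of size at most two) are assumptions made for illustration only.

```python
from itertools import combinations

def powerset(ground):
    """All subsets of `ground`, as frozensets."""
    items = list(ground)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def circuits(ground, independent):
    """Minimal dependent subsets of `ground`."""
    dependent = [s for s in powerset(ground) if s not in independent]
    return {s for s in dependent if not any(t < s for t in dependent)}

def satisfies_circuit_axioms(circs):
    """Brute-force check of (C1)-(C3)."""
    c1 = all(len(c) > 0 for c in circs)
    c2 = not any(b < c for b in circs for c in circs)
    c3 = all(any(d <= (b | c) - {x} for d in circs)
             for b in circs for c in circs if b != c
             for x in b & c)
    return c1 and c2 and c3

# Hypothetical example: independent sets are the subsets of size <= 2.
ground = "abcd"
independent = {s for s in powerset(ground) if len(s) <= 2}
circs = circuits(ground, independent)

print(sorted("".join(sorted(c)) for c in circs))
print(satisfies_circuit_axioms(circs))
```

Here the circuits are exactly the three-element subsets, and eliminating a common element from two of them leaves the third three-element subset they jointly cover.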


It is almost immediate from the independent set axioms that any two bases of 
a matroid have the same cardinality, and this is quite fundamental as it means 
that the dimension of a matroid can be defined. The rank of a matroid is 
the cardinality of any of its bases, and the rank r(X) of X is the rank of its 
submatroid. It is immediate that a set is independent if and only if its rank is 
equal to its cardinality. 


Rank axioms A matroid on E is a function r : 2^E → ℕ such that:
(R1) 0 ≤ r(X) ≤ |X|;
(R2) if X ⊆ Y then r(X) ≤ r(Y);
(R3) r(X ∪ Y) + r(X ∩ Y) ≤ r(X) + r(Y).


A set X is a flat if it is maximal with respect to its rank, that is, if X ⊂ Y
implies r(X) < r(Y). It is immediate that the ground set of a matroid is a flat.
Moreover, the intersection of any two flats is again a flat. This means that
the closure ⟨X⟩ of a set X can be defined as the smallest flat that contains it,
which is the intersection of all the flats that contain it. From the definition it is
immediate that r(⟨X⟩) = r(X), and then a set I is independent if and only if
x ∉ ⟨I\x⟩ for all x ∈ I.
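
Rank, closure and flats can be computed directly from a list of independent sets. The sketch below is illustrative only: it computes the closure through the rank characterization stated in exercise (3c) of this section rather than as an intersection of flats, and the three-element example (with a and b parallel) is an assumption introduced here.

```python
from itertools import combinations

def powerset(ground):
    """All subsets of `ground`, as frozensets."""
    items = list(ground)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def rank(independent, x):
    """Size of a largest independent subset of x."""
    x = frozenset(x)
    return max(len(s) for s in independent if s <= x)

def closure(ground, independent, x):
    """Elements whose addition to x does not raise the rank."""
    x = frozenset(x)
    r = rank(independent, x)
    return frozenset(e for e in ground if rank(independent, x | {e}) == r)

def flats(ground, independent):
    """Sets equal to their own closure."""
    return {s for s in powerset(ground)
            if closure(ground, independent, s) == s}

# Hypothetical example on {a, b, c}: a and b are "parallel" (ab is dependent).
ground = frozenset("abc")
independent = {frozenset(s) for s in ["", "a", "b", "c", "ac", "bc"]}

all_flats = flats(ground, independent)
submodular = all(
    rank(independent, x | y) + rank(independent, x & y)
    <= rank(independent, x) + rank(independent, y)
    for x in powerset(ground) for y in powerset(ground))

print(sorted("".join(sorted(f)) for f in all_flats), submodular)
```

In this example the flats are the empty set, {a, b}, {c} and the ground set, and they are closed under intersection as the text asserts.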


Closure axioms A matroid on E is an operator ⟨ ⟩ that is increasing, monotonic
and idempotent, and satisfies the Exchange Axiom:


(E) if y ∈ ⟨x ∪ X⟩\⟨X⟩ then x ∈ ⟨y ∪ X⟩.


Infinite matroids are defined simply by dropping from the closure axioms 
the requirement that the ground sets be finite; see Oxley, 1992a. Examples 
include all vector spaces, all projective spaces and all infinite undirected graphs. 
Moreover, the closure operator in each of these spaces is algebraic: the closure 
of a set is the union of the closures of its finite subsets. Despite the importance 
of these examples, the theory of infinite matroids is not as rich as that of finite 
matroids, which is the main subject of discussion here. 


8.1 Examples 


Matroids may be thought of as combinatorial geometries: straight lines are 
flats of rank two, planes are flats of rank three, ..., hyperplanes are flats of rank 
one less than the rank of the whole matroid. The natural geometry in each of 
the following examples is a matroid geometry.


8.1.1 Matroids from vector spaces. Whitney’s original paper
(Whitney, 1935) introducing matroids was entitled “On the abstract proper- 
ties of linear independence”, and independent sets in matroids were meant to 
be abstract, combinatorial versions of linear independent sets in vector spaces. 
For basic vector space definitions and theory, see any standard text such as Mac 
Lane and Birkhoff, 1967. 

Let E be any finite subset of any vector space V, and let the independent 
sets be those subsets of E that are linear independent in V. That this is a 
matroid, called the matroid of linear dependences on E, follows from basic 
vector space dimension theory. It may be checked that the flats in the matroid 
are the intersections with E of the subspaces of V, that the rank of any X is its 
linear dimension, and that ⟨X⟩ is the intersection with E of the vector subspace
span(X). An example is shown in Fig. 12.17.

The construction can be generalized to any finite multiset E of elements of 
V by letting the independent sets be those subsets of E not containing repeated 
elements that are linear independent. Multisets are likely to be encountered 
when working with matrices, from which the word matroid was coined. 




Figure 12.17. The linear independent sets in abcd are b, c, d, bd, cd. 


The construction does not necessarily give a matroid in more general modules 
over rings (rather than over fields). In a module over a ring, it can happen that 
two maximal independent sets have different cardinalities, making dimension 
difficult to work with. This cannot happen in a vector space, and one of the 
main advantages of vector spaces over modules over rings is that they have a 
better theory of dimension. 

A second matroid, of affine dependences, on E is obtained by taking the 
independent sets to be those of its subsets that are affine independent in V. 
This construction is not essentially different to the above, however, as a set 
x₁, ..., xₖ is affine dependent in V (= Fⁿ) if and only if (x₁, 1), ..., (xₖ, 1) is
linear dependent in the vector space Fⁿ⁺¹.
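
Both constructions can be sketched with exact rational arithmetic. The coordinates below are hypothetical: they are chosen only so that a is the origin, b and c lie on a common line through a, and d lies off that line; the actual positions in Fig. 12.17 are not recoverable from the text. The function names are likewise assumptions.

```python
from fractions import Fraction
from itertools import combinations

def matrix_rank(rows):
    """Rank of a list of equal-length vectors, by exact Gaussian elimination."""
    m = [[Fraction(v) for v in row] for row in rows]
    rank = 0
    ncols = len(m[0]) if m else 0
    for col in range(ncols):
        pivot = next((r for r in range(rank, len(m)) if m[r][col] != 0), None)
        if pivot is None:
            continue
        m[rank], m[pivot] = m[pivot], m[rank]
        for r in range(rank + 1, len(m)):
            factor = m[r][col] / m[rank][col]
            m[r] = [a - factor * b for a, b in zip(m[r], m[rank])]
        rank += 1
    return rank

def linear_independent_sets(points):
    """points: dict label -> vector. Returns the linear independent label sets."""
    labels = list(points)
    result = set()
    for r in range(len(labels) + 1):
        for combo in combinations(labels, r):
            if matrix_rank([points[l] for l in combo]) == r:
                result.add(frozenset(combo))
    return result

def affine_independent_sets(points):
    """Lift each point x to (x, 1) and test linear independence of the lifts."""
    lifted = {label: tuple(v) + (1,) for label, v in points.items()}
    return linear_independent_sets(lifted)

# Hypothetical coordinates in the rational plane (see the lead-in).
points = {"a": (0, 0), "b": (1, 0), "c": (2, 0), "d": (0, 1)}

lin = linear_independent_sets(points)
aff = affine_independent_sets(points)
print(sorted("".join(sorted(s)) for s in lin))
print(sorted("".join(sorted(s)) for s in aff))
```

With these coordinates the non-empty linear independent sets are b, c, d, bd, cd, and the affine dependent sets are exactly abc and abcd.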

(The affine independent sets in Fig. 12.17 are: a, b, c, d, ab, ac, ad, bc, bd,
cd, abd, acd, bcd.)


8.1.2 Matroids from graphs. Graphs have a natural geometry. In any
undirected graph, which may have loops and multiple edges, a set of edges is a
circuit if it is non-empty and can be ordered (e₀, ..., eₖ) such that there exist
vertices v₀, ..., vₖ, all distinct, such that each eᵢ is incident to vᵢ and vᵢ₊₁
(indices considered mod k + 1). By this definition, all loops (k = 0) are circuits.
The matroid derived from the graph has as its ground set the set of edges, and 
its circuits are the circuits in the graph. It is a simple exercise to show that this 
is a matroid. An example is shown in Fig. 12.18. 


Figure 12.18. An undirected graph, with circuits ab, acd, bcd, f.


A set of edges is a forest if it contains no circuit. A forest is spanning if
every vertex is incident to at least one of its edges. The forests are then the 
independent sets of the graph matroid, and the spanning forests are the bases. 
With regard to duality (to be discussed shortly), the cocircuits of the graph 
matroid are the minimal cut-sets of the graph. 
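
The graph matroid can be explored with a union-find test for forests. The graph below is hypothetical: it is one graph whose circuits are ab, acd, bcd and f (two parallel edges a, b, a triangle through c and d, a pendant edge e and a loop f), and the function names are assumptions made for this sketch.

```python
from itertools import combinations

def is_forest(edges):
    """True if the list of edges (pairs of vertices) contains no cycle."""
    parent = {}

    def find(v):
        parent.setdefault(v, v)
        while parent[v] != v:
            parent[v] = parent[parent[v]]   # path halving
            v = parent[v]
        return v

    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:                        # u and v already connected
            return False
        parent[ru] = rv
    return True

def graph_circuits(graph):
    """Minimal edge sets that are not forests. graph: dict label -> (u, v)."""
    labels = sorted(graph)
    dependent = [frozenset(c) for r in range(1, len(labels) + 1)
                 for c in combinations(labels, r)
                 if not is_forest([graph[e] for e in c])]
    return {s for s in dependent if not any(t < s for t in dependent)}

# Hypothetical graph: a, b parallel; c, d close a triangle; e pendant; f a loop.
graph = {"a": (1, 2), "b": (1, 2), "c": (2, 3), "d": (3, 1),
         "e": (3, 4), "f": (4, 4)}

print(sorted("".join(sorted(c)) for c in graph_circuits(graph)))
```

The pendant edge e lies in no circuit, so it belongs to every maximal forest of this graph.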


8.2 Projective spaces 


A projective space is a set together with a collection of its subsets, called 
lines, such that: 


(P1) any two distinct points are contained in exactly one line; 
(P2) there is more than one line; 


(P3) for any distinct points p, q,r, s, t, if there is a line containing p, q,r and 
a line containing p, s,t, then the line containing q, s intersects the line 
containing r, t. 


A subspace is any set that for each two of its points contains the straight line 
passing through them. The set of subspaces is the set of flats of a matroid (the 
infinite case included). Moreover, the projective spaces are then precisely those 
matroids whose rank function is modular: r(X ∪ Y) + r(X ∩ Y) = r(X) + r(Y).
(The rank axiom (R3) is the requirement that the rank function in a matroid
be sub-modular.) For the details, and a full-blown treatment of projective
geometry in terms of matroids, see Faure and Frölicher, 2000.


8.3 Matroid duality


Arguably the single most important and fundamental aspect of matroid theory 
is that every matroid has another, dual matroid on its ground set. 


THEOREM 12.65 The collection of complements of the bases of a matroid is 
itself the collection of bases of a matroid. 


Proof The basis exchange axiom (B2) implies (and is in fact equivalent to): 


(B2*) if B, C are bases of a matroid and x ∈ C\B then there is y ∈ B\C
such that (B\y) ∪ x is a basis.


This is not obtained from the exchange axiom simply by relabelling, but by:
B ∪ x is dependent so contains a circuit X, C is independent so X contains
some y ∈ B\C, and then (B\y) ∪ x does not contain a circuit, so is independent.

Now let B, C be bases and x ∈ (E\B)\(E\C) = C\B. There is some y ∈
B\C = (E\C)\(E\B) such that (B\y) ∪ x is a basis. Finally, E\((B\y) ∪
x) = ((E\B)\x) ∪ y. QED


Complements of bases are called cobases, and the matroid they give is called 
the dual matroid of the original matroid. Obviously, every matroid is the dual 
of its dual. Circuits, flats, etc., of the dual matroid are called cocircuits, coflats, 
etc., of the original matroid. Matroid duality is a spatial property that is revealed
at the finite level. Oxley, 1992b remarks that “It [matroid duality] massively 
increases the weapons at one’s disposal in attacking any matroid problem. The 
theory of infinite matroids provides ample evidence of the vital role that duality 
plays in matroid theory. In the infinite context, there is no duality theory having 
the richness of the theory in the finite context.” 

The following result is an important geometric aspect of matroid duality, 
the significance of which becomes more apparent in the context of oriented 
matroids. 


PROPOSITION 12.66 The cocircuits of a matroid are the complements of its 
hyperplanes. 


Proof A subset of a matroid is spanning if its closure is the whole ground set, 
that is, if it contains a basis. Then a set is spanning if and only if its complement 
is coindependent. Then: H is a hyperplane if and only if H is not spanning
but, for all x ∉ H, H ∪ x is spanning, if and only if E\H is codependent
but, for all x ∈ E\H, (E\H)\x is coindependent. QED
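
Theorem 12.65 and Proposition 12.66 can be checked by brute force on a small case. The following is a sketch under stated assumptions: the function names are invented here, and the example matroid on {a, b, c}, with bases ac and bc, is hypothetical.

```python
from itertools import combinations

def powerset(ground):
    """All subsets of `ground`, as frozensets."""
    items = list(ground)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

def satisfies_basis_axioms(bases):
    """Brute-force check of (B1) and (B2)."""
    bases = set(bases)
    if not bases:
        return False
    return all(any((b - {x}) | {y} in bases for y in c - b)
               for b in bases for c in bases for x in b - c)

def dual(ground, bases):
    """Complements of the bases."""
    ground = frozenset(ground)
    return {ground - b for b in bases}

def circuits_from_bases(ground, bases):
    """Minimal sets contained in no basis."""
    dependent = [s for s in powerset(ground) if not any(s <= b for b in bases)]
    return {s for s in dependent if not any(t < s for t in dependent)}

def hyperplanes(ground, bases):
    """Maximal sets that contain no basis (maximal non-spanning sets)."""
    non_spanning = [s for s in powerset(ground)
                    if not any(b <= s for b in bases)]
    return {s for s in non_spanning if not any(s < t for t in non_spanning)}

# Hypothetical example: a and b are parallel, c lies in every basis.
ground = frozenset("abc")
bases = {frozenset("ac"), frozenset("bc")}

cobases = dual(ground, bases)
cocircuits = circuits_from_bases(ground, cobases)
hyper = hyperplanes(ground, bases)

print(sorted("".join(sorted(s)) for s in cobases))
print(sorted("".join(sorted(s)) for s in cocircuits))
print(sorted("".join(sorted(s)) for s in hyper))
```

In this example the hyperplanes are {a, b} and {c}, and their complements {c} and {a, b} are exactly the cocircuits.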


The following exercises in basic matroid theory conclude this brief
introduction:


(1) Uniform matroids. On a set of cardinality n, for any m ≤ n let the independent
sets be those subsets of cardinality ≤ m. Verify that this is a matroid. Find
the bases, circuits and flats of this matroid. Show that the dual of a uniform 
matroid is again uniform. 


(2) Partition matroids. With respect to any equivalence relation on a finite set, 
say that a set is independent if no two of its elements are equivalent. Verify that 
this is a matroid. Find the bases, circuits and flats of this matroid. Show that 
the cocircuits are the equivalence classes. 


(3) Flats and closure. Let I be independent and let A = {x | I ∪ x is dependent}.
Using only the independent set axioms, prove that:


(a) r(I ∪ A) = r(I);

(b) I ∪ A = ⟨I⟩;

(c) for any set X, ⟨X⟩ = {x | r(X) = r(X ∪ x)};
(d) the intersection of two flats is again a flat.


(4) Matchings. A matching in an undirected graph is any set of edges max- 
imal with respect to the property that no two edges have a common vertex. 
Let {X,Y} be a bipartition of a bipartite graph. Say that a set of edges is 
X-independent if no two of its edges have a common vertex in X, and is 
Y -independent if no two have a common vertex in Y. Show that: 


(a) the X-independent sets are the independent sets of a matroid, as are the 
Y -independent sets; 


(b) a matching is any set of edges maximal with respect to being both
X- and Y -independent; 


(c) the collection of sets that are both X- and Y -independent is not in general 
the collection of independent sets of a matroid. 


(5) The lattice of flats is graded. Prove that the intersection of two flats is again 
a flat, and so that the collection of flats ordered by subset inclusion is a lattice. 
Prove that this lattice is graded, that is, that all its maximal flags (chains) are 
of the same length. (Hint: consider sequences I₁ ⊂ ··· ⊂ Iₖ of independent
sets.) 


(6) Matroids and diagram geometry. Let F ⊂ G be flats of a matroid such
that r(G) = r(F) + 3. Show that for any distinct flats X₁, X₂ such that
F ⊂ X₁, X₂ ⊂ G and r(X₁) = r(X₂) = r(F) + 1, there exists a unique flat
Y such that X₁, X₂ ⊂ Y ⊂ G. A linear geometry is a collection of points and
straight lines that satisfies Axioms (P1), (P2) of projective geometry. Conclude
that every interval of length 3 in the lattice of flats of a matroid is a linear
geometry. 


(7) Coproducts of matroids. Let E₁, E₂ be ground sets of matroids and let E
denote their disjoint union. Define a subset of E to be independent if it is the
union of an independent set in E₁ with an independent set in E₂. Show that
this gives a matroid. 


(8) Graphic matroids are vectorial. Given an undirected graph with m vertices 
and n edges, consider the (m x n)-matrix whose rows and columns are labelled 
by vertices and edges respectively, and whose (i, j)-th entry is 1 if the ith row
label is incident to the jth column label, and is 0 otherwise. Consider the
columns as points of the vector space (Z₂)ᵐ. Show that the resulting matroid
of linear dependences is the circuit matroid of the graph. 


9. Spherical oriented matroids 


Matroids are geometries of linear or affine sets (straight lines, planes, ... ), 
but do not have any structure of convexity. There is no notion of betweenness 
on the points of a straight line or, dually, hyperplanes do not separate the space 
into two half-spaces. Matroids therefore have to be given additional structure, 
an orientation, from which convexity may be derived, and this is the theory
of oriented matroids. 

Convexity is not anyway to be expected in matroids, as general vector spaces 
do not have convexity. It is only vector spaces over ordered fields, such as the 
field of real numbers, that do. Each straight line in a vector space is isomorphic 
to the field, and betweenness on the line is derived from the linear order on 
the field. 

There are two conceptions of an oriented matroid, flat and spherical, the 
definitions of both appearing more or less independently in the same journal 
in the same year. Flat oriented matroids, due to Bland and Vergnas, 1978, are 
far more commonly studied than spherical oriented matroids, which are due 
to Folkman and Lawrence, 1978. Spherical oriented matroids are relegated to 
exactly one exercise in the standard, classic textbook on the subject Björner 
et al., 1993. This is justified by the fact that the two concepts are equivalent, 
though some care has to be taken over expressing this equivalence formally. 


9.1 Involutions 


It seems that the concept of an involution is basic in combinatorial spherical 
geometry. One thinks of the involute of a point on the surface of a sphere as its
opposite, or antipode. In Euclidean geometry, the involute of any such point x
is simply its negation −x.

Abstractly, an involution on a set is any endomorphism such that, for all x,
x = −(−x) and x ≠ −x. The latter fixed-point-free requirement that x ≠ −x
is not usually assumed, but there is no reason not to assume it here. A set
together with an involution will be called an involuted set. For any subset X
of an involuted set, its involute −X is the set {−x | x ∈ X}. X is admissible
if X ∩ −X = ∅, and symmetric if X = −X. A transversal is a maximal
admissible set, that is, any set that for all x contains exactly one of x, −x.
Clearly, every finite involuted set has an even cardinality. 
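
These definitions translate directly into code. In the sketch below an involuted set is modelled, purely as an assumption for illustration, by a set of non-zero integers closed under arithmetic negation; the function names are not from the text.

```python
from itertools import combinations

def neg(s):
    """The involute -X of a set X."""
    return frozenset(-x for x in s)

def is_admissible(s):
    """X is admissible if it shares no element with -X."""
    s = frozenset(s)
    return not (s & neg(s))

def is_symmetric(s):
    """X is symmetric if X = -X."""
    s = frozenset(s)
    return s == neg(s)

def is_transversal(s, ground):
    """Admissible, and containing x or -x for every x of the ground set."""
    s = frozenset(s)
    return is_admissible(s) and all(x in s or -x in s for x in ground)

def powerset(ground):
    """All subsets of `ground`, as frozensets."""
    items = list(ground)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

# Hypothetical involuted set of cardinality 6 (three antipodal pairs).
ground = frozenset({1, -1, 2, -2, 3, -3})

admissible = [s for s in powerset(ground) if is_admissible(s)]
transversals = [s for s in powerset(ground) if is_transversal(s, ground)]
print(len(admissible), len(transversals))
```

For three antipodal pairs each pair contributes x, −x or neither to an admissible set, and exactly one of the two to a transversal, which accounts for the two counts printed.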

The natural geometric interpretation of a finite set of cardinality n is as the 
set of vertices of the n-simplex, which may be thought of as the convex hull 
in Rⁿ of its canonical basis, which is the set of points e₁, ..., eₙ, where eᵢ is
the point that has ith coordinate 1 and all other coordinates 0. Subsets of the
set then correspond exactly to the sets of vertices of the faces of the n-simplex: 
the face lattice of the n-simplex is the Boolean algebra over n elements. 

On the other hand, the natural geometric interpretation of an involuted set 
of cardinality 2n is as the set of vertices of the n-crosspolytope, which is the 
convex hull of the set of points e₁, ..., eₙ, −e₁, ..., −eₙ. The admissible sets
then correspond exactly to the sets of vertices of the faces of the n-crosspolytope 
(see Fig. 12.19). Thus the lattice of admissible sets (together with an adjoined 
top element) is the face lattice of the crosspolytope, which is the dual (i.e. the 
lattice turned upside-down) of the face lattice of the n-cube.


Figure 12.19. The 3-crosspolytope (octahedron).


9.2 Spherical geometry


To recap the basics of classical spherical geometry, let
Sₙ denote the n-dimensional unit sphere, that is, the set of all points in Rⁿ⁺¹ at
distance one from the origin. The natural involution on the sphere is negation.

A ray from the origin is any set of the form {λx | λ > 0}, for any x ≠ 0.
There is an obvious bijection between the set of all such rays and the sphere. 
A convex cone is any convex set that is the union of such rays. A subset of 
the sphere is spherical convex if it is the intersection of a convex cone with 
the sphere, and spherical linear if it is the intersection of a subspace with the 
sphere. An attractive feature of spherical geometry is that linearity may be 
derived from convexity together with the involution, which is thematic in what 
is to follow. 


THEOREM 12.67 A set is spherical linear if and only if it is spherical convex
and symmetric with respect to the involution. 


Real projective space Pₙ may be derived from Sₙ by quotienting. Consider
any transversal (taken with respect to the involution) of the sphere, and say that
its subspaces are its intersections with the spherical linear sets. This results in
Pₙ, no matter which transversal is chosen.

A spherical linear set is a spherical hyperplane if it is the intersection of 
the sphere with a hyperplane that contains the origin (i.e. an n-dimensional 
subspace). The two corresponding open hemispheres are the respective in- 
tersections of the hyperplane’s open half-spaces with the sphere. A closed 
hemisphere is the union of an open hemisphere with its spherical hyperplane 
(“at infinity”). Any open hemisphere together with its inherited geometry is 
isomorphic to Rⁿ.

Spherical geometry has been adopted by Stolfi, 1991, as a geometry that has 
convexity while at the same time retaining many of the advantages of projective 
geometry, which does not have convexity. The main advantage is the retention 
of (a version of) homogeneous coordinates. 

Convexity in Euclidean geometry is algebraic: the convex hull of a set is 
the union of the convex hulls of its finite subsets. The same is true in spherical 
geometry. The spherical relative interior of a finite subset X = {x₁, ..., xₖ}
of the sphere is the set of all x ∈ Sₙ such that there exist scalars λᵢ > 0 such
that x = Σ λᵢxᵢ. The spherical convex hull of X is the set of all x ∈ Sₙ such
that there exist scalars λᵢ ≥ 0 such that x = Σ λᵢxᵢ.


THEOREM 12.68 A set is spherical convex if and only if it contains the spher- 
ical convex hull of each of its finite subsets. 


The set X is surrounding if the origin is in its relative interior, that is, if there
exist scalars λᵢ > 0 such that 0 = Σ λᵢxᵢ. It is simple to prove that x is in the
spherical relative interior of X if and only if X ∪ −x is surrounding, and that
the spherical convex hull of X is the union of the respective spherical relative 
interiors of all subsets of X. 
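
The first of these two claims follows in one line in each direction. The derivation below is a sketch written for this purpose; the scalar μ is a symbol introduced here, and for simplicity it is assumed that −x does not already belong to X.

```latex
\begin{align*}
x = \sum_{i=1}^{k} \lambda_i x_i,\quad \lambda_i > 0
  &\;\Longrightarrow\;
  0 = \sum_{i=1}^{k} \lambda_i x_i + 1\cdot(-x), \\
0 = \sum_{i=1}^{k} \lambda_i x_i + \mu\,(-x),\quad \lambda_i, \mu > 0
  &\;\Longrightarrow\;
  x = \sum_{i=1}^{k} \frac{\lambda_i}{\mu}\, x_i .
\end{align*}
```

In the first line all coefficients of the points of X ∪ −x are positive, so X ∪ −x is surrounding; in the second, the coefficients λᵢ/μ are positive, so x lies in the spherical relative interior of X.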


9.3 Spherical oriented matroids 


It has just been shown that all of the convex and linear geometry of the sphere 
can be derived from the surrounding sets together with the involution. Oriented 
matroid theory may be regarded as an abstract, combinatorial theory of this. It is
convenient to now adopt the convention that the empty set is surrounding. Say
that a vector is any admissible surrounding set, and that a circuit is any minimal 
non-empty vector. These are the intuitions for the following definitions, the 
first of which is due to Folkman and Lawrence, 1978. 


Circuit axioms A spherical oriented matroid is a finite involuted set together 
with a collection of its subsets, called circuits, such that: 

(C1) each circuit is non-empty and admissible; 

(C2) no circuit is a proper subset of any other; 

(C3) if C is a circuit then so is —C; 


(C4) if C ≠ D are circuits and x ∈ C ∩ D then (C\x) ∪ (−D\−x) contains
a circuit.
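
The circuit axioms can be tested on a very small geometric example. The sketch below assumes a hypothetical one-dimensional configuration: labels 1, 2, 3 stand for the real numbers +1, +1, −1, and the label −i for the negated value. On a line, a set of non-zero reals admits a vanishing positive combination exactly when both signs occur, which is what `surrounds_origin` tests; all names are assumptions made for illustration.

```python
from itertools import combinations

def neg(s):
    """The involute -X of a set X of non-zero integer labels."""
    return frozenset(-x for x in s)

def powerset(ground):
    """All subsets of `ground`, as frozensets."""
    items = list(ground)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

# Hypothetical configuration on a line (see the lead-in).
value = {1: 1, 2: 1, 3: -1}
value.update({-i: -v for i, v in list(value.items())})
ground = frozenset(value)

def is_admissible(s):
    return not (s & neg(s))

def surrounds_origin(s):
    """On a line, a positive combination vanishes iff both signs occur."""
    return any(value[x] > 0 for x in s) and any(value[x] < 0 for x in s)

# Circuits: minimal non-empty admissible surrounding sets.
candidates = [s for s in powerset(ground)
              if s and is_admissible(s) and surrounds_origin(s)]
circuits = {s for s in candidates if not any(t < s for t in candidates)}

def satisfies_oriented_circuit_axioms(circs):
    """Brute-force check of (C1)-(C4)."""
    c1 = all(c and is_admissible(c) for c in circs)
    c2 = not any(c < d for c in circs for d in circs)
    c3 = all(neg(c) in circs for c in circs)
    c4 = all(any(e <= (c - {x}) | (neg(d) - {-x}) for e in circs)
             for c in circs for d in circs if c != d
             for x in c & d)
    return c1 and c2 and c3 and c4

print(sorted(sorted(c) for c in circuits))
print(satisfies_oriented_circuit_axioms(circuits))
```

For instance, eliminating 1 from the circuits {1, −2} and {1, 3} gives {−2} ∪ {−3}, which is again one of the six circuits.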


A vector in an oriented matroid is any set that is both admissible and the union 
of circuits. The empty union is allowed, and so Ø is always a vector. Vector 
axioms for oriented matroids come in various apparently differing strengths; 
the following is a version of the Vector Axioms (4.1.1) in Björner et al., 1993. 


Vector axioms A spherical oriented matroid is a finite involuted set together 
with a collection of its subsets, called vectors, such that: 

(V1) $\emptyset$ is a vector;

(V2) each vector is admissible;

(V3) if $U$ is a vector then so is $-U$;

(V4) the union of vectors, if admissible, is again a vector;


(V5) if $U, V$ are vectors and $x \in U \cap V$ then $(U \setminus x) \cup (-V \setminus -x)$ contains a
vector that contains $(U \setminus V) \cup (-V \setminus -U)$.
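
As an illustration, the vector axioms can be tested by brute force. The sketch below is an assumption-laden example rather than anything from the source: labels are strings with a leading '-' for the involute, the checker name failed_vector_axioms is hypothetical, and the test family is the list of thirteen vectors given in the caption of Fig. 12.21.

```python
def neg(x):
    """Involution on labels: 'a' <-> '-a' (assumed labelling convention)."""
    return x[1:] if x.startswith("-") else "-" + x


def neg_set(X):
    return frozenset(neg(x) for x in X)


def is_admissible(X):
    """Assumed definition: no opposite pair {x, -x} inside X."""
    return frozenset(X).isdisjoint(neg_set(X))


def failed_vector_axioms(vectors):
    """Return the sorted names of the axioms (V1)-(V5) that the family violates."""
    vectors = {frozenset(V) for V in vectors}
    failed = set()
    if frozenset() not in vectors:
        failed.add("V1")
    if not all(is_admissible(V) for V in vectors):
        failed.add("V2")
    if not all(neg_set(V) in vectors for V in vectors):
        failed.add("V3")
    for U in vectors:
        for V in vectors:
            # (V4), checked pairwise: an admissible union must be a vector.
            if is_admissible(U | V) and (U | V) not in vectors:
                failed.add("V4")
            for x in U & V:
                # (V5): some vector W with lower <= W <= upper must exist.
                upper = (U - {x}) | (neg_set(V) - {neg(x)})
                lower = (U - V) | (neg_set(V) - neg_set(U))
                if not any(lower <= W <= upper for W in vectors):
                    failed.add("V5")
    return sorted(failed)


# The thirteen vectors from the caption of Fig. 12.21, one word per set.
FIG_12_21 = [frozenset(w.split()) for w in [
    "", "a b c", "b c", "-a b c", "-a c", "-a -b c", "-a -b",
    "-a -b -c", "-b -c", "a -b -c", "a -c", "a b -c", "a b"]]
print(failed_vector_axioms(FIG_12_21))
```

Checking (V4) on pairs only is a simplification; it suffices for finite families because every sub-union of an admissible union is itself admissible.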


A surrounding set in an oriented matroid is any set that is the union of a 
vector and a symmetric set. The following axiomatization, referred to in Sec. 7 
above, appears to be new.


Surrounding set axioms A spherical oriented matroid is a finite involuted set
together with a collection of its subsets, called surrounding sets, such that:


(S1) every symmetric set is surrounding; 
(S2) if $S$ is surrounding then so is $-S$;
(S3) the union of surrounding sets is again surrounding;


(S4) if $S, T$ are surrounding and $x \in S \cap T$ then $(S \setminus x) \cup (-T \setminus -x)$ is also
surrounding;


(S5) if $S$ is surrounding and $x, -x \in S$ then at least one of the sets $S \setminus x$,
$S \setminus -x$, $S \setminus \{x, -x\}$ is also surrounding.


9.4 Equivalence of the axiomatizations 


Axiom (C4) is called the Circuit Elimination Condition. The following
apparently stronger condition is a most important basic property of oriented
matroids. As its proof runs to well over a page in both
Folkman and Lawrence, 1978 and Björner et al., 1993, it will not be repeated
here. Using only the Circuit Axioms:


LEMMA 12.69 If $C \neq D$ are circuits, $x \in C \cap D$ and $y \in (C \setminus D) \cup
(-D \setminus -C)$, then $(C \setminus x) \cup (-D \setminus -x)$ contains a circuit that contains $y$.


The composition of a pair of subsets $X, Y$ of an involuted set is the set
$X \circ Y = X \cup (Y \setminus -X)$. This operation is associative but not commutative.
The composition of two admissible sets is again admissible, and composition 
might be thought of as a kind of non-symmetric union on admissible sets. Again, 
using only the Circuit Axioms: 


LEMMA 12.70 The composition of two vectors is again a vector. 


Proof The result amounts to the fact that if $U, V$ are vectors and $x \in V \setminus (U \cup
-U)$ then $x \in C \subseteq U \circ V$ for some circuit $C$. Let $C$ be any circuit satisfying (a)
$x \in C$ and (b) $C \subseteq V \cup U \cup -U$, chosen so that $|C \cap -U|$ is minimal among all
such circuits. If $C \cap -U$ is empty then the result is proved, so suppose
for contradiction that $y \in C \cap -U$. Let $D \subseteq -U$ be a circuit that contains $y$.
Then $x \in C \setminus D$ and, by strong elimination, the set $(C \setminus y) \cup (-D \setminus -y)$ contains
a circuit that satisfies (a), (b) and contradicts minimality of $|C \cap -U|$. QED
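
The composition operation used in Lemma 12.70 is easy to experiment with. The sketch below is illustrative only: it assumes string labels with a leading '-' for the involute and takes admissible to mean that no opposite pair occurs; the helper names are not from the source.

```python
from itertools import combinations


def neg(x):
    """Involution on labels: 'a' <-> '-a' (assumed labelling convention)."""
    return x[1:] if x.startswith("-") else "-" + x


def neg_set(X):
    return frozenset(neg(x) for x in X)


def is_admissible(X):
    """Assumed definition: no opposite pair {x, -x} inside X."""
    return frozenset(X).isdisjoint(neg_set(X))


def compose(X, Y):
    """X o Y = X u (Y \\ -X): keep X, then add whatever in Y does not oppose X."""
    X, Y = frozenset(X), frozenset(Y)
    return X | (Y - neg_set(X))


def all_subsets(ground):
    items = sorted(ground)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


# Composition is not commutative: the left operand wins on conflicts.
print(sorted(compose({"a"}, {"-a", "b"})))
print(sorted(compose({"-a", "b"}, {"a"})))
```

The left operand always survives intact, which is why composition behaves like a non-symmetric union on admissible sets.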


PROPOSITION 12.71 The Circuit Axioms are equivalent to the Vector Axioms.


Proof Defining vectors in terms of circuits it is easy to see that (V1)-(V4) are
satisfied. For (V5), let $U, V$ be vectors and $x \in U \cap V$. If $U = V$ then $\emptyset$ satisfies
(V5). If not, it is first claimed that, for any $y \in U \setminus V$, there is a circuit $C_y$ that
contains $y$ and satisfies (a) $C_y \cap (-U \setminus -V) = \emptyset$ and (b) $C_y \cap (V \setminus U) = \emptyset$. To
prove this, if there is a circuit $C \subseteq U$ that contains $y$ but not $x$, let $C_y = C$.
If not, let $C \subseteq U$ be a circuit that contains both $y$ and $x$, and let $D \subseteq V$ be any
circuit that contains $x$. Then $y \in C \setminus D$ so, by strong
elimination, there is a circuit $C_y \subseteq (C \setminus x) \cup (-D \setminus -x)$ with $y \in C_y$, and this
proves the claim. Now taking the composition (in any order) of all the circuits
$C_y$ results in a vector $W$ such that $U \setminus V \subseteq W$ and $W \cap (-U \setminus -V) = \emptyset$. By a
similar construction on $-V \setminus -U$, there is a vector $W'$ such that $-V \setminus -U \subseteq W'$
and $W' \cap (V \setminus U) = \emptyset$. The composition $W \circ W'$ then satisfies the conditions
of (V5).

Conversely, beginning with the Vector Axioms and defining circuits as minimal
non-empty vectors, it is easy to see that the Circuit Axioms are satisfied. It
remains to show that every vector is the union of circuits, which will be proved
by induction on vector size. Let $U$ be a vector and assume that the property
holds for all vectors of cardinality $< |U|$. It is first claimed that, for any $x \in U$,
there is a circuit $C \subseteq U \cup -U$ such that $x \in C$. Let $D \subseteq U$ be any circuit
and assume that $x \notin D$ (otherwise the claim is proved). For any $y \in D$, by
vector elimination there is a vector $V \subseteq (U \setminus y) \cup (-D \setminus -y)$ that contains $x$.
Then $|V| < |U|$ because vectors are admissible, so, by the induction hypothesis,
the claim is proved. Now let $C$ be any circuit such that (a) $x \in C$ and (b)
$C \subseteq U \cup -U$, and is such that $|C \cap -U|$ is minimal among all such circuits.
Suppose for contradiction that $y \in C \cap -U$. By vector elimination there is a
vector $V \subseteq (C \setminus y) \cup (U \setminus -y)$ that contains $x$. By the induction hypothesis, $V$
contains a circuit $D$ that contains $x$. Then $D$ satisfies (a), (b), but $|D \cap -U|$
contradicts minimality of $|C \cap -U|$. QED


PROPOSITION 12.72 The Vector Axioms are equivalent to the Surrounding 
Set Axioms. 


Proof Beginning with the Vector Axioms and defining surrounding sets in
terms of vectors, it is easy to show that (S1)-(S3) are satisfied. For (S4), let
$U, V$ be vectors and $x \in U \cap V$. Let $W$ be a vector such that $(U \setminus V) \cup
(-V \setminus -U) \subseteq W \subseteq (U \setminus x) \cup (-V \setminus -x)$. The union of $W$ with the symmetric set
$((X \cup -X) \cap (Y \cup -Y)) \setminus \{x, -x\}$ is the set $(U \setminus x) \cup (-V \setminus -x)$. If $U$ is a vector,
$X$ a symmetric set and $x \in U \cap X$, then $(U \setminus x) \cup (-X \setminus -x) = U \cup (X \setminus \{x, -x\})$.
If $X, Y$ are symmetric sets and $x \in X \cap Y$ then $(X \setminus x) \cup (-Y \setminus -x) = X \cup Y$.
Proving (S4) is then a matter of checking cases. For (S5), let $U$ be a vector and $X$
a symmetric set, and let $x, -x \in U \cup X$. If $x, -x \notin U$ then $(U \cup X) \setminus \{x, -x\} =
U \cup (X \setminus \{x, -x\})$. If $x \in U$ then $(U \cup X) \setminus -x = U \cup (X \setminus \{x, -x\})$. If $-x \in U$
then $(U \cup X) \setminus x = U \cup (X \setminus \{x, -x\})$.

Conversely, beginning with the Surrounding Set Axioms and defining a vector
as an admissible surrounding set, it is easy to see that (V1)-(V4) are satisfied.
For (V5), if $U, V$ are vectors and $x \in U \cap V$ then $(U \setminus x) \cup (-V \setminus -x)$ is surrounding,
and a vector satisfying (V5) is obtained by repeated application of (S5).
Repeated application of (S5) may also be used to show that every surrounding set
may be recovered as the union of a vector and a
symmetric set. QED


We can now supply a straightforward proof of Proposition 12.59, whose 
proof was omitted in Sec. 7. In our current notation (and slightly generalized) 
the proposition may be rewritten: 


PROPOSITION 12.73 Let $A$ be an admissible set, $B$ a subset of $A$, and suppose
that $A \cup -p$ and $B \cup -q$ are surrounding sets. Then $A \cup \{q, -p\}$ is surrounding.


Proof Note first that if $q \in B$ there is nothing to prove. Likewise if $-q \in B$
the result is immediate, since in that case $A \cup \{q, -p\} = A \cup -p \cup \{q, -q\}$, and
thus is surrounding. Assume then that neither $q$ nor $-q$ occurs in $B$. Similarly
we may assume that $p \neq q$.

There are now two (main) cases. Suppose first that $p$ does not occur in $A$.
Then $A \cup -p$ is (admissible hence) a vector, and (in virtue of (V3)) $q \cup -B$
is a vector. Hence the composition $(A \cup -p) \circ (q \cup -B) = A \cup \{q, -p\}$
is a vector. In the second case, suppose that $p \in A$. Then by (S5) at least
one of $A$, $A \setminus p$, $(A \cup -p) \setminus p$ is surrounding (hence a vector). Consider the
compositions $A \circ (q \cup -B)$, $(A \setminus p) \circ (q \cup -B)$, $((A \cup -p) \setminus p) \circ (q \cup -B)$.
In each case $A \cup \{q, -p\}$ results from taking the union of $\{p, -p\}$ with the
composition, and hence is surrounding. QED


9.5 Examples 


9.5.1 Realizable oriented matroids. It is left as an exercise to show
that the circuits on the sphere satisfy (C1)-(C4), that the vectors on the sphere
satisfy (V1)-(V5) and that the surrounding sets on the sphere satisfy
(S1)-(S5). It is then immediate that, for any finite subset $E$ of the sphere
that is symmetric with respect to the involution, the collections of circuits,
vectors and surrounding sets that are subsets of $E$ satisfy the respective oriented
matroid axioms. The oriented matroids that can be constructed in this way are
called realizable. An example is given in Fig. 12.20.


Figure 12.20. The circuits are $\{a, -b, c\}$ and $\{-a, b, -c\}$.


9.5.2 Hyperplane arrangements. A second construction of realizable
oriented matroids is given by hyperplane arrangements. A hyperplane arrangement
in $\mathbb{R}^{n+1}$ is a finite set of hyperplanes, each of which contains the origin.
Consider the set $E$ of the corresponding open half-spaces, with the obvious
involution. For any point $x \in \mathbb{R}^{n+1}$, consider the set of all the half-spaces that
contain $x$. The set of all subsets of $E$ that are constructed in this way is the set
of vectors of an oriented matroid. An example is given in Fig. 12.21.
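
The construction can be carried out exactly for an arrangement of lines through the origin in the plane. The sketch below is only an illustration under stated assumptions: the normal vectors in NORMALS are hypothetical (the actual geometry of Fig. 12.21 is not given in the text), the open half-space labelled 'a' is taken to be the side on which the inner product with the normal of line a is positive, and at least two distinct lines are assumed. One point is sampled from each cell of the arrangement (the origin, each ray, each open sector) and the half-spaces containing it are recorded.

```python
import math

# Hypothetical integer normals for three lines through the origin.
NORMALS = {"a": (1, 0), "b": (1, 2), "c": (-1, 2)}


def sign_vector(p, normals):
    """The set of labelled open half-spaces that contain the point p."""
    out = set()
    for name, (nx, ny) in normals.items():
        s = nx * p[0] + ny * p[1]
        if s > 0:
            out.add(name)
        elif s < 0:
            out.add("-" + name)
    return frozenset(out)


def arrangement_vectors(normals):
    """Sign vectors of all cells of a planar arrangement (two or more lines)."""
    rays = set()
    for nx, ny in normals.values():
        rays.add((-ny, nx))   # the two directions along the line
        rays.add((ny, -nx))
    rays = sorted(rays, key=lambda d: math.atan2(d[1], d[0]))
    points = [(0, 0)] + rays
    # The sum of two angularly consecutive rays lies in the open sector between them.
    for d, e in zip(rays, rays[1:] + rays[:1]):
        points.append((d[0] + e[0], d[1] + e[1]))
    return {sign_vector(p, normals) for p in points}


for vector in sorted(arrangement_vectors(NORMALS), key=lambda v: (len(v), sorted(v))):
    print(sorted(vector))
```

With these particular normals the thirteen sets produced coincide with the vectors listed in the caption of Fig. 12.21: one for the origin, six for the rays and six for the open sectors.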

This example has a natural spherical interpretation. Each hyperplane is con- 
sidered as a spherical hyperplane, and the half-spaces are then considered as 
spherical open hemispheres. Taking two points on the sphere to be equivalent 
if they induce the same oriented matroid vector (i.e. if they both lie in ex- 
actly the same set of hemispheres), the set of equivalence classes is a cellular 
decomposition of the sphere. 

The hyperplane arrangement construction is in fact dual to the construction of
the oriented matroid on a symmetric point set $E$ on the sphere. For each $x \in E$,
consider the open half-space containing $x$ of the hyperplane orthogonal to
$x$. Labelling this open half-space $x$ and then taking the hyperplane arrangement
construction gives the dual (see oriented matroid duality, to follow) oriented
matroid of the oriented matroid on $E$.


9.6 Convexity and linearity 


Convexity in oriented matroids is derived from surrounding sets just as in 
spherical geometry. 


DEFINITION 12.74 For any subset X of an oriented matroid: 


(a) its relative interior is the set $\mathrm{relint}(X)$ of all points $x$ such that $X \cup -x$
is surrounding;


(b) its convex hull is the union $[X]$ of the respective relative interiors of all
subsets of $X$.


Figure 12.21. With the half-spaces labelled in this way, the vectors of the oriented matroid of
this hyperplane arrangement are $\emptyset$, $\{a,b,c\}$, $\{b,c\}$, $\{-a,b,c\}$, $\{-a,c\}$, $\{-a,-b,c\}$,
$\{-a,-b\}$, $\{-a,-b,-c\}$, $\{-b,-c\}$, $\{a,-b,-c\}$, $\{a,-c\}$, $\{a,b,-c\}$, $\{a,b\}$.








Thus $x \in [X]$ precisely when there exists a surrounding set $S$ such that
$-x \in S \subseteq -x \cup X$.


PROPOSITION 12.75 The convex hull operator is increasing, monotonic and 
idempotent. 


Proof It is clear that, for any point $x$, $x \in \mathrm{relint}(x)$, and so the convex
hull operator is increasing. (It is not in general true that $X$ and $\mathrm{relint}(X)$ are
comparable.) It is immediate from the definition that the convex hull operator is
monotonic. For idempotency, let $x \in [[X]]$. There is some $Y = \{y_1, \dots, y_k\} \subseteq
[X]$ such that $-x \cup Y$ is surrounding. For each $i$ there is some $X_i \subseteq X$ such
that $-y_i \cup X_i$ is surrounding. By elimination on $y_1$, the set $-x \cup (Y \setminus y_1) \cup X_1$ is
surrounding. Eliminating then on $y_2, \dots, y_k$ it follows that $-x \cup X_1 \cup \dots \cup X_k$
is surrounding, and so $x \in [X]$. QED
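
Definition 12.74 and Proposition 12.75 can be tried out on a small example. The following sketch makes several assumptions that are not in the source: elements are string labels with a leading '-' for the involute, admissible means that no opposite pair occurs, and the oriented matroid is the one generated on the ground set {a, -a, b, -b, c, -c} by the two circuits of Fig. 12.20. Surrounding sets are built as unions of a vector and a symmetric set, as described in Sec. 9.3.

```python
from itertools import combinations


def neg(x):
    return x[1:] if x.startswith("-") else "-" + x


def neg_set(X):
    return frozenset(neg(x) for x in X)


def is_admissible(X):
    return frozenset(X).isdisjoint(neg_set(X))


def subsets(items):
    items = list(items)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


def surrounding_sets(ground, circuits):
    """All unions of a vector (admissible union of circuits) and a symmetric set."""
    vectors = {frozenset().union(*family) for family in subsets(circuits)}
    vectors = {U for U in vectors if is_admissible(U)}
    symmetric = [S for S in subsets(ground) if S == neg_set(S)]
    return {U | S for U in vectors for S in symmetric}


def relint(X, surr, ground):
    """All x such that X u {-x} is surrounding."""
    return frozenset(x for x in ground if (frozenset(X) | {neg(x)}) in surr)


def hull(X, surr, ground):
    """Union of the relative interiors of all subsets of X."""
    out = set()
    for Y in subsets(X):
        out |= relint(Y, surr, ground)
    return frozenset(out)


GROUND = frozenset({"a", "-a", "b", "-b", "c", "-c"})
CIRCUITS = [frozenset({"a", "-b", "c"}), frozenset({"-a", "b", "-c"})]
SURR = surrounding_sets(GROUND, CIRCUITS)
print(sorted(hull({"a", "c"}, SURR, GROUND)))
```

In this example b lies in the hull of {a, c}, because {a, -b, c} is a circuit and hence surrounding.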


DEFINITION 12.76 A subset of an oriented matroid is convex if it is equal to 
its convex hull, and is linear if it is both convex and symmetric. 


As the convex hull operator is that of a closure system, the intersection of 
convex sets is again convex. As also the intersection of symmetric sets is again 
symmetric, the intersection of linear sets is again linear. Clearly, the ground set 
is linear, so the linear hull $\langle X \rangle$ may be defined as the smallest linear set that
contains X. 

The proofs of the next two results are left as a straightforward exercise in the 
manipulation of surrounding sets. 


PROPOSITION 12.77 For any set $X$ and any $x, y$:

(a) $-\mathrm{relint}(X) = \mathrm{relint}(-X)$;
(b) $-[X] = [-X]$;
(c) $[X \cup -X] = \langle X \rangle$;
(d) $[x \cup X] \cap [-x \cup X] = [X]$;
(e) $y \in [x \cup X] \Rightarrow -x \in [-y \cup X]$.


PROPOSITION 12.78 For any linear $L$ and any $x, y \notin L$:

(a) $\langle L \cup x \rangle = [L \cup x] \cup [L \cup -x]$;
(b) $y \in [L \cup x] \Leftrightarrow x \in [L \cup y]$ $(\Leftrightarrow [L \cup x] = [L \cup y])$;
(c) $-x \notin [L \cup x]$.


PROPOSITION 12.79 Surrounding sets have the following properties:

(a) a set is surrounding if and only if its convex hull is linear;


(b) if $S$ is surrounding and $C$ is convex then $S \subseteq C$ implies $-S \subseteq C$.





Proof (a) If $S$ is surrounding then, for any $x \in -S$, $-x \in S \subseteq -x \cup S$, and
so $x \in [S]$. Then $-S \subseteq [S]$, and then $\langle S \rangle = [S \cup -S] = [S]$. For the converse,
if $[X] = \langle X \rangle$ then, for all $x \in X$, $-x \in [X]$, which means that there is some
surrounding set $S$ such that $x \in S \subseteq x \cup X = X$. Then $X$ is the union of
surrounding sets and so is itself surrounding.

(b) Because $-S \subseteq \langle S \rangle = [S] \subseteq C$. QED


PROPOSITION 12.80 The linear sets are the flats of a matroid. 


Proof That the linear hull operator is the closure operator of a matroid is a 
simple consequence of convexity property (e). QED 


This matroid is called the linear matroid of the oriented matroid. Each set 
{x, —x} is dependent, therefore every independent set is admissible. A conse- 
quence of the fact that every linear set is symmetric is that every reorientation 
(see later) of an independent set is again independent. Then every reorientation 
of a matroid circuit is again a matroid circuit. The oriented matroid circuits 
do not in general satisfy the circuit axioms of a matroid, but the circuits of the 
linear matroid are precisely the reorientations of the oriented matroid circuits, 
together with those sets $\{x, -x\}$ such that $x$ is not dependent.


9.7 Oriented matroid duality


As with matroids, duality is a most important part of oriented matroid theory. 


DEFINITION 12.81 Subsets $X, Y$ of an involuted set are orthogonal, written
$X \perp Y$, if $X \cap Y = \emptyset \Leftrightarrow X \cap -Y = \emptyset$.


The orthogonality relation is symmetric, a non-empty set is admissible if and only if it
is not orthogonal to itself, and a set is symmetric if and only if it is orthogonal
to all sets.
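
The three remarks following Definition 12.81 can be confirmed by exhaustive search on a small involuted set. This is a minimal sketch under the same labelling assumptions as before (strings with a leading '-' for the involute; admissible meaning no opposite pair); the function names are illustrative. The admissibility remark is checked for non-empty sets only, since the empty set is both admissible and orthogonal to itself.

```python
from itertools import combinations


def neg(x):
    return x[1:] if x.startswith("-") else "-" + x


def neg_set(X):
    return frozenset(neg(x) for x in X)


def orthogonal(X, Y):
    """X is orthogonal to Y when X meets Y exactly when X meets -Y."""
    X, Y = frozenset(X), frozenset(Y)
    return X.isdisjoint(Y) == X.isdisjoint(neg_set(Y))


def subsets(items):
    items = sorted(items)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


SUBS = subsets({"a", "-a", "b", "-b"})
symmetric = [X for X in SUBS if X == neg_set(X)]
orthogonal_to_all = [X for X in SUBS if all(orthogonal(X, Y) for Y in SUBS)]
print(symmetric == orthogonal_to_all)
```

The two lists agree, so on this ground set the symmetric sets are exactly the sets orthogonal to everything.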

A subset of an oriented matroid is cosurrounding if it is orthogonal to every 
surrounding set. Covectors are admissible cosurrounding sets, and cocircuits 
are minimal non-empty covectors. 


PROPOSITION 12.82 The set of cosurrounding sets of an oriented matroid 
satisfies the oriented matroid surrounding set axioms. 


Proof A set is symmetric iff it is orthogonal to all sets, which gives (S1). If
$X \perp Y$ then $X \perp -Y$, which gives (S2). If $X \perp Y, Z$ then $X \perp (Y \cup Z)$,
which gives (S3).

For (S4), suppose for contradiction that $X, Y$ are cosurrounding, $x \in X \cap Y$
and $(X \setminus x) \cup (-Y \setminus -x)$ is not cosurrounding. Then, without loss of generality,
there exists a surrounding set $S$ such that (a) $(X \setminus x) \cap S \neq \emptyset$, (b) $(X \setminus x) \cap -S =
\emptyset$ and (c) $(Y \setminus x) \cap S = \emptyset$. That $X \perp S$ gives that (d) $X \cap -S = x$, which
together with (c) gives that (e) $Y \cap S = x$. So $x, -x \in S$, and then from (b)
it follows that $-x \notin X$. Then from (a) it follows that there is $y \neq x, -x$ such
that $y \in X \cap S$. But $Y$ and $S \setminus x$ are not orthogonal because $Y \cap (S \setminus x) = \emptyset$ and
$x \in Y \cap (-S \setminus -x)$; $X$ and $S \setminus -x$ are not orthogonal because $X \cap (-S \setminus x) = \emptyset$
and $x \in X \cap (S \setminus -x)$; $X$ and $S \setminus \{x, -x\}$ are not orthogonal because $X \cap
(-S \setminus \{x, -x\}) = \emptyset$ and $y \in X \cap (S \setminus \{x, -x\})$. This contradicts the property
(S5) of surrounding sets.

For (S5), let $X$ be cosurrounding, let $x, -x \in X$ and suppose for contradiction
that there exist surrounding sets $R, S, T$ such that: (a) $(X \setminus x) \cap R \neq \emptyset$
and $(X \setminus x) \cap -R = \emptyset$; (b) $(X \setminus -x) \cap S \neq \emptyset$ and $(X \setminus -x) \cap -S = \emptyset$; (c)
$(X \setminus \{x, -x\}) \cap T \neq \emptyset$ and $(X \setminus \{x, -x\}) \cap -T = \emptyset$. From (a) and $X \perp R$
it follows that (d) $X \cap -R = x$. From (b) and $X \perp S$ it follows that (e)
$X \cap -S = -x$. Then $x \in -R \cap S$, so $(-R \setminus x) \cup (-S \setminus -x)$ is surrounding. $X$
is disjoint from this set, and so is disjoint from its involute. It follows that (f)
$X \cap R = -x$, and (g) $X \cap S = x$. From (c) and $X \perp T$ it may be assumed without
loss of generality that $-x \in -T$. Then $x \in S \cap T$, so $(-S \setminus -x) \cup (T \setminus x)$
is surrounding. By (c), $X$ intersects $T \setminus x$, and therefore intersects the involute
$(S \setminus x) \cup (-T \setminus -x)$. By (g), $X$ therefore intersects $(-T \setminus -x)$, and then, by (c),
$x$ is the only possible point in the intersection. Therefore both $x, -x \in T$. By
(c), $X$ and $T \setminus \{x, -x\}$ are not orthogonal. If $X$ and $T \setminus x$ were orthogonal then,
by (f), $-x \in R \cap (T \setminus x)$. Then $(-R \setminus x) \cup (T \setminus \{x, -x\})$ would be surrounding,
but, by (c), $X$ intersects this set, and, by (f) and (c), does not intersect its involute.
Finally, if $X$ and $T \setminus -x$ were orthogonal then, by (g), $x \in S \cap (T \setminus -x)$.
Then $(-S \setminus -x) \cup (T \setminus \{x, -x\})$ would be surrounding, but, by (c), $X$ intersects
this set, and, by (g) and (c), does not intersect its involute. Therefore none
of $T \setminus x$, $T \setminus -x$, $T \setminus \{x, -x\}$ are surrounding, a contradiction. QED


PROPOSITION 12.83 Cosurrounding sets have the following properties: 


(a) complements of convex sets are cosurrounding; 


(b) complements of admissible cosurrounding sets (i.e. covectors) are convex. 


Proof (a) Follows immediately from property (b) of surrounding sets. 
(b) Straightforward exercise. QED 


The oriented matroid of cosurrounding sets is called the dual of the original 
oriented matroid. 


THEOREM 12.84 Every oriented matroid is the dual of its dual. 


Proof Let $X$ be orthogonal to all cosurrounding sets. The complement $E \setminus [X]$
of the convex set $[X]$ is cosurrounding and disjoint from $X$, therefore $X \cap
-(E \setminus [X]) = \emptyset$. Now $-(E \setminus [X]) = E \setminus -[X] = E \setminus [-X]$, and so $X \subseteq [-X]$.
Then $[X]$ is linear, so $X$ is surrounding. QED


9.7.1 Example: Oriented matroids from Euclidean subspaces. The 
orthogonality relation on a finite involuted set has the following geometric 
interpretation in terms of Euclidean space considered with its standard inner 
product. Recall that two points $x, y \in \mathbb{R}^n$ are orthogonal, written $x \perp y$, if their
inner product is zero. Consider now the involuted set as the set of vertices of the
crosspolytope, as discussed earlier, and, for any of its subsets $X$, let $X_e$ denote
the corresponding set of crosspolytope vertices. It is straightforward though
tedious to prove that $X \perp Y$ if and only if there exist $x \in \mathrm{relint}(X_e)$ and
$y \in \mathrm{relint}(Y_e)$ such that $x \perp y$. It is left as an exercise in classical Euclidean
geometry to prove that: 


PROPOSITION 12.85 For any subspace $S$ of $\mathbb{R}^n$, the set of all faces of the
crosspolytope whose relative interior intersects $S$ is the set of vectors of an
oriented matroid. Moreover, the covectors are those faces whose relative interior
intersects the orthogonal subspace $S^\perp$.


9.8 Hemispheres


The structure of a dual oriented matroid is bound up with the following
concept. 


DEFINITION 12.86 For any hyperplane $H$ in the linear matroid of an oriented
matroid, its closed hemispheres are the sets $[H \cup x]$ and $[H \cup -x]$, for any
$x \notin H$. Its open hemispheres are the complements of its closed hemispheres.


According to the convexity properties in the previous section, for the definition
of a closed hemisphere it makes no difference which $x \notin H$ is chosen. It is
a simple consequence of the convexity and linearity properties that the union of 
the two closed hemispheres is the whole ground set, the intersection of the two 
closed hemispheres is the hyperplane, one closed hemisphere is the involute of 
the other, and closed hemispheres are convex but not linear. To summarize: 


PROPOSITION 12.87 Where $H^-$, $H^+$ are the open hemispheres of the hyperplane $H$:


(a) $H^+ = -(H^-)$;

(b) $\{H^-, H, H^+\}$ is a partition of the ground set;

(c) the two closed hemispheres are $H \cup H^-$, $H \cup H^+$;
(d) $H \cup H^- = [H \cup x]$ for any $x \in H^-$.


PROPOSITION 12.88 The open hemispheres of an oriented matroid are pre- 
cisely its cocircuits. 


Proof If a closed hemisphere $H \cup H^-$ contains a surrounding set $S$ then, by
property (b) of surrounding sets, so does its involute $H \cup H^+$. Then $S \subseteq H$,
and therefore every open hemisphere is cosurrounding. Conversely, if $U$ is
admissible and cosurrounding then $L = E \setminus (U \cup -U)$ is linear because it is
both symmetric and the intersection of two convex sets. If $U$ is non-empty then
$L \neq E$, and so $L$ is contained in some hyperplane $H$. The remainder of the
proof relies on the following result from Folkman and Lawrence, 1978. QED


LEMMA 12.89 If $C, D$ are circuits of an oriented matroid such that $C \subseteq
D \cup -D$ then $C = D$ or $C = -D$.


Proof (Suppose for contradiction that $C$ intersects both $D$ and $-D$ and then
show that $C$ must then properly contain another circuit.) To complete the proof
of Proposition 12.88, the open hemisphere $H^-$ is a subset of $U \cup -U$. Then, when $U$ is a cocircuit,
one of $U$, $-U$ is equal to $H^-$, and then the other is equal to $H^+$. QED


The following is a combinatorial version of Weyl’s Theorem.


PROPOSITION 12.90 The convex sets in an oriented matroid are precisely the 
intersections of its closed hemispheres. 


Proof The ground set is the intersection of the empty collection of closed 
hemispheres. The complement of a convex set $C$ is cosurrounding. Then if
$x \notin C$ there is a cocircuit that contains $x$ and is disjoint from $C$. Cocircuits
are open hemispheres, and the complement of an open hemisphere is a closed 
hemisphere. QED 


9.9 Polytopes 


If $\mathbb{R}^n$ is used as the basic mathematical model in spatial computation then
polytopes are quite fundamental in that they are regions that can be finitely 
specified (i.e. as convex hulls of finite sets). For a treatment of polytopes and 
oriented matroids, see Ziegler, 1995. Within a finite spatial model, it is tempting 
to say that the polytopes are simply the convex sets. This is certainly true if the 
model is flat, though more care must be taken with spherical models. 


DEFINITION 12.91 A polytope in an oriented matroid is any set that is both 
convex and admissible. 


The requirement of admissibility constrains polytopes to be subsets of closed 
hemispheres, and so no non-empty subspace is a polytope. Without this con- 
straint, it would not be possible to obtain a version of, for example, the Krein- 
Milman theorem (see below). 

Say that a face of a polytope $P$ is any of its subsets $F = P \setminus U$, where $U$
is cosurrounding and $P \cap -U = \emptyset$. Consider, for example, the case where $U$
is a cocircuit (i.e. an open hemisphere). Then, where $H$ is the corresponding
hyperplane, $P$ is contained in the closed hemisphere $H \cup U$ and $F = P \cap H$.
Some simple properties of faces are: 


PROPOSITION 12.92 Let $P$ be a polytope:
(a) both $\emptyset$ and $P$ are faces of $P$;
(b) all faces of $P$ are again polytopes;
(c) the intersection of two faces of $P$ is again a face of $P$.

Proof (a) $E \setminus -P$ and $\emptyset$ are both cosurrounding. (b) Any face is the intersection
of an admissible convex set with a linear set $E \setminus (U \cup -U)$. (c) Straightforward
exercise. QED


Combinatorial versions of several classical convexity theorems, including
Weyl’s Theorem above, have been proved for flat oriented matroids; see Björner 
et al., 1993. A version of Carathéodory’s theorem in the spherical case can be
obtained for admissible sets using the fact that if $X$ is admissible and $x \notin X$,
then $x \in [X]$ precisely when there is a circuit $C$ such that $-x \in C \subseteq X \cup
-x$, and then relating rank to circuits as for matroids. For the Krein-Milman
theorem, say that an extreme point of a polytope $P$ is any $x \in P$ such that $P \setminus x$
is convex. It has been shown for flat oriented matroids that any convex set is 
the convex hull of its extreme points, and a spherical version can be obtained 
via the “equivalence” of flat and spherical oriented matroids (to be discussed). 


9.9.1 Anti-matroids. A second important concept in combinatorial
convexity is that of an anti-matroid, which is a set together with a closure
(monotonic, increasing and idempotent) operator $[\;]$ that satisfies the Anti-Exchange
property

(AE) for all $x \neq y$ and all $X$, if $x \in [y \cup X] \setminus [X]$ then $y \notin [x \cup X]$.


This should be compared with the exchange axiom for matroids. The main 
models are any subsets of Euclidean space with the inherited convex hull op- 
erator. The Anti-Exchange property has been considered as a foundation for 
digital geometry by Pfaltz and others (Pfaltz, 1996; see also Coppel, 1998). 

The anti-exchange property does not hold in spherical geometry; for a
counterexample, let $X$ be a closed hemisphere and let $x, y$ be distinct points in the
opposite open hemisphere: then $[x \cup X] = [y \cup X] =$ the whole sphere. The
property does however hold locally on the sphere, as the induced geometry
on an open hemisphere is Euclidean. Spherical oriented matroids do not
have the anti-exchange property either, though they do have the following local
version (see also Las Vergnas, 1980).


PROPOSITION 12.93 Let $X$ be a subset and $x, y$ points of an oriented matroid
that are all contained in some cosurrounding set, and let $[x] \neq [y]$. If $y \in
[x \cup X] \setminus [X]$ then $x \notin [y \cup X]$.


Proof If $y \in [x \cup X] \setminus [X]$ then $y \notin [X]$, so there is some $S \subseteq X$ such
that $-y \cup x \cup S$ is surrounding. Suppose for contradiction that $x \in [y \cup X]$,
that is, that there is some $T \subseteq X$ such that $-x \cup y \cup T$ is surrounding. By
elimination on $x$, the set $y \cup -y \cup S \cup T$ is surrounding. But $-y \cup S \cup T$ is
not surrounding because $y \notin [X]$, and $y \cup S \cup T$ is not surrounding because it
is a non-empty subset of a cosurrounding set. Therefore $S \cup T$ is surrounding,
and so, by orthogonality, must be empty. But then $\{-x, y\}$ is surrounding, and
then $x \in [y]$, a contradiction. QED


10. Flat oriented matroids


The “flat” oriented matroids of Bland & Las Vergnas are far more commonly 
studied than spherical oriented matroids. The two concepts are equivalent, 
though some care has to be taken over expressing this equivalence formally. 

Flat oriented matroids do not require any involution, but the price for this is 
the use of signed sets. A signed subset of a set is an ordered pair $(X^+, X^-)$ of
its subsets such that $X^+$, $X^-$ are disjoint. Its support is the set $X^+ \cup X^-$ and
its opposite is the signed set $(X^-, X^+)$.


Oriented matroid axioms A flat oriented matroid is a finite set together with
a collection of its signed subsets, called signed circuits, satisfying:


(OM1) the opposite of a signed circuit is again a signed circuit;


(OM2) the set of supports of the signed circuits is the set of circuits of a
matroid;


(OM3) if $(X^+, X^-) \neq (Y^-, Y^+)$ are signed circuits and $x \in X^+ \cap Y^-$ then
there exists a signed circuit $(Z^+, Z^-)$ such that $Z^+ \subseteq (X^+ \cup Y^+) \setminus x$
and $Z^- \subseteq (X^- \cup Y^-) \setminus x$.


The underlying matroid of a flat oriented matroid has as its circuits the set of 
supports of the signed circuits. The oriented matroid is said to be an orientation 
of its underlying matroid. Not all matroids can be oriented; in particular, no 
projective space can be oriented. 

For an understanding of the definition, consider that one means of adding 
convexity to a matroid is to partition the complement of each of its hyperplanes 
into two open half-spaces, with a closed half-space then being the union of 
an open half-space with its hyperplane. A convex set is then defined as any 
set that is the intersection of a collection of closed half-spaces. Recall that 
the complements of hyperplanes in a matroid are precisely its cocircuits, so a 
definition of open half-spaces involves splitting each cocircuit in two. When the
cocircuits are considered as circuits of the dual matroid, this results eventually 
in the axioms above. 


10.1 Examples 


10.1.1 Realizable flat oriented matroids. Consider the matroid of
linear dependences on a finite subset $E$ of Euclidean space. Any circuit
$\{x_1, \dots, x_k\}$ of this matroid may be converted into a pair of opposite signed circuits as
follows. For any linear dependence $\lambda_1 x_1 + \dots + \lambda_k x_k = 0$, let $X^+ = \{x_i \mid
\lambda_i > 0\}$ and $X^- = \{x_i \mid \lambda_i < 0\}$. The collection of all signed circuits
constructed in this way is an orientation of the matroid of linear dependences.
An example is given in Fig. 12.22.


Figure 12.22. The signed circuits are $(\emptyset, a)$, $(a, \emptyset)$, $(b, c)$, $(c, b)$.


Now consider the matroid of affine dependences on E. Any circuit of this 
matroid may be converted into a pair of opposite signed circuits as follows. 
A Radon partition of the circuit (or indeed any other set) is any partition of it 
into two sets whose respective relative interiors intersect. Each circuit admits a 
unique Radon partition, which is then converted into a pair of opposite signed 
circuits. The collection of all signed circuits constructed in this way is an 
orientation of the matroid of affine dependences. (The signed circuits for the 
point set in Fig. 12.22 are (a, bc), (bc, a).) 


10.1.2 Flat oriented matroids from graphs. Consider a directed 
graph, which may have loops and multiple edges. A set of edges is a circuit if 
it is a circuit in the undirected graph obtained by forgetting edge orientation. 
Moving around a circuit, from one edge to the next, say that an edge is positive if 
it is oriented in the direction one is moving, and negative if it is oriented against 
this direction. This gives a partition of the circuit, which is then considered as a 
pair of opposite signed circuits. The collection of all signed circuits constructed 
in this way is an orientation of the graph matroid. An example is given in 
Fig. 12.23. 
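
The sign rule for graph circuits can be sketched in a few lines. The edge list below is hypothetical: the graph of Fig. 12.23 is not reproduced in the text, so the directed edges in EDGES were chosen only so that the resulting signed circuits agree with those listed in its caption. A circuit is supplied as a closed walk (a start vertex and a sequence of edge names); an edge is positive when it is traversed from tail to head and negative otherwise.

```python
# Hypothetical directed graph: edge name -> (tail, head); f is a loop.
EDGES = {"a": (1, 2), "b": (2, 1), "c": (2, 3), "d": (1, 3), "f": (3, 3)}


def signed_circuit(edges, start, walk):
    """Split the edges of a closed walk into (positive, negative) parts."""
    positive, negative, here = set(), set(), start
    for name in walk:
        tail, head = edges[name]
        if tail == here:        # traversed along its orientation
            positive.add(name)
            here = head
        elif head == here:      # traversed against its orientation
            negative.add(name)
            here = tail
        else:
            raise ValueError(f"edge {name!r} does not continue the walk")
    if here != start:
        raise ValueError("the walk is not closed")
    return frozenset(positive), frozenset(negative)


def opposite(signed):
    """The same circuit traversed in the other direction."""
    return signed[1], signed[0]


for start, walk in [(1, "ab"), (1, "acd"), (2, "bdc"), (3, "f")]:
    pos, neg_part = signed_circuit(EDGES, start, walk)
    print(sorted(pos), sorted(neg_part))
```

Each closed walk yields one signed circuit, and opposite gives the other member of the pair.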


Figure 12.23. The signed circuits are $(ab, \emptyset)$, $(\emptyset, ab)$, $(ac, d)$, $(d, ac)$, $(bd, c)$, $(c, bd)$,
$(f, \emptyset)$, $(\emptyset, f)$.


10.2 Reorientation


A morphism between involuted sets $E$, $F$ is any function $f : E \to F$ such
that, for all $x$, $f(-x) = -f(x)$. A symmetry of $E$ is any of its bijective
endomorphisms. Geometrically, this is symmetry of the crosspolytope.

Any symmetry on an involuted set $E$ extends in the obvious way to a bijection
$2^E \to 2^E$. For any such bijection there exists a symmetric set $S \subseteq E$ such
that the bijection is of the form $X \mapsto (X \setminus S) \cup (-X \cap S)$. Conversely, any
symmetric set gives such a bijection.


DEFINITION 12.94 A reorientation of a spherical oriented matroid is the im- 
age of all its surrounding sets under a symmetry. 


It is a simple exercise that the reorientation of a spherical oriented matroid 
is again a spherical oriented matroid. 

Reorientation helps explain the relationship between spherical and flat ori- 
ented matroids. A flat oriented matroid may be constructed from a spherical 
one as follows. Let $T$ be any transversal, and say that the signed circuits in $T$
are those signed sets of the form $(C \cap T, -C \cap T)$, for all circuits $C$. This gives
a flat oriented matroid, the underlying matroid of which is the sub-matroid on $T$
with respect to the underlying matroid of the spherical oriented matroid. This
construction is analogous to the construction of real projective space from the
sphere discussed earlier.

Conversely, beginning with a flat oriented matroid on $E$, make a copy $-E$
and say that a circuit in $E \cup -E$ is any set of the form $X^+ \cup -X^-$, for all
signed circuits $(X^+, X^-)$. This gives a spherical oriented matroid.
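
Both translations are short enough to state as code. The sketch below assumes string labels with a leading '-' for the copy -E, and the function names to_spherical and to_flat are illustrative rather than taken from the source. The example input is the list of signed circuits from the caption of Fig. 12.22.

```python
def neg(x):
    return x[1:] if x.startswith("-") else "-" + x


def neg_set(X):
    return frozenset(neg(x) for x in X)


def to_spherical(signed_circuits):
    """Flat -> spherical: each signed circuit (X+, X-) becomes X+ u -X-."""
    return {frozenset(pos) | neg_set(neg_part) for pos, neg_part in signed_circuits}


def to_flat(circuits, transversal):
    """Spherical -> flat: each circuit C becomes (C n T, -C n T)."""
    T = frozenset(transversal)
    return {(frozenset(C) & T, neg_set(C) & T) for C in circuits}


fs = frozenset
FIG_12_22 = {(fs(), fs("a")), (fs("a"), fs()), (fs("b"), fs("c")), (fs("c"), fs("b"))}

spherical = to_spherical(FIG_12_22)
print(sorted(sorted(C) for C in spherical))
print(to_flat(spherical, {"a", "b", "c"}) == FIG_12_22)
```

Choosing a different transversal, for instance {a, b, -c}, produces a different flat oriented matroid from the same spherical one; here the circuit {b, -c} then becomes a positive signed circuit.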

Despite the simplicity of these translations, some care has to be taken over 
the equivalence of the two concepts. To see this, consider the important notion 
of a flat oriented matroid being acyclic. A signed set $(X^+, X^-)$ is said to be
positive if $X^- = \emptyset$, and a flat oriented matroid is acyclic if it contains no
positive signed circuit. Every realizable flat oriented matroid, for example, is
acyclic. It cannot make any sense, however, to say that a spherical oriented
matroid is acyclic. Whether or not an acyclic flat oriented matroid is derived from
a spherical oriented matroid depends entirely on the choice of the transversal
(in fact, there is always some transversal that is acyclic).

The reorientation of a signed set (X*,X7) at a set S is the signed set 
((X*\S) U(X7~ NA S), (X7\S)U(X* N S)). Beginning with a flat oriented 
matroid and any subset of the ground set, the reorientation of all signed circuits 
at this set gives a set of signed circuits of another oriented matroid. The relation 
of one flat oriented matroid being a reorientation of another is an equivalence 
relation. Beginning with a spherical oriented matroid, the collection of all 
flat oriented matroids derived from all transversals is an equivalence class. 
Conversely, any two spherical oriented matroids equivalent under reorientation 
have the same equivalence class of flat oriented matroids. Therefore spherical
and flat oriented matroids are the same concept up to reorientation.

10.3 Oriented matroids and computational geometry


One application of flat oriented matroids is as a foundation for computational 
geometry. This is the approach taken by Knuth, 1991, who argued that working 
axiomatically in this way gives better algorithm design. Knuth worked with 
the following definition. 


CC-System Axioms A CC-system is a finite set together with a ternary relation
(write $xyz$ to mean that $(x, y, z)$ is in the relation) such that:


(CC1) $xyz \Rightarrow yzx$,

(CC2) $xyz \Rightarrow \neg xzy$,

(CC3) $xyz \vee yxz$,

(CC4) $zxy \wedge wzy \wedge wxz \Rightarrow wxy$,

(CC5) $zyv \wedge zyw \wedge zyx \wedge zvw \wedge zwx \Rightarrow zvx$.


The main intended models are as follows. Consider any finite subset of 
the plane that does not contain any three collinear points. For any three points 
x, y, z, consider the unique circle that contains these points on its circumference. 
Then let xyz if, when moving around the circumference in a clockwise direction, 
after hitting x one hits y before z. An example is given in Fig. 12.24. 




Figure 12.24. The ternary clockwise relation is abc, bca, cab, abd, bda, dab, acd, cda, dac,
bdc, dcb, cbd.
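
As a rough illustration of the intended model, the following Python sketch computes the clockwise relation from the sign of a determinant and checks the axioms (CC1) to (CC5) by brute force on a small point set. The function names and the sample coordinates are illustrative assumptions and are not taken from Knuth's text.

```python
from itertools import permutations

def clockwise(p, q, r):
    """True if the triangle p, q, r is traversed clockwise (negative determinant)."""
    det = (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])
    if det == 0:
        raise ValueError("collinear triples are excluded from CC-systems")
    return det < 0

def is_cc_system(n, rel):
    """Brute-force check of (CC1)-(CC5) for a ternary relation on range(n)."""
    idx = range(n)
    for x, y, z in permutations(idx, 3):
        if rel(x, y, z) and not rel(y, z, x):          # (CC1)
            return False
        if rel(x, y, z) and rel(x, z, y):              # (CC2)
            return False
        if not (rel(x, y, z) or rel(y, x, z)):         # (CC3)
            return False
    for w, x, y, z in permutations(idx, 4):            # (CC4)
        if rel(z, x, y) and rel(w, z, y) and rel(w, x, z) and not rel(w, x, y):
            return False
    for v, w, x, y, z in permutations(idx, 5):         # (CC5)
        if (rel(z, y, v) and rel(z, y, w) and rel(z, y, x)
                and rel(z, v, w) and rel(z, w, x) and not rel(z, v, x)):
            return False
    return True

# Hypothetical sample: five points with integer coordinates, no three collinear.
points = [(0, 0), (4, 1), (1, 5), (6, 6), (3, 2)]
rel = lambda x, y, z: clockwise(points[x], points[y], points[z])
print(is_cc_system(len(points), rel))
```

Integer coordinates keep the determinant exact, which sidesteps the decidability issue for irrational coordinates in this toy setting.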


CC-systems have the following flat oriented matroid structure. A hyperplane 
is any set that contains exactly two points. The complement of a hyperplane 
{x,y} is called a cocircuit, and may be partitioned into the two sets {z | xyz},
{z | xzy}. The resulting set of signed cocircuits satisfies the flat oriented 
matroid axioms. In fact, CC-systems are equivalent to rank 3 flat oriented 
matroids in which all sets of cardinality 3 are bases of the underlying matroid. 


11. Algebraic spatial models 


It seems by now to be fairly well understood what the basic mathematical 
structures involved in discrete topology are: graphs, finite (or locally finite) 
posets ($T_0$-spaces) and simplicial or cell-complexes. These, anyway, are those
structures employed routinely for representation of Euclidean topology in com- 
putational areas of application such as geographical information systems, spatial 
database theory, digital topology and image analysis. 

It has been argued here that these structures themselves may usefully be 
considered as instances of closure spaces of various types, and also that this 
approach facilitates unification with classical continuous topology. 

In extending topology to geometry, it seems that by far the most sophisticated 
finite models of the linear and convex structure of Euclidean space are oriented 
matroids. Finite geometry is a broad area of research, but much of it is not 
Euclidean in flavour and no theory seems to match that of oriented matroids in 
establishing combinatorial Euclidean geometry. 

Oriented matroid theory has some serious deficiencies, however. Arguably 
the most serious is that there is no general notion of morphism, and conse- 
quently no category. Even if there were, it is difficult to see how even basic 
constructions such as products could be obtained. Oriented matroids also have 
a quite severe lack of point-extensions, which is a serious impediment to any 
theory of refinement. 

The spherical version of oriented matroids has been presented here because 
it might be more conducive to a development of finite algebraic models of 
Euclidean geometry. The question is how far can the combinatorial method 
in Euclidean spatial representation be pushed? Must it stop at convexity and 
linearity? Must, for example, metrical structure always be expressed using R? 
Given that the research programmes of developing finite Euclidean geometry 
and topology have been largely successful, a logical progression would seem 
to be to try to express this finite structure algebraically. This would parallel 
modern classical geometry in placing algebra (vector spaces) first, topology 
and geometry being derived from the algebra. 

One would hope that an algebraic treatment might lead to a reasonable cate- 
gory of finite spatial models, the present lack of which has been stressed several 
times in this chapter. Any such category would be expected to at least have prod- 
ucts, as surely any true version of Euclidean geometry must ultimately involve 
the construction of higher-dimensional spaces as products of lower-dimensional 
ones. Classically, this is only possible algebraically. Any purely synthetic
approach to Euclidean geometry does not, it seems, have this construction. It is
only taking products of vector spaces that works. The combinatorial analogue 
of this is that matroids do not, apparently, have products. 

A more directly practical motivation is that algebraic spatial representation 
might facilitate better storage and computation of spatial objects. For example, 
it is probably fair to say that Knuth’s oriented matroid approach to computational
geometry has not yet caught on. Knuth himself regards oriented matroids
as a useful tool for computing the geometry of finite point sets in $\mathbb{R}^n$, and points
in an oriented matroid are (at least when doing computational geometry) to be
regarded as points in $\mathbb{R}^n$. The ternary clockwise relation is computed using
standard determinant computations on such points, and collinear
triples of points are explicitly ruled out of the axiom system precisely because
betweenness of points in $\mathbb{R}^n$ that may have irrational coordinates is not
decidable in finite time. The interesting point here is the idea of a background:
as in so much of spatial computation, $\mathbb{R}^n$ is the mathematical background in
terms of which spatial objects and locations are ultimately considered. Oriented
matroids, though they are alternative finite backgrounds, are in general
far too large to feasibly be stored. It is far simpler to store a point set as a
set of coordinates in $\mathbb{R}^n$ and compute their geometry algebraically (e.g. by
calculating determinants of matrices), rather than store explicitly a basic set of 
their spatial relations (the oriented matroid). It might be that finite coordinate 
systems for computational geometry, if such structures can exist, would retain 
the undoubted merits of Knuth’s approach while avoiding explicit storage of 
too much information. 

Our specification for a finite algebraic Euclidean geometry is any well- 
understood finite algebra that has a natural oriented matroid structure. There 
is no shortage of these in 1-dimension. Consider any ring $\mathbb{Z}_{2n}$: a natural involution
is $x \mapsto x + n$, and it is fairly clear that the oriented matroid of the ring
considered as a point set on the circle may be derived from its algebra. 

The graph pictured is a natural cyclic order (on a cyclic group). The isomorphisms
of this graph are the rotations $x \mapsto m + x$, and the anti-isomorphisms
(i.e. order-reversing) are the reflections $x \mapsto m - x$. The rotations together
with the reflections form of course a (Coxeter) group, namely the dihedral
group of order $4n$. $\mathbb{Z}_{2n}$ is then simply the subgroup of rotations.

The aim now is to extend this to higher dimensions. Recall that convexity
in Euclidean geometry is derived algebraically from the fact that the field $\mathbb{R}$ is
ordered. Spherical orders have convexity too, one that is locally (i.e. on open
hemispheres) Euclidean, and this is part of the theory of spherical oriented
matroids. One might therefore hope that the modules $(\mathbb{Z}_{2n})^k$ have spherical
convexity, to be derived from the cyclic structure of the ring of scalars $\mathbb{Z}_{2n}$ in
a way analogous to the construction of classical Euclidean convexity out of the
linear order on $\mathbb{R}$. The geometry of these modules (as of finite vector spaces)
is, however, not spherical but toroidal: the product of two circles is not a sphere
but a torus. We do not see, moreover, how a spherical geometry may naturally
be constructed out of a toroidal one.

Figure 12.25. $\mathbb{Z}_8$ considered as points on a circle (think of a clockface).

On the other hand, the product of two discs (filled-in circles) is spherical.
(In fact it is a simple result that the product of two Euclidean convex sets is
again convex.) Might there be some way of regarding $\mathbb{Z}_{2n}$ as a disc rather than
a circle and deriving the spherical structure of products accordingly? Consider
the simplest case, that of $\mathbb{Z}_2$ considered as an interval (a 1-dimensional disc).
The product of $k$ intervals is the $k$-cube, whose vertices certainly sit on the
surface of the $(k - 1)$-sphere and therefore have an oriented matroid structure.
But this structure seems to resist all attempts to understand it! It is astounding
that one of the most regular, symmetric objects (a platonic solid) in mathematics
has a natural Euclidean geometry that in higher dimensions can only, it seems,
be computed by brute force. Aichholzer and Aurenhammer, 1996, for example,
estimate that the running time for determining the number of subsets of vertices
of the 9-cube that generate hyperplanes in $\mathbb{R}^9$ is about 35 years.

Problem Find a combinatorial description of the geometry of the n-cube. 

Any such description would surely have some bearing on the open: 

Las Vergnas Cube Conjecture The linear matroid on the vertices of the n-cube 
has, up to reorientation, exactly one orientation. 

What is doubly annoying here is that any hypergraph (i.e. a finite set together
with any collection of its subsets) has a natural Euclidean geometry, a spherical
oriented matroid structure, as the vertices of the $n$-cube correspond naturally
to subsets of a set of cardinality $n$. Whether or not this geometry is of interest
or has meaning in any context one cannot really say, since there is no existing
good description of it.

Figure 12.26. The 3-cube, whose spherical oriented matroid circuits are
$\{(-1,-1,-1), (1,1,-1), (-1,1,1), (1,-1,1)\}$ and $\{(1,1,1), (-1,-1,1), (1,-1,-1), (-1,1,-1)\}$.
Higher dimensional cubes have many more circuits, which seem to resist combinatorial
characterization.

To compound the issue, the spherical oriented matroid on the vertices of 
the dual of the n-cube, namely the n-crosspolytope, is utterly trivial: it is the 
oriented matroid in which only the symmetric sets are surrounding. (The dual 
of this oriented matroid is that in which all sets are surrounding.) 


11.1 Symmetry 


One would expect any spherical formulation of space to have a natural sym- 
metry, akin to that it inherits when considered as embedded on the Euclidean 
sphere. If this formulation were algebraic, one might hope that the symmetry 
group and the algebra would be closely related. The problem is, of course, that 
the symmetry group of the Euclidean n-sphere has, in dimensions higher than 
one, essentially very few n-dimensional finite subgroups, certainly not enough 
to re-generate the group. This is an apparent violation of the (generalized) 
Correspondence Principle, and this lack of finite symmetry is perhaps the real 
challenge to the finitist programme of combinatorial spatial representation. 

A way forward, however, might emerge out of a recent approach to matroids 
in A. V. Borovik and White, 2003 that is wholly algebraic and based entirely 
on symmetry. What follows is a brief presentation of this. 


11.2 The Gale order 


For any finite set $E$ of cardinality $n$ and for any $0 < m < n$, let $E_m$ denote
the set of all subsets of cardinality $m$.

For any linear order $\le$ on $E$ and any $X, Y \in E_m$, whose respective inherited
orderings are $x_1 < \cdots < x_m$ and $y_1 < \cdots < y_m$, let $X \le_m Y$ if, for each $i$,
$x_i \le y_i$. The order $\le_m$ is clearly a partial order, and is called the Gale order
on $E_m$ determined by $\le$.
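
The comparison can be sketched in a few lines of Python. The helper name `gale_leq` and the encoding of the linear order as a dictionary of ranks are assumptions made for illustration only.

```python
def gale_leq(X, Y, rank):
    """X <=_m Y in the Gale order determined by `rank`.

    `rank` maps each element of the ground set to its position in the chosen
    linear order; X and Y must be subsets of the same cardinality m.
    """
    if len(X) != len(Y):
        raise ValueError("the Gale order compares sets of equal cardinality")
    xs = sorted(rank[x] for x in X)
    ys = sorted(rank[y] for y in Y)
    return all(a <= b for a, b in zip(xs, ys))

# Ground set {a, b, c, d} with the linear order a < b < c < d.
rank = {"a": 0, "b": 1, "c": 2, "d": 3}
print(gale_leq({"a", "c"}, {"b", "d"}, rank))   # comparable pair
print(gale_leq({"a", "d"}, {"b", "c"}, rank))   # incomparable pair, one direction
print(gale_leq({"b", "c"}, {"a", "d"}, rank))   # incomparable pair, other direction
```

The last two calls show why the order is only partial: neither of {a, d} and {b, c} lies below the other.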

It is easy to prove that the Gale order is always a lattice ordering. Matroids 
can be defined in terms of the Gale order; the following result is in A. V. Borovik 
and White, 2003. 


THEOREM 12.95 Let $\mathcal{B}$ be a non-empty subset of $E_m$. Then $\mathcal{B}$ is the collection
of bases of a matroid if and only if for each linear order on $E$ the partial order
on $\mathcal{B}$ induced by the Gale order has a top element.


Proof (Only if:) Assume for contradiction that for some linear order there
exist $B \ne C \in \mathcal{B}$ that are both maximal in the induced Gale order. By the
basis exchange axiom, for any $x \in B \setminus C$ there is some $y \in C \setminus B$ such that
$(B \setminus x) \cup y \in \mathcal{B}$. Then $x > y$, otherwise $B$ would not be maximal. By a similar
argument, for any $y \in C \setminus B$ there is some $x \in B \setminus C$ such that $y > x$, clearly
giving a contradiction.

(If:) Let $B, C \in \mathcal{B}$ and let $x \in B \setminus C$. If $B \setminus C = \{x\}$ then $C \setminus B = \{y\}$ for some $y$, and
$(B \setminus x) \cup y = C$. If not, then $B, C$ are incomparable in the Gale order of the
linear order $x < \cdots < c_1 < \cdots < c_j < b_1 < \cdots < b_k$, where the $c_i$'s are the
elements of $C \setminus B$ and the $b_i$'s are the elements of $B \setminus x$. There is some element
of $\mathcal{B}$ above both, and this can only be $\{b_1, \ldots, b_k\} \cup c_i$ for some $c_i$. QED
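
For small ground sets the theorem can be checked exhaustively. The sketch below is a brute-force illustration and not part of the proof. All function names are assumptions. It compares the Gale-order criterion with the basis exchange axiom for families of 2-element subsets of a 4-element set.

```python
from itertools import combinations, permutations

def gale_leq(X, Y, rank):
    """X <=_m Y: compare the sorted ranks position by position."""
    return all(a <= b for a, b in zip(sorted(rank[x] for x in X),
                                      sorted(rank[y] for y in Y)))

def has_top_for_every_order(ground, bases):
    """True if, for each linear order on the ground set, some basis is above all others."""
    for order in permutations(ground):
        rank = {e: k for k, e in enumerate(order)}
        if not any(all(gale_leq(C, B, rank) for C in bases) for B in bases):
            return False
    return True

def satisfies_basis_exchange(bases):
    """The usual basis exchange axiom for a non-empty family of equicardinal sets."""
    family = {frozenset(B) for B in bases}
    return all(any((B - {x}) | {y} in family for y in C - B)
               for B in family for C in family for x in B - C)

ground = (1, 2, 3, 4)
uniform = list(combinations(ground, 2))   # bases of the uniform matroid of rank 2
not_matroid = [(1, 2), (3, 4)]            # violates basis exchange

print(has_top_for_every_order(ground, uniform))
print(has_top_for_every_order(ground, not_matroid))
```

For the second family the order 1 < 3 < 4 < 2 already leaves {1, 2} and {3, 4} incomparable, so there is no top element.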


11.3 The Greedy algorithm 


Consider a matroid together with a linear order on its ground set. The order
typically is induced by a mapping of the ground set into the reals; for example, 
a cost function. The following is an efficient means of finding the maximum 
basis in the induced Gale order on the set of bases: 


Greedy algorithm 


Input: A matroid with a linear order on its ground set
$I := \emptyset$
loop
    Find the largest $x \notin I$ such that $I \cup x$ is independent
    $I := I \cup x$
    Stop when there is no $x \notin I$ such that $I \cup x$ is independent
end loop
Return $I$


THEOREM 12.96 The Greedy algorithm always returns the maximum basis 
in the induced Gale order on the set of bases. 
Proof Every independent set is contained in a basis, so the algorithm returns
a basis. Let the return be $\{x_1, \ldots, x_m\}$, where $x_1 > \cdots > x_m$. As all subsets
of a basis are independent, each $x_i$ was chosen on the $i$th loop. Suppose for
contradiction that there is a basis $y_1 > \cdots > y_m$ that is not below the returned
basis in the Gale order. Let $i$ be the least index such that $y_i > x_i$. Now
$\{y_1, \ldots, y_i\}$ and $\{x_1, \ldots, x_{i-1}\}$ are both independent, so there is some
$y_j \notin \{x_1, \ldots, x_{i-1}\}$ with $j \le i$ such that $\{x_1, \ldots, x_{i-1}, y_j\}$ is independent.
But $y_j \ge y_i > x_i$, contradicting that $x_i$ was chosen on the $i$th loop. QED


The Greedy algorithm is greedy because it always chooses the largest element 
available on each loop. It is efficient because this is never a bad choice. Matroids
can be defined in terms of the Greedy algorithm; the definition is similar to that 
in terms of the Gale order. 
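
A minimal Python sketch of the Greedy algorithm follows, run on a graphic matroid (independent sets are forests) as a hypothetical example. The names `greedy_basis` and `is_forest` and the edge weights are illustrative assumptions. The sketch scans the ground set once in decreasing order. For a matroid this agrees with the loop above, since an element that cannot be added to $I$ can never be added to a larger independent set.

```python
from itertools import combinations

def is_forest(edges):
    """Independence oracle of a graphic matroid: the edge set contains no cycle."""
    parent = {}
    def find(v):
        parent.setdefault(v, v)
        while parent[v] != v:
            parent[v] = parent[parent[v]]   # path halving
            v = parent[v]
        return v
    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:
            return False
        parent[ru] = rv
    return True

def greedy_basis(ground, is_independent, weight):
    """Scan from largest to smallest, keeping x whenever I + x stays independent."""
    basis = []
    for x in sorted(ground, key=weight, reverse=True):
        if is_independent(basis + [x]):
            basis.append(x)
    return basis

def gale_leq(X, Y, weight):
    """Gale order for the linear order induced by distinct weights."""
    return all(a <= b for a, b in zip(sorted(map(weight, X)), sorted(map(weight, Y))))

# Complete graph on vertices 0..3; distinct weights induce the linear order on edges.
weights = {(0, 1): 6, (1, 2): 5, (0, 2): 4, (1, 3): 3, (2, 3): 2, (0, 3): 1}
best = greedy_basis(weights, is_forest, weights.get)
bases = [B for B in combinations(weights, 3) if is_forest(B)]
print(best, all(gale_leq(B, best, weights.get) for B in bases))
```

Here the three heaviest edges form a triangle, so the algorithm must skip the edge of weight 4. The final check confirms that every spanning tree still lies below the returned basis in the Gale order.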


11.4 The Symmetric groups 


A linear order on a set may be interpreted as a permutation (a.k.a. bijection, 
isomorphism) of its elements, and there is a one-to-one correspondence between 
the set of all linear orders on a finite set (say, of cardinality n) and the set of its 
permutations. To see this, choose any linear order $x_1 < \cdots < x_n$ and associate
this with the identity function. Any other linear order $y_1 < \cdots < y_n$ is then
associated with the permutation $x_i \mapsto y_i$.

The characterization of a matroid in terms of the Gale order involves quan- 
tification over all linear orders on a set. The set of linear orders, interpreted as 
the set of permutations, forms a group, with the group operation being function 
composition. This is the symmetric group on $n$ elements, $\mathrm{Sym}_n$, which is the
isomorphism group of a set of cardinality n. The theory of Coxeter matroids 
develops matroid theory in terms of the symmetric groups alone, the Gale order 
being in this context the Bruhat order, which has a natural algebraic and geo- 
metric meaning. The following is a brief account of the Bruhat order, without 
proofs; a full account may be found in A. V. Borovik and White, 2003, Brown, 
1989, Ronan, 1989. 

Consider $\mathrm{Sym}_n$ explicitly as the isomorphism group of the set $[n] = \{1, \ldots, n\}$.
The group operation is denoted $\circ$. For $1 \le i < n$, let $s_i$ denote the
isomorphism that swaps $i, i + 1$ and leaves all other elements fixed. The set
$S = \{s_1, \ldots, s_{n-1}\}$ then generates $\mathrm{Sym}_n$. The corresponding Cayley graph
has as vertices the elements of $\mathrm{Sym}_n$, and there is an edge between vertices
$u, v$ if there is some generator $s_i$ such that $v = u \circ s_i$. This graph has no loops,
and is undirected since $v = u \circ s_i \Rightarrow u\ (= u \circ s_i \circ s_i) = v \circ s_i$. There is at
most one $i$ such that $v = u \circ s_i$, and so the edge between $u, v$ may be labelled
unambiguously by $i$. Each vertex therefore has degree $n - 1$. Another name for
this graph is the $n$-permutahedron. The graph is highly symmetric, as each left
group action is a graph isomorphism: for any $w$, there is an ($i$-)edge between
$u, v$ if and only if there is an ($i$-)edge between $w \circ u, w \circ v$. An example is
shown in Fig. 12.27.
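
The construction can be sketched in Python as follows. Permutations are stored as tuples listing the images of 1, ..., n. This encoding and the function names are assumptions made for illustration.

```python
from itertools import permutations

def compose(u, v):
    """(u o v)(k) = u(v(k)) for permutations of 1..n stored as tuples of images."""
    return tuple(u[v[k] - 1] for k in range(len(v)))

def generator(i, n):
    """The adjacent transposition s_i, swapping i and i + 1."""
    s = list(range(1, n + 1))
    s[i - 1], s[i] = s[i], s[i - 1]
    return tuple(s)

def cayley_graph(n):
    """Map each undirected edge {u, u o s_i} to its label i."""
    edges = {}
    for u in permutations(range(1, n + 1)):
        for i in range(1, n):
            e = frozenset((u, compose(u, generator(i, n))))
            if edges.setdefault(e, i) != i:
                raise AssertionError("an edge received two different labels")
    return edges

graph = cayley_graph(4)
degree = {}
for e in graph:
    for u in e:
        degree[u] = degree.get(u, 0) + 1
print(len(graph), set(degree.values()))
```

Right multiplication by $s_i$ swaps the entries in positions $i$ and $i + 1$ of the tuple, while left multiplication by $w$ relabels the entries, which is why it preserves labelled edges.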

This all has the following geometric interpretation. $\mathrm{Sym}_n$ is the symmetry
group of the $n$-simplex, the elements of $[n]$ being interpreted as the vertices
of the $n$-simplex. The elements of $\mathrm{Sym}_n$ may themselves be interpreted as
simplices on the surface of the n-simplex, namely the simplices obtained by 
taking the barycentric subdivision of each facet of the n-simplex. Each such 
simplex is an (n — 1)-simplex, and defining any two such simplices to be 
adjacent if they have a common (n — 2)-face gives the n-permutahedron. 


Figure 12.27. The Cayley graph of $\mathrm{Sym}_3$, obtained by associating the linear order 123 with
the identity.


11.4.1 The Bruhat order. A reflection in $\mathrm{Sym}_n$ is any conjugate of
any element of the generating set, that is, any element of the form $w \circ s \circ w^{-1}$,
where $w \in \mathrm{Sym}_n$ and $s \in S$. Concretely, the reflections are the isomorphisms
that swap $i, j$ for some $i \ne j$ and leave all other elements fixed. Geometrically,
the reflections are the reflections of the $n$-simplex.

The wall of a reflection $r$ is the set of all edges $(u, v)$ in the Cayley graph
such that $r \circ u = v$ (equivalently $r \circ v = u$, since $r \circ r = \mathrm{id}$). Each edge is
contained in exactly one wall. Each wall cuts the Cayley graph into exactly
two components, called the hemispheres (or roots, or half-spaces) of the reflection. Concretely,
where $r$ is the reflection that swaps $i, j$, one of its hemispheres is the set of all
linear orders in which $i < j$ and the other is the set of all linear orders in
which $j < i$. Each reflection maps one hemisphere bijectively onto the other.
Geometrically, where elements of $\mathrm{Sym}_n$ are considered as simplices in the
barycentric subdivision of the $n$-simplex, a half-space of a reflection is the set
of all simplices on the same side of the reflecting hyperplane. An example is
shown in Fig. 12.28.
Figure 12.28. The triangle, together with planes of reflection and induced barycentric
subdivision of facets.


A folding is “half” a reflection: one half-space is fixed while the other is
reflected onto it. Formally, where $r$ is a reflection with half-spaces $H^-, H^+$,
the corresponding folding of $H^+$ onto $H^-$ is the map $\mathrm{Sym}_n \to \mathrm{Sym}_n$, $x \mapsto x$ if
$x \in H^-$, $x \mapsto r \circ x$ if $x \in H^+$. Each reflection has precisely two corresponding
foldings. Every folding is a graph morphism.


DEFINITION 12.97 For any element $w$, the Bruhat order on $\mathrm{Sym}_n$ with bottom
element $w$ is defined as: $u \le_w v$ if there exist foldings $f_1, \ldots, f_k$, each
towards a half-space that contains $w$, such that $u = f_1 \circ \cdots \circ f_k(v)$.
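
The definition translates directly into a small search. In the sketch below a permutation is stored as a word listing the elements of $[n]$ in increasing order of the associated linear order, so the half-spaces of the reflection swapping $i, j$ are distinguished by whether $i$ occurs before $j$. This encoding and all function names are illustrative assumptions.

```python
from itertools import combinations, permutations

def reflect(i, j, x):
    """Left-multiply x by the reflection swapping i and j (swap the two values in the word)."""
    return tuple(j if a == i else i if a == j else a for a in x)

def same_side(i, j, x, w):
    """True if x and w lie in the same half-space of the reflection swapping i and j."""
    return (x.index(i) < x.index(j)) == (w.index(i) < w.index(j))

def fold_towards(i, j, w, x):
    """The folding for the reflection (i j) onto the half-space that contains w."""
    return x if same_side(i, j, x, w) else reflect(i, j, x)

def bruhat_below(v, w, n):
    """All u with u <=_w v: close {v} under the foldings towards w."""
    seen, stack = {v}, [v]
    while stack:
        x = stack.pop()
        for i, j in combinations(range(1, n + 1), 2):
            y = fold_towards(i, j, w, x)
            if y not in seen:
                seen.add(y)
                stack.append(y)
    return seen

identity = (1, 2, 3)
for v in permutations(identity):
    print(v, sorted(bruhat_below(v, identity, 3)))
```

With the identity as bottom element, the reversed word 321 lies above every element, and the two words with a single inversion lie above the identity only.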


The Bruhat order is clearly a partial order. For any $K \subseteq \{1, \ldots, n - 1\}$, a
$K$-residue is any component of the graph obtained from the Cayley graph by
removing all edges not labelled by some $i \in K$. Equivalently, a $K$-residue is,
in $\mathrm{Sym}_n$, any left coset of the subgroup generated by the set $\{s_i \mid i \in K\}$.
It happens that, for any $w$, any residue has both a top and bottom element in
its inherited Bruhat order. Moreover, for any $K$-residues $R, S$ and any Bruhat
order $\le_w$, the following are equivalent: the bottom element of $R$ $\le_w$ the bottom
element of $S$; the top element of $R$ $\le_w$ the top element of $S$; some element
of $R$ $\le_w$ some element of $S$. Setting $R \le_w S$ if these properties hold gives a
partial order, called the Bruhat order for $w$ on the set of $K$-residues.


DEFINITION 12.98 A Coxeter matroid for a symmetric group is a collection
$\mathcal{B}$ of $K$-residues such that, for any $w \in \mathrm{Sym}_n$, $\mathcal{B}$ has a top element in its
partial order induced by the Bruhat order $\le_w$.


This is in fact a generalization of the definition of a matroid. Matroids arise
from the special case that $K = \{1, \ldots, n - 1\} \setminus i$. In this case, there is a
bijection between the $K$-residues and the subsets of $\{1, \ldots, n\}$ of cardinality
$i$. The Bruhat order and the Gale order then coincide.

In fact, Coxeter matroids are more general still. The above definitions, of
Cayley graph, reflection, wall, half-space, folding, Bruhat order, etc., work for
any Coxeter group.

The following exercise might help with some of the above: 


In the Boolean algebra over n elements, say that two maximal flags (a.k.a. 
maximal chains) are adjacent if they differ at exactly one element. Show that 
this gives the n-permutahedron. Say, moreover, that two adjacent maximal 
flags are $i$-adjacent if the element at which they differ has height $i$. Show that
this is $i$-adjacency in the permutahedron.
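
Before attempting a proof, the claim can be checked by machine for small $n$. The following Python sketch is a computational sanity check rather than a solution. It identifies a maximal flag with the word in which its elements are added, and all names are assumptions made for illustration.

```python
from itertools import combinations, permutations

def flag(u):
    """The maximal flag of the word u, omitting the empty set and the full set."""
    return tuple(frozenset(u[:k]) for k in range(1, len(u)))

def flag_adjacency(n):
    """Map each pair of adjacent maximal flags to the height at which they differ."""
    adjacent = {}
    for u, v in combinations(permutations(range(1, n + 1)), 2):
        heights = [k + 1 for k, (a, b) in enumerate(zip(flag(u), flag(v))) if a != b]
        if len(heights) == 1:
            adjacent[frozenset((u, v))] = heights[0]
    return adjacent

def cayley_adjacency(n):
    """Labelled edges of the n-permutahedron: swap positions i and i + 1 of a word."""
    adjacent = {}
    for u in permutations(range(1, n + 1)):
        for i in range(1, n):
            v = list(u)
            v[i - 1], v[i] = v[i], v[i - 1]
            adjacent[frozenset((u, tuple(v)))] = i
    return adjacent

for n in (2, 3, 4):
    print(n, flag_adjacency(n) == cayley_adjacency(n))
```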
1. From the relational database model to the constraint
database model 


The constraint database model can be seen as a generalization of the classical 
relational database model that was introduced by Codd in the 1970s to deal with 
the management of alpha-numerical data, typically in business applications 
(Codd, 1970). A relational database can be viewed as a finite collection of 
tables or relations that each contain a finite number of tuples. 

Fig. 13.1 shows an instance of a relational database that contains the two 
relations Beer and Pub. This database contains tourist information about beers 
and the pubs where they are served. It also contains the location of the pubs, 
given in (x, y)-coordinates on some tourist map. Each relation contains a finite 
number of tuples. A relational database is usually modeled following a database 
schema. A schema contains information on the relation names and on the names
of the attributes appearing in each relation. In this example, the attributes of Beer
are Name, Pub, City and Postal code. The complete schema of the relational 
database of Fig. 13.1 could be written as Beer(Name, Pub, City, Postal code), 
Pub(Pub, x, y). 
Beer
Name         Pub          City        Postal code
Duvel        De Muze      Antwerpen   2000
Hoegaarden   Villicus     Hasselt     3500
Geuze        La Bécasse   Brussel     1000

Pub
Pub          x      y
De Muze      16     10
Villicus     16.1   14
La Bécasse   10.4   12.3

Figure 13.1. An example of a relational database consisting of the two relations Beer and Pub.


The x and y attributes of the relation Pub have a geometric or geographic 
interpretation. But values of these attributes can simply be stored as numbers, 
as is usually done in business databases. A tourist could consult this database 
to find out the locations of pubs where his/her preferred beers are served. First- 
order logic based languages (and their commercial versions, such as SQL) 
are used in the relational database model, to formulate queries like this. The 
vocabulary of these logics typically contains the relation names appearing in 
the schema of the input database. For instance, the first-order formula

$\varphi(x, y) = \exists p\, \exists c\, \exists p'\, (\mathit{Beer}(\mathrm{Westvleteren}, p, c, p') \wedge \mathit{Pub}(p, x, y))$

when interpreted over the database of Fig. 13.1, defines the (x, y)-coordinates 
of the location of the pubs where they serve my favorite beer. 
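
As a concrete illustration, the query can be run against the database of Fig. 13.1 with Python's standard `sqlite3` module. The column names and the helper `pub_locations` are assumptions made for this sketch. The existential quantifiers of the formula correspond to the columns that the SQL query projects away.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Beer (name TEXT, pub TEXT, city TEXT, postal_code TEXT);
    CREATE TABLE Pub  (pub TEXT, x REAL, y REAL);
""")
conn.executemany("INSERT INTO Beer VALUES (?, ?, ?, ?)", [
    ("Duvel", "De Muze", "Antwerpen", "2000"),
    ("Hoegaarden", "Villicus", "Hasselt", "3500"),
    ("Geuze", "La Bécasse", "Brussel", "1000"),
])
conn.executemany("INSERT INTO Pub VALUES (?, ?, ?)", [
    ("De Muze", 16, 10),
    ("Villicus", 16.1, 14),
    ("La Bécasse", 10.4, 12.3),
])

def pub_locations(beer):
    """Coordinates (x, y) of the pubs that serve the given beer."""
    rows = conn.execute(
        "SELECT DISTINCT Pub.x, Pub.y FROM Beer JOIN Pub ON Beer.pub = Pub.pub "
        "WHERE Beer.name = ?", (beer,))
    return rows.fetchall()

print(pub_locations("Westvleteren"))
print(pub_locations("Duvel"))
```

On this particular instance the Westvleteren query returns the empty relation, since no tuple of Beer mentions that beer. The same query for Duvel returns the location of De Muze.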

But a tourist is usually also given more explicit geographic information, e.g., 
in the form of maps such as the one depicted in Fig. 13.2 and he/she typically 
wants to ask questions that combine spatial and alpha-numeric information, 
such as “Where in Flanders, not too far from the river Scheldt, can I drink a 
Duvel?” 

In the relational database model, it is difficult to support queries like this one. 
Unlike the locations of pubs, the locations of rivers or regions would require 
the storage of infinitely many x- and y-coordinates of points. Storing infinitely 
many tuples is not possible and in computer science it is customary to find finite 
representations of even infinite sets or objects. 

In the 1980s, extensions of the relational model were proposed with special- 
purpose data types and operators. Data types like “polyline” and “polygon” 
were introduced to support, e.g., the storage of rivers and regions. Ad-hoc
operations like intersection of polygons were added to popular query languages 
such as SQL. Since then, spatial database theory and technology have developed
towards more sophisticated data models and more elegant query formalisms
supported by, for example, appropriate indexing techniques. For an overview of
the developments in spatial databases in the last two decades, we refer to Rigaux
et al., 2000.

Figure 13.2. Spatial information map of Belgium.

Looking again at the polylines and polygons in Fig. 13.2, we may remark 
that there are other finite ways to store them, besides the indirect method of 
storing their corner points. Indeed, each line segment can be described by 
linear equations (equalities and inequalities). Moreover, polygonal figures can 
be described by combinations of linear inequalities. This description is more 
explicit than listing the corner points. If we agree that the combinations of 
linear equations may appear in the tuples of the relations of a database under 
a geometric attribute name, the spatial information displayed on the map of 
Belgium, which could be categorized into region, city, and river information, 
could be captured in a database with the three relations Regions, Cities, Rivers. 
Each of these relations has Name and Geometry as attributes, where the latter 
can be viewed as having an x- and a y-component. Name is a traditional
alpha-numeric attribute and Geometry has a spatial or geometric interpretation. Of
course we could include more thematic information, e.g., we could add to the
Cities relation the number of inhabitants.

The database instance with this schema, corresponding to the map shown in
Fig. 13.2, is given in Fig. 13.3.
Cities
Name        Geometry(x, y)
Antwerp     $(x = 10) \wedge (y = 16)$
Bastogne    $(x = 19) \wedge (y = 6)$
Bruges      $(x = 5) \wedge (y = 16)$
Brussels    $(x = 10.5) \wedge (y = 12.5)$
Charleroi   $(x = 10) \wedge (y = 8)$
Hasselt     $(x = 16) \wedge (y = 14)$
Liège       $(x = 17) \wedge (y = 11)$

Rivers
Name        Geometry(x, y)
Meuse       $((y \le 17) \wedge (5x - y = 78) \wedge (y \ge 12)) \vee$
            $((y \le 12) \wedge (x - y = 6) \wedge (y \ge 11)) \vee$
            $((y \le 11) \wedge (x - 2y = -5) \wedge (y \ge 9)) \vee$
            $((y \le 9) \wedge (x = 13) \wedge (y \ge 8))$
Scheldt     $((y \le 17) \wedge (x + y = 26) \wedge (y \ge 16)) \vee$
            $((y \le 16) \wedge (2x - y = 4) \wedge (y \ge 14)) \vee$
            $((x \le 9) \wedge (x \ge 7) \wedge (y = 14)) \vee$
            $((y \le 14) \wedge (-3x + 2y = 7) \wedge (y \ge 11)) \vee$
            $((y \le 11) \wedge (2x + y = 21) \wedge (y \ge 9))$

Regions
Name             Geometry(x, y)
Brussels         $(y \le 13) \wedge (x \le 11) \wedge (y \ge 12) \wedge (x \ge 10)$
Flanders         $(y \le 17) \wedge (5x - y \le 78) \wedge (x - 14y \le -150) \wedge$
                 $(x + y \ge 45) \wedge (3x - 4y \ge -53) \wedge (\neg((y \le 13) \wedge$
                 $(x \le 11) \wedge (y \ge 12) \wedge (x \ge 10)))$
Walloon Region   $((x - 14y \ge -150) \wedge (y \le 12) \wedge (19x + 7y \le 375) \wedge$
                 $(x - 2y \le 15) \wedge (5x + 4y \ge 89) \wedge (x \ge 13)) \vee$
                 $((-x + 3y \ge 5) \wedge (x + y \ge 45) \wedge$

Figure 13.3. Representation of the spatial database of Belgium shown in Fig. 13.2.
The geometric components of the relations in Fig. 13.3 are described using
linear equalities, linear inequalities and Boolean combinations thereof, i.e.,
using $\wedge$ (conjunction), $\vee$ (disjunction) and $\neg$ (negation). Figures that can be
described in this way are sometimes referred to as semi-linear sets.
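
The point of such a representation is that an infinite point set is stored as a finite formula and membership reduces to evaluating that formula. The Python sketch below illustrates this with two of the geometries of Fig. 13.3, reading the inequalities in the figure as non-strict. That reading, the function names and the use of exact rationals are assumptions of this sketch.

```python
from fractions import Fraction as F

def brussels_region(x, y):
    """Defining formula of the region Brussels (a rectangle)."""
    return y <= 13 and x <= 11 and y >= 12 and x >= 10

def scheldt(x, y):
    """Defining formula of the river Scheldt (a union of five line segments)."""
    return ((y <= 17 and x + y == 26 and y >= 16) or
            (y <= 16 and 2 * x - y == 4 and y >= 14) or
            (7 <= x <= 9 and y == 14) or
            (y <= 14 and -3 * x + 2 * y == 7 and y >= 11) or
            (y <= 11 and 2 * x + y == 21 and y >= 9))

regions = {"Brussels": brussels_region}
rivers = {"Scheldt": scheldt}
cities = {
    "Antwerp": (F(10), F(16)),
    "Brussels": (F(21, 2), F(25, 2)),
    "Hasselt": (F(16), F(14)),
}

def cities_on(river):
    """Names of the stored cities whose point satisfies the river's formula."""
    return sorted(name for name, (x, y) in cities.items() if rivers[river](x, y))

print(cities_on("Scheldt"))
print(regions["Brussels"](*cities["Brussels"]))
```

Exact rational arithmetic is used because the equality constraints would be fragile under floating-point rounding.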

One of the most important application areas of spatial databases is Geographic
Information Systems (GIS), where in most cases polygonal-shaped
geometric figures are considered. In most cases this data resides in the
two-dimensional plane or in three-dimensional space (Rigaux et al., 2000).
Indeed, in GIS, information is mostly linear in nature, but in other applications,
like CAD-CAM or medical imaging, we can find spatial figures that are not
linear. Using polynomial equalities and inequalities rather than just linear ones
gives us wider modeling capabilities. Fig. 13.4 gives an example of a figure
in the plane that can be described by the following combination of polynomial
(in)equalities:


$(x^2/25 + y^2/16 \le 1) \wedge (x^2 + 4x + y^2 - 2y \ge -4)$
$\wedge\ (x^2 - 4x + y^2 - 2y \ge -4) \wedge ((x^2 + y^2 - 2y \ne 8) \vee (y > -1))$.


This figure is described by a formula containing two variables, namely $x$ and
$y$, representing the coordinates of points in $\mathbb{R}^2$.


Figure 13.4. An example of a semi-algebraic set in $\mathbb{R}^2$.


Figures that can be modeled by polynomial inequalities are known, in math- 
ematics, as semi-algebraic sets and their geometric and topological properties 
are well-studied in real algebraic geometry (Bochnak et al., 1998). 
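
Membership in such a set is again decided by evaluating the defining formula. The following Python sketch does this for the set of Fig. 13.4, assuming that the comparison in the last conjunct is a disequality, so that the lower arc of the circle $x^2 + y^2 - 2y = 8$ is removed from the ellipse. The function name and the test points are illustrative.

```python
from fractions import Fraction as F

def in_figure(x, y):
    """Evaluate the defining formula of the semi-algebraic set at the point (x, y)."""
    inside_ellipse = x**2 / 25 + y**2 / 16 <= 1
    outside_left_disc = x**2 + 4 * x + y**2 - 2 * y >= -4
    outside_right_disc = x**2 - 4 * x + y**2 - 2 * y >= -4
    off_lower_arc = (x**2 + y**2 - 2 * y != 8) or (y > -1)
    return inside_ellipse and outside_left_disc and outside_right_disc and off_lower_arc

for point in [(F(0), F(0)), (F(-2), F(1)), (F(0), F(-2)), (F(6), F(0))]:
    print(point, in_figure(*point))
```

The four sample points are the origin, the centre of the left disc, a point on the removed arc, and a point outside the ellipse.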

Semi-algebraic sets are, together with classical alpha-numeric data, the basic 
ingredients in constraint databases. As we have seen above in Fig. 13.3, these
sets appear in a constraint database by means of a defining formula. In this 
sense, the constraint database model is a generalization of the relational database 
model. 

As in the classical relational database model, first-order logic can be used to
formulate queries in the constraint model. Semi-algebraic sets are described
by Boolean combinations of (linear) polynomial inequalities, which are basically
quantifier-free formulas in first-order logic over the reals. This logic has
addition and multiplication as functions, order as relation and zero and one
as constants. In the constraint database model, an extension of this logic with
predicates to address the relations in the input database is used as a basic logical 
query language. This logic turns out to be a language in which a lot of relevant 
spatial database queries can be formulated. For example, the query “Where 
in Flanders, not too far from the river Scheldt, can I drink a Duvel?” can be 
expressed by the formula 


$\varphi(x, y) = \mathit{Regions}(\mathrm{Flanders}, x, y) \wedge$
$\exists x'\, \exists y'\, (\mathit{Rivers}(\mathrm{Scheldt}, x', y') \wedge (x - x')^2 + (y - y')^2 \le 1) \wedge$
$\exists p\, \exists c\, \exists p'\, (\mathit{Pub}(p, x, y) \wedge \mathit{Beer}(\mathrm{Duvel}, p, c, p'))$.

Here, we translate “not too far from the river Scheldt” by “at most distance 1 from
some point of the Scheldt”. We remark that some variables in this expression
are assumed to range over finite domains (namely p, c, p’), but others range over 
the real numbers (namely x, y, x’ and y’). Nevertheless, it turns out that queries 
expressed by first-order formulas like this one can be effectively evaluated on 
constraint databases. In our example the output is a two-dimensional geometric 
object and the query evaluation algorithm guarantees that it can also be described 
by a Boolean combination of polynomial inequalities. 

The ideas presented above are at the basis of the constraint database model. 
The basic idea is to extend or generalize the relational model and not only to 
allow finite relations, but also finitely representable relations. 

We remark that the constraint database model was introduced by Kanellakis 
et al., 1995. It has received a lot of research attention since. An overview of 
research results in this field can be found in Kuper et al., 2000, and Revesz has 
written a textbook on the subject (Revesz, 2002). 


Overview. This chapter is organized as follows. In Sec. 2, we describe 
the constraint database model with its data models and basic query languages. 
Sec. 3 gives an overview of some definitions and results in real algebraic ge- 
ometry that will be used further on. In Sec. 4, we discuss query evaluation in 
the constraint database model through quantifier elimination. We also outline 
some quantifier elimination algorithms there. Sec. 5 is devoted to the expressive 
power of first-order logic over the reals as a query language for constraint data- 
bases. Topological queries get special attention. Finally, in Sec. 6, we discuss 
some more powerful query languages for constraint databases that are exten- 
sions of first-order logic, with transitive closure operators, with while-loop and 
with topological operators.

2. Constraint data models and query languages


In this section, we define the logics FO(+, x, <, 0, 1), i.e., first-order logic 
with polynomial constraints, and FO(+, <, 0,1), i.e., first-order logic with lin- 
ear constraints, and show how they form the basis of the constraint approach in 
both the modeling and querying of spatial data. More specifically, we introduce 
the polynomial and linear constraint model and extend FO(+, x, <,0,1) and 
FO(+, <,0, 1) to query languages for the respective models. 


2.1 The logics FO(+, x, <, 0,1) and FO(+, <, 0, 1) 


Let (+, ×, <, 0, 1) be a so-called vocabulary with two function symbols of
arity two (+ and ×), one predicate symbol of arity two (<), and two constant
symbols (0 and 1). In the constraint model, this vocabulary is interpreted
over the real field, i.e., the structure consisting of the set of real numbers,
$\mathbb{R}$, equipped with the standard addition, multiplication, and order.

We define FO(+, ×, <, 0, 1) as the first-order logic over the vocabulary
(+, ×, <, 0, 1). We build formulas in FO(+, ×, <, 0, 1) in the standard way:
a term $t$ in FO(+, ×, <, 0, 1) is either a variable $x_i$; a constant (0 or 1); or of
the form $t + t'$ or $t \times t'$ for terms $t$ and $t'$. In other words, terms are
polynomials with integer coefficients. Next, atomic formulas in FO(+, ×, <, 0, 1)
are formulas of the form $t = t'$ or $t < t'$ for terms $t$ and $t'$. Finally, formulas
in FO(+, ×, <, 0, 1) are built from atomic formulas by using the Boolean
connectives ($\wedge$, $\vee$, or $\neg$) and quantifiers ($\forall x_i$ or $\exists x_i$). A variable is
called free in a formula if it is not bound by a quantifier. We denote by
$\varphi(x_1, \ldots, x_n)$ the fact that the FO(+, ×, <, 0, 1) formula $\varphi$ has $n$ free
variables $x_1, \ldots, x_n$. A formula without any free variables is called a sentence.
A formula without quantifiers is called quantifier-free.
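
The grammar of terms and formulas described above can be made concrete with a small abstract syntax tree. The following Python sketch is only an illustration: the class names (Var, Const, Add, Mul, Eq, Lt, Not, And, Or, Exists, Forall) and the two helper functions are assumptions introduced here, not notation from this chapter. It computes the free variables of a formula and tests whether a formula is quantifier-free.

```python
from dataclasses import dataclass

# Terms over the vocabulary (+, x, <, 0, 1); all class names are illustrative.
@dataclass(frozen=True)
class Var:
    name: str

@dataclass(frozen=True)
class Const:
    value: int  # 0 or 1

@dataclass(frozen=True)
class Add:
    left: object
    right: object

@dataclass(frozen=True)
class Mul:
    left: object
    right: object

# Atomic formulas t = t' and t < t'.
@dataclass(frozen=True)
class Eq:
    left: object
    right: object

@dataclass(frozen=True)
class Lt:
    left: object
    right: object

# Boolean connectives and quantifiers.
@dataclass(frozen=True)
class Not:
    arg: object

@dataclass(frozen=True)
class And:
    left: object
    right: object

@dataclass(frozen=True)
class Or:
    left: object
    right: object

@dataclass(frozen=True)
class Exists:
    var: str
    body: object

@dataclass(frozen=True)
class Forall:
    var: str
    body: object


def free_vars(node):
    """Return the set of variable names that occur free in a term or formula."""
    if isinstance(node, Var):
        return frozenset({node.name})
    if isinstance(node, Const):
        return frozenset()
    if isinstance(node, Not):
        return free_vars(node.arg)
    if isinstance(node, (Exists, Forall)):
        return free_vars(node.body) - {node.var}
    # Every remaining node type is binary (Add, Mul, Eq, Lt, And, Or).
    return free_vars(node.left) | free_vars(node.right)


def is_quantifier_free(node):
    """True if no quantifier occurs anywhere in the formula."""
    if isinstance(node, (Exists, Forall)):
        return False
    if isinstance(node, Not):
        return is_quantifier_free(node.arg)
    if isinstance(node, (And, Or)):
        return is_quantifier_free(node.left) and is_quantifier_free(node.right)
    return True  # atomic formulas and terms contain no quantifiers


# Example: "exists y (x*x + y*y = 1)" has the single free variable x.
x, y = Var("x"), Var("y")
circle = Eq(Add(Mul(x, x), Mul(y, y)), Const(1))
phi = Exists("y", circle)
```

Here `circle` is quantifier-free with free variables x and y, whereas `phi` binds y and therefore has x as its only free variable.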

Similarly, we define FO(+, <, 0,1) as the restriction of FO(+, x, <, 0,1) 
in which formulas are constructed from terms which do not use multiplication 
(i.e., formulas without x). In other words, the terms in FO(+,<,0,1) are 
polynomials with integer coefficients of degree at most one. We also say that 
FO(+, <, 0,1) is the first-order logic over the vocabulary (+, <,0, 1). 

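We define the satisfaction of a formula $\varphi(x_1, \ldots, x_n)$ in FO(+, ×, <, 0, 1)
by real numbers $r_1, \ldots, r_n \in \mathbb{R}$, denoted by

\[ (\mathbb{R}, +, \times, <, 0, 1) \models \varphi(r_1, \ldots, r_n), \]

inductively on the structure of $\varphi$:

■ $(\mathbb{R}, +, \times, <, 0, 1) \models (t = t')(r_1, \ldots, r_n)$ if $t(r_1, \ldots, r_n) = t'(r_1, \ldots, r_n)$;

■ $(\mathbb{R}, +, \times, <, 0, 1) \models (t < t')(r_1, \ldots, r_n)$ if $t(r_1, \ldots, r_n) < t'(r_1, \ldots, r_n)$;

■ $(\mathbb{R}, +, \times, <, 0, 1) \models (\neg\varphi)(r_1, \ldots, r_n)$ if $(\mathbb{R}, +, \times, <, 0, 1) \models \varphi(r_1, \ldots, r_n)$ does not hold;

■ $(\mathbb{R}, +, \times, <, 0, 1) \models (\varphi \wedge \psi)(r_1, \ldots, r_n)$ if $(\mathbb{R}, +, \times, <, 0, 1) \models \varphi(r_1, \ldots, r_n)$ and $(\mathbb{R}, +, \times, <, 0, 1) \models \psi(r_1, \ldots, r_n)$;

■ $(\mathbb{R}, +, \times, <, 0, 1) \models (\varphi \vee \psi)(r_1, \ldots, r_n)$ if $(\mathbb{R}, +, \times, <, 0, 1) \models \varphi(r_1, \ldots, r_n)$ or $(\mathbb{R}, +, \times, <, 0, 1) \models \psi(r_1, \ldots, r_n)$;

■ $(\mathbb{R}, +, \times, <, 0, 1) \models (\forall x_n \varphi)(r_1, \ldots, r_{n-1})$ if for all elements $r \in \mathbb{R}$, $(\mathbb{R}, +, \times, <, 0, 1) \models \varphi(r_1, \ldots, r_{n-1}, r)$; and

■ $(\mathbb{R}, +, \times, <, 0, 1) \models (\exists x_n \varphi)(r_1, \ldots, r_{n-1})$ if there exists an element $r \in \mathbb{R}$ such that $(\mathbb{R}, +, \times, <, 0, 1) \models \varphi(r_1, \ldots, r_{n-1}, r)$.
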





As described above, in the constraint model, the satisfaction of formulas in 
FO(+, x,<,0,1) and FO(+, <, 0, 1) is defined with respect to the real field 
R. However, any mathematical structure which interprets the vocabularies 
(+, x, <,0,1) or (+, <, 0,1) can be used instead. 

Of particular importance in the constraint model are the quantifier-free
formulas in FO(+, ×, <, 0, 1) and FO(+, <, 0, 1). As we will see in the next
section, the representation of spatial objects by means of quantifier-free formulas
is the basis of the data model in constraint databases. In Sec. 4, we show that both
FO(+, ×, <, 0, 1) and FO(+, <, 0, 1) admit quantifier elimination. In short,
this means that any formula in FO(+, ×, <, 0, 1) (respectively FO(+, <, 0, 1))
is equivalent over $\mathbb{R}$ to a quantifier-free formula in FO(+, ×, <, 0, 1)
(respectively in FO(+, <, 0, 1)). Hence, we do not lose any generality by considering
quantifier-free formulas only. As mentioned in the introduction, a (quantifier-free)
formula represents a possibly infinite set of points. More specifically,
such formulas describe semi-algebraic sets in the case of FO(+, ×, <, 0, 1),
and semi-linear sets in the case of FO(+, <, 0, 1) (see also Sec. 3).


EXAMPLE 13.1 In Fig. 13.4 of Sec. 1, the smiling face shows all pairs
$(r_1, r_2) \in \mathbb{R}^2$ that satisfy $\varphi(x, y)$, i.e., $(\mathbb{R}, +, \times, <, 0, 1) \models \varphi(r_1, r_2)$,
where $\varphi(x, y)$ is the quantifier-free formula

\[
x^2/25 + y^2/16 < 1 \wedge x^2 + 4x + y^2 - 2y > -4 \wedge
x^2 - 4x + y^2 - 2y > -4 \wedge (x^2 + y^2 - 2y \ne 8 \vee y > 1).
\]

We remark that $\varphi$ has two free variables and that it uses polynomials of degree
at most two.
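
Because the formula of Example 13.1 is quantifier-free, deciding whether a given point belongs to the set it represents only requires evaluating polynomial (in)equalities. The sketch below illustrates this with exact rational arithmetic. The function name `in_face` and the names of the four sub-conditions are assumptions made for this illustration, and the relation symbols are taken as they appear in the printed formula.

```python
from fractions import Fraction

def in_face(x, y):
    """Evaluate the quantifier-free formula of Example 13.1 at the point (x, y)."""
    x, y = Fraction(x), Fraction(y)
    inside_ellipse = x * x / 25 + y * y / 16 < 1
    outside_left_eye = x * x + 4 * x + y * y - 2 * y > -4
    outside_right_eye = x * x - 4 * x + y * y - 2 * y > -4
    off_mouth = (x * x + y * y - 2 * y != 8) or (y > 1)
    return inside_ellipse and outside_left_eye and outside_right_eye and off_mouth

print(in_face(0, 0))    # a point in the middle of the face
print(in_face(-2, 1))   # centre of the left eye, which is excluded
print(in_face(0, -2))   # a point on the mouth curve, which is excluded
print(in_face(6, 0))    # a point outside the ellipse
```

Only the first of the four sample points satisfies all four conjuncts.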


Apart from the modeling of spatial data, the logics FO(+, x, <,0,1) and 
FO(+, <,0,1) serve also as the basis of the standard query languages in the 
constraint model. We come back to this point in the next sections.

2.2 The polynomial constraint data model


First, we discuss the general polynomial constraint model which is based on 
FO(+, x,<,0,1). In the next section, we elaborate on the linear constraint 
model which uses FO(+, <, 0,1). 


The polynomial constraint data model. A database schema S is a finite
set $\{S_1, \ldots, S_k\}$ of relation names. Each relation name $S_i$ ($i = 1, \ldots, k$) is of
some arity $n_i$, which is an integer. A polynomial constraint relation instance
of $S_i$, or constraint relation of $S_i$ for short, maps $S_i$ to a quantifier-free
formula $\varphi_{S_i}(x_1, \ldots, x_{n_i})$ with $n_i$ free variables in the logic FO(+, ×, <, 0, 1). A
(polynomial) constraint database instance over S consists of a set of constraint
relations of $S_1, \ldots, S_k$.

The semantics of a relation instance of $S_i$, denoted by $I(S_i)$, is the possibly
infinite (semi-algebraic) subset

\[ \{(r_1, \ldots, r_{n_i}) \in \mathbb{R}^{n_i} \mid (\mathbb{R}, +, \times, <, 0, 1) \models \varphi_{S_i}(r_1, \ldots, r_{n_i})\}. \]

The semantics of a (polynomial) constraint database instance D, denoted by
$I(D)$, over the schema S, is the collection of semi-algebraic sets $I(S_i)$, with
$S_i$ a relation name appearing in S.


EXAMPLE 13.2 Let S = {S}, where S is a binary relation name. A constraint
database instance D over S maps, for instance, S to the quantifier-free formula
$\varphi(x, y)$ given in Example 13.1. The semantics of D is the smiling face shown in
Fig. 13.4.


It is clear that the same semi-algebraic set can be represented by different 
formulas. Indeed, consider again Fig. 13.4. Suppose that the description of 
the smiling face given in Example 13.1 is extended with the disjunct
$(x = 0 \wedge -1/2 < y < 1/2)$ (i.e., a vertical line segment), representing a nose. This
new representation will not lead to the addition of new points in the smiling 
face, since all the points in the nose are already part of the face. 

Two constraint relations of S and S' (i.e., formulas) are said to be equivalent
if I(S) and I(S') are the same semi-algebraic set (i.e., if I(S) = I(S')).
Similarly, we say that two database instances are equivalent if their relations 
are pairwise equivalent. 
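
The nose example can be probed numerically. The sketch below compares the formula of Example 13.1 with its extension by the nose disjunct on a finite grid of rational points. The function names and the grid parameters are assumptions made for this illustration. Note that agreement on a finite sample can refute equivalence but cannot prove it; a proof requires reasoning about the formulas themselves.

```python
from fractions import Fraction

def face(x, y):
    """The formula of Example 13.1, with relation symbols as printed there."""
    return (x * x / 25 + y * y / 16 < 1
            and x * x + 4 * x + y * y - 2 * y > -4
            and x * x - 4 * x + y * y - 2 * y > -4
            and (x * x + y * y - 2 * y != 8 or y > 1))

def face_with_nose(x, y):
    """The same formula extended with the disjunct describing the nose."""
    nose = (x == 0) and (Fraction(-1, 2) < y < Fraction(1, 2))
    return face(x, y) or nose

def agree_on_grid(f, g, lo=-6, hi=6, steps_per_unit=4):
    """Compare two formulas on a finite grid of rational points."""
    ticks = [Fraction(i, steps_per_unit)
             for i in range(lo * steps_per_unit, hi * steps_per_unit + 1)]
    return all(f(x, y) == g(x, y) for x in ticks for y in ticks)

same = agree_on_grid(face, face_with_nose)
```

The grid contains several points of the nose (for instance (0, 0) and (0, 1/4)), so a nose that added new points to the face would be detected at those samples.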


REMARK 13.3 In the remainder of this chapter, we use the terms constraint 
relation and semi-algebraic set interchangeably, since these notions refer to the 
same objects, albeit from different perspectives. 


Database queries in the constraint model. Before we explain how to use 
FO(+, x, <, 0, 1) as a query language for the polynomial constraint model, we 
define what we mean by a query on a constraint database. In standard relational 
databases, a query is a (partial) function associating with each input database
instance an output relation instance. In the constraint setting, however, there
are two ways of looking at a query.


■ First, as in the relational setting, we can define a k-ary query over a
database schema S as a partial function which associates with a database
instance I(D) (i.e., a collection of semi-algebraic sets) a semi-algebraic
set in $\mathbb{R}^k$, where D is any database instance of S.


■ Second, we can also view a k-ary query over S as a partial function
associating with each database instance D (i.e., a collection of quantifier-free
formulas) a quantifier-free formula in FO(+, ×, <, 0, 1) with k free
variables.


We call the first type of query an unrestricted query; the second is called a 
constraint query. A constraint query clearly only makes sense if it maps two 
equivalent database instances to equivalent relation instances (i.e., equivalent 
quantifier-free FO(+, ×, <, 0, 1)-formulas). If a constraint query satisfies this
property, we call it consistent. We remark that a consistent
constraint query corresponds to a unique unrestricted query. 


EXAMPLE 13.4 Let S consist of a binary relation S. Consider the constraint
query Q which maps any constraint relation of S, given by $\varphi_S$, to the highest
degree of the polynomials appearing in $\varphi_S$. This query is clearly not consistent.
Indeed, let $\varphi_S \equiv x^2 + y^2 = 1$ and $\varphi'_S \equiv (x^2 + y^2)^2 = 1$. Both formulas
correspond to the same semi-algebraic set, i.e., the circle of radius 1 centered at
the origin. Nevertheless, Q returns 2 on input $\varphi_S$, whereas it returns 4 on input $\varphi'_S$.

In the following, when we refer to a constraint database query, we mean a 
consistent constraint query. 
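
The inconsistency of the degree query in Example 13.4 can be reproduced in a few lines. In the sketch below, a polynomial in x and y is stored as a dictionary from exponent pairs to coefficients; this representation, the constant names and the helper functions are assumptions made for this illustration. The two polynomials vanish on the same sample points, yet the degree query distinguishes them.

```python
from fractions import Fraction

# A polynomial in x and y as {(exponent of x, exponent of y): coefficient}.
P_CIRCLE = {(2, 0): 1, (0, 2): 1, (0, 0): -1}                 # x^2 + y^2 - 1
P_CIRCLE_SQ = {(4, 0): 1, (2, 2): 2, (0, 4): 1, (0, 0): -1}   # (x^2 + y^2)^2 - 1

def degree(poly):
    """Total degree: the largest exponent sum among monomials with non-zero coefficient."""
    return max(i + j for (i, j), c in poly.items() if c != 0)

def evaluate(poly, x, y):
    """Value of the polynomial at the point (x, y)."""
    return sum(c * x**i * y**j for (i, j), c in poly.items())

def on_zero_set(poly, x, y):
    """True if the polynomial vanishes at (x, y), using exact rational arithmetic."""
    return evaluate(poly, Fraction(x), Fraction(y)) == 0

samples = [(1, 0), (0, -1), (Fraction(3, 5), Fraction(4, 5)), (1, 1), (0, 0)]
same_zero_set_on_samples = all(
    on_zero_set(P_CIRCLE, x, y) == on_zero_set(P_CIRCLE_SQ, x, y) for x, y in samples
)
```

Here `degree(P_CIRCLE)` is 2 and `degree(P_CIRCLE_SQ)` is 4, although both equations describe the same circle: a query that depends on the syntactic form of the representation is not a query on the represented set.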


The logic FO(+, ×, <, 0, 1) as a query language for polynomial constraint
databases. In this section, we take a closer look at the standard query
language for polynomial constraint databases, which is an extension of
FO(+, ×, <, 0, 1) with predicates to address constraint relations that appear in
the input database.

If we consider queries over a database input schema $S = \{S_1, \ldots, S_k\}$,
then we can associate a query with a formula in the first-order logic over the
vocabulary $(+, \times, <, 0, 1, S_1, \ldots, S_k)$. Let $\varphi(x_1, \ldots, x_m)$ be such a formula
over $(+, \times, <, 0, 1, S_1, \ldots, S_k)$. Given a constraint database D over S, we
interpret $\varphi(x_1, \ldots, x_m)$ over the structure $(\mathbb{R}, +, \times, <, 0, 1)$, extended with the
semi-algebraic sets $I(S_1), \ldots, I(S_k)$ as given by D. More specifically, the m-ary
answer set of $\varphi(x_1, \ldots, x_m)$ is defined as

\[ \{(r_1, \ldots, r_m) \in \mathbb{R}^m \mid (\mathbb{R}, +, \times, <, 0, 1, I(S_1), \ldots, I(S_k)) \models \varphi(r_1, \ldots, r_m)\}. \]

We also write the above for short as

\[ \{(r_1, \ldots, r_m) \in \mathbb{R}^m \mid (\mathbb{R}, D) \models \varphi(r_1, \ldots, r_m)\}. \]

It is clear that equivalent databases result in the same answer set. We say that
$\varphi$ expresses the corresponding (unique) unrestricted query. In the sequel, we
refer to these extensions of FO(+, ×, <, 0, 1) simply by FO(+, ×, <, 0, 1) if
the input schema is clear from the context or irrelevant.

An important property of any query language is that it is closed, i.e., the
result of a query should admit a representation in the same data model as the
source relations. In particular, for FO(+, ×, <, 0, 1) to be closed, the result of
a query should again be representable by a quantifier-free formula in
FO(+, ×, <, 0, 1). Since FO(+, ×, <, 0, 1) admits quantifier elimination, and given
the way FO(+, ×, <, 0, 1)-formulas are evaluated, this requirement is satisfied (see
also Sec. 4).

In Sec. 1, we gave examples of queries expressed in FO(+, x, <,0,1). We 
give some more examples here. 


EXAMPLE 13.5 Let $Q_{bounded}$ be the unrestricted query which returns true if
and only if the input semi-algebraic set in $\mathbb{R}^2$ is bounded. In the first-order logic
over (+, ×, <, 0, 1, S), where S is a binary relation name, the sentence

\[ \exists \varepsilon\, (\varepsilon \ne 0 \wedge \forall x \forall y\, (S(x, y) \to x^2 + y^2 < \varepsilon^2)) \]

expresses $Q_{bounded}$.


EXAMPLE 13.6 Let $Q_{interior}$ be the query that returns all points of any input
semi-algebraic set in $\mathbb{R}^2$ that have a neighborhood that is completely contained in
the semi-algebraic set. Hence, $Q_{interior}$ returns the topological interior of a
semi-algebraic set in $\mathbb{R}^2$. This query can be expressed as

\[ \exists r\, \forall x' \forall y'\, \bigl((r \ne 0) \wedge ((x - x')^2 + (y - y')^2 < r^2 \to S(x', y'))\bigr). \]

We remark that this formula has two free variables, so it defines a semi-algebraic
set in $\mathbb{R}^2$.








In the next section, we discuss the expressive power of the query language 
FO(+, x, <,0,1). For the moment, let us merely say that it is “rather limited.” 
Topological queries such as the topological interior are expressible in this 
logic, but we will see in Sec. 5 that important queries are not expressible in 
FO(+, x, <,0,1). More specifically, we will see that the query that expresses 
that a spatial database is topologically connected is not expressible. Due to the 
importance of this query in the spatial database practice, many efforts to extend 
FO(+, x, <,0,1) to richer query languages exist, some of which we discuss 
in Sec. 6. 


2.3 The linear constraint model: an application in 
Geographic Information Systems 


Next, we discuss the linear constraint model, which is less expressive than the 
polynomial constraint model (as we will illustrate in Sec. 5), but nevertheless
powerful enough to model applications like Geographic Information Systems,
or GIS for short.


The linear constraint data model. Since we emphasize the GIS aspect 
of the linear model here, we will also combine linear spatial information with 
classical alpha-numeric information, as is customary in the GIS practice. There- 
fore, for the sake of illustrating the suitability for GIS, we consider more general 
database schemas and instances in this section. 

Also, the linear constraint database model can be seen as based on the re- 
lational model. Moreover, linear constraint databases also require a lot of 
traditional database capabilities. In particular, if the linear constraint database 
consists purely of non-spatial flat relations, it degenerates into a traditional 
database for which the relational model offers a well-accepted representation. 

More formally, a linear constraint database schema S consists of a finite set
of relation names $S_1, \ldots, S_k$. Each relation name $S_i$ ($i = 1, \ldots, k$) is of some
type $[n_i, m_i]$, with $n_i$ and $m_i$ integers. A linear constraint database instance is
a mapping that assigns a linear relation instance to each relation name appearing
in the database schema. A linear relation instance of $S_i$, also called a linear
relation for short, is a finite set of linear tuples of type $[n_i, m_i]$. A linear tuple
of type $[n_i, m_i]$ is defined as a tuple of the form

\[ (c_1, \ldots, c_{n_i}, \varphi(x_1, \ldots, x_{m_i})), \]

where $c_1, \ldots, c_{n_i}$ are thematic values, typically from some alpha-numeric
domain U (for instance, U could be the set of all strings over our alphabet and
natural numbers) and $\varphi(x_1, \ldots, x_{m_i})$ is a quantifier-free formula in the logic
FO(+, <, 0, 1) with $m_i$ free variables.

The semantics of a linear tuple $t = (c_1, \ldots, c_{n_i}, \varphi(x_1, \ldots, x_{m_i}))$ of type
$[n_i, m_i]$ is the possibly infinite subset of $U^{n_i} \times \mathbb{R}^{m_i}$ defined as the Cartesian
product $\{(c_1, \ldots, c_{n_i})\} \times A_i$, in which $A_i \subseteq \mathbb{R}^{m_i}$ is the semi-linear set

\[ \{(r_1, \ldots, r_{m_i}) \in \mathbb{R}^{m_i} \mid (\mathbb{R}, +, <, 0, 1) \models \varphi(r_1, \ldots, r_{m_i})\}. \]

This subset of $U^{n_i} \times \mathbb{R}^{m_i}$ can be interpreted as a possibly infinite $(n_i + m_i)$-ary
relation, denoted $I(t)$. The semantics of a linear relation $S_i$, denoted $I(S_i)$,
is defined as $I(S_i) = \bigcup_{t \in S_i} I(t)$. Finally, the semantics of a linear spatial
database D over the schema S is the set of relations $I(S_i)$ with $S_i$ a linear
relation name appearing in the schema $S = \{S_1, \ldots, S_k\}$ of D.
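
The semantics of linear tuples and linear relations can be mirrored directly in code. The following sketch is an illustration under stated assumptions: the class `LinearTuple`, the functions `in_tuple` and `in_relation`, and the toy relation `regions` are all hypothetical, and the quantifier-free linear formula of a tuple is represented simply as a Python predicate.

```python
from dataclasses import dataclass
from fractions import Fraction
from typing import Callable, Tuple

@dataclass(frozen=True)
class LinearTuple:
    thematic: Tuple[str, ...]          # the thematic values c_1, ..., c_n
    constraint: Callable[..., bool]    # quantifier-free linear formula over m reals

def in_tuple(t, thematic, point):
    """Membership of (thematic, point) in I(t): matching thematic part, point satisfies the formula."""
    return t.thematic == tuple(thematic) and t.constraint(*map(Fraction, point))

def in_relation(relation, thematic, point):
    """I(S) is the union of I(t) over the finitely many tuples t of S."""
    return any(in_tuple(t, thematic, point) for t in relation)

# Hypothetical relation of type [1, 2]: a named triangle and a named line segment.
regions = [
    LinearTuple(("triangle",), lambda x, y: x >= 0 and y >= 0 and x + y <= 4),
    LinearTuple(("segment",), lambda x, y: y == 2 and 0 <= x <= 10),
]
```

The relation is a finite list of tuples, yet each tuple stands for infinitely many points: for instance, `in_relation(regions, ("triangle",), (1, 1))` holds, while the same point paired with the thematic value "segment" is not in the relation.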

For GIS, where spatial information is often modeled in either the vector 
model or the raster model, and combined with traditional alpha-numeric in- 
formation often stored in a relational database, the linear constraint model is 
powerful enough. Indeed, in the vector model, three types of planar spatial
objects are standardly used, namely points, polylines and polygons. In the raster
model, the plane $\mathbb{R}^2$ is divided by a regular grid. Clearly, if we assume the grid
to be finite, both types of data can be modeled in the linear constraint model.


EXAMPLE 13.7 In Sec. 1, we introduced the example of a map containing
information about Belgium, as illustrated in Fig. 13.2. The spatial information
displayed on the map of Belgium can be categorized into city, river, and region
information. Therefore, we introduced three relations, each containing one of
these spatial information sources (Fig. 13.3). The relations Cities, Rivers
and Regions are of type [1, 2] and model respectively points, polylines and
polygons. Their thematic component contains names (or string information),
whereas their spatial component contains formulas describing spatial features
of Belgium. This example illustrates that the linear constraint model is suitable
for GIS.


The logic FO(+, <, 0,1) as a query language for Geographic Information 
Systems. In this section, we take a closer look at the standard query language 
for linear constraint databases which is an extension of FO(+,<,0,1) with 
predicates to address linear constraint relations that appear in the input database. 
Because of the mixed presence of thematic and spatial information, this query 
language will be an extension of FO(+,<,0, 1) in the sense of a two-sorted 
logic. More specifically, if we consider queries over a database input schema 
S = {S1,... , Sk}, we have, apart from the terms, formulas and quantifications 
possible in FO(+, <, 0, 1), the following ingredients: 


■ apart from (real) variables $x_1, x_2, \ldots$ ranging over $\mathbb{R}$, we also have
infinitely many thematic variables $v_1, v_2, \ldots$ ranging over U and distinct
from the set of real variables;


■ we have atomic formulas of the form $v_1 = v_2$, with $v_1$ and $v_2$ thematic
variables;


■ we have atomic formulas of the form $S_i(v_{j_1}, \ldots, v_{j_{n_i}}, t_{j_1}, \ldots, t_{j_{m_i}})$, with
$S_i$ a relation name of type $[n_i, m_i]$, where $v_{j_1}, \ldots, v_{j_{n_i}}$ are thematic variables
and $t_{j_1}, \ldots, t_{j_{m_i}}$ are terms in FO(+, <, 0, 1); and


■ universal and existential quantification of thematic variables.


In the following, we will refer to this extension of FO(+, <, 0, 1) simply
as FO(+, <, 0, 1). Similar to the case of FO(+, ×, <, 0, 1), a formula
$\varphi(v_1, \ldots, v_n, x_1, \ldots, x_m)$ in FO(+, <, 0, 1) expresses a constraint query of
type [n, m].

Finally, we shall give some typical example queries, illustrating the expres- 
sive power of FO(+, <,0, 1). 


EXAMPLE 13.8 An example of a (very simple) linear spatial query on the
database in Example 13.7 is “Find all cities that lie on a river and give their
names and the names of the rivers they lie on.” This query can be expressed
by the following first-order formula:

\[ \varphi(c, r) = \exists x \exists y\, (\mathit{Cities}(c, x, y) \wedge \mathit{Rivers}(r, x, y)). \]





This formula defines an output relation of type [2, 0]. 
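
When cities are points and rivers are polylines, the existential quantifiers in this formula reduce to a point-on-segment test for every city and every river. The sketch below illustrates this on invented data. The function names, the identifiers c1 to c4 and r1, r2, and all coordinates are hypothetical and do not reproduce the map of Fig. 13.2. Integer coordinates keep the arithmetic exact.

```python
def on_segment(p, a, b):
    """True if point p lies on the closed line segment from a to b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    collinear = (bx - ax) * (py - ay) == (by - ay) * (px - ax)
    within_box = min(ax, bx) <= px <= max(ax, bx) and min(ay, by) <= py <= max(ay, by)
    return collinear and within_box

def on_polyline(p, vertices):
    """True if p lies on one of the segments joining consecutive vertices."""
    return any(on_segment(p, a, b) for a, b in zip(vertices, vertices[1:]))

def cities_on_rivers(cities, rivers):
    """All pairs (c, r) such that the point of city c lies on river r."""
    return {(c, r) for c, p in cities.items()
                   for r, line in rivers.items() if on_polyline(p, line)}

# Hypothetical toy data.
cities = {"c1": (2, 2), "c2": (3, 3), "c3": (5, 5), "c4": (6, 4)}
rivers = {"r1": [(0, 0), (4, 4), (8, 4)], "r2": [(0, 6), (6, 0)]}
answer = cities_on_rivers(cities, rivers)
```

The result is a finite set of pairs of names, in line with the output type [2, 0]: it has a thematic part of arity two and no spatial part.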


In all the remaining queries, we shall assume the input database consists of 
one relation S of type [0, 2]. 


EXAMPLE 13.9 The following FO(+, <, 0, 1)-sentence expresses $Q_{bounded}$
(see Example 13.5):

\[ \exists d\, \forall x \forall y\, (S(x, y) \to -d < x \wedge x < d \wedge -d < y \wedge y < d). \]


EXAMPLE 13.10 Several topological properties of a semi-linear set can be
expressed in FO(+, <, 0, 1). For instance, the query $Q_{interior}$ (see Example 13.6)
is expressed by the FO(+, <, 0, 1) formula

\[ \varphi(x, y) = \exists \varepsilon\, \forall x' \forall y'\, \bigl((\varepsilon \ne 0) \wedge ((|x - x'| < \varepsilon \wedge |y - y'| < \varepsilon) \to S(x', y'))\bigr). \]

The formula $\varphi(x, y)$ represents a semi-linear set in $\mathbb{R}^2$.


In spite of all this, FO(+, <, 0,1) cannot be considered as a fully adequate 
query language for practical purposes. More specifically, there are very simple
queries that are not expressible in FO(+, <, 0, 1), although they are expressible in
FO(+, ×, <, 0, 1). We return to this issue in Sec. 5.


3. Introduction to real algebraic geometry 


In this section, we define and discuss semi-algebraic and semi-linear sets 
and review some well-known properties of these sets. We are interested in sets 
which are situated in the n-dimensional Euclidean space $\mathbb{R}^n$.

An excellent introduction to real-algebraic geometry can be found in (Coste, 
2000b). Proofs of all the theorems given in this section can be found there. 
More advanced is the standard book in the field (Bochnak et al., 1987), and for a
more algorithmic point of view we refer to (Basu et al., 2003a). An interesting
book covering many other aspects of real algebraic geometry is (Benedetti 
and Risler, 1990). On a very advanced level, investigations of real-algebraic 
geometry in terms of constructible sets, real spectra, and spaces of orderings 
can be found in (Andradas et al., 1996). 

Finally, the generalization of real-algebraic geometry to so-called o-minimal 
geometry is described in (Coste, 2000a). An excellent book on o-minimal 
structures is (van den Dries, 1998). Interestingly, many results from constraint 
databases described in this chapter can be generalized to the o-minimal setting. 
We refer to the standard book on constraint databases for more details (Kuper 
et al., 2000).

3.1 Semi-algebraic sets and their basic properties


Definition of semi-algebraic sets. A semi-algebraic subset of $\mathbb{R}^n$ is a subset
of points $\vec{x} = (x_1, \ldots, x_n)$ in $\mathbb{R}^n$ satisfying a Boolean combination (expressed
by disjunction, conjunction and negation or, in set-theoretic terms, by union,
intersection, and complement) of polynomial equations and inequalities with
integer coefficients. It is easy to see that every semi-algebraic set in $\mathbb{R}^n$ is the
finite union of sets of the form

\[ \{\vec{x} \in \mathbb{R}^n \mid f(\vec{x}) = 0,\ g_1(\vec{x}) > 0,\ g_2(\vec{x}) > 0, \ldots, g_\ell(\vec{x}) > 0\}, \]

where $f, g_1, \ldots, g_\ell$ are multivariate polynomials in the variables $x_1, \ldots, x_n$,
with integer coefficients. A semi-linear subset of $\mathbb{R}^n$ is a semi-algebraic subset
which is described by multivariate polynomials of degree at most one (i.e.,
linear multivariate polynomials).


REMARK 13.11 It is easy to see that the class of semi-algebraic sets defined 
above coincides with the class of sets represented by quantifier-free formulas 
in FO(+, x,<,0,1). We therefore are free to choose either of the two rep- 
resentations. We more often use the representation in terms of quantifier-free 
formulas. 


EXAMPLE 13.12 In the introductory section we have already given an example
of a semi-algebraic set (see, e.g., Fig. 13.4). Semi-algebraic sets can be
used to model various spatial situations, but also spatio-temporal phenomena,
as is illustrated in Fig. 13.5. Here a potential scene from Star Trek is depicted
in which the starship Enterprise fires a photon torpedo. This scene plays in the
three-dimensional (x, y, t) space, where x and y are spatial coordinates and t
represents a time coordinate. The starship remains at a constant position in
space and can therefore be described by some fixed formula


\[ \varphi_{\mathit{Enterprise}}(x, y, t) = (x^2 + y^2 = 1) \vee (x^2 + y^2 = (1/4)^2) \vee \cdots \]


in which t does not appear. A fired photon torpedo follows the dotted line
(between the moments t = 0 and t = 1) and then explodes (depicted as increasing
dotted circles, between t = 1 and t = 2). At the bottom of Fig. 13.5 three
frames of the movie are shown: at t = 1/2, 1 and 2. The complete movie can
be described by the set


{(x, y, t) ∈ R² × R | (φ_Enterprise(x, y) ∧ (0 < … (y = 0 ∧ x = 4t) ∧ (0 < t … }

Figure 13.5. USS Enterprise firing a photon torpedo at a (cloaked) Klingon vessel.


Basic properties of semi-algebraic sets. The class of semi-algebraic sets
is closed under finite unions, intersections and complements. Moreover, if
$A \subseteq \mathbb{R}^n$ and $B \subseteq \mathbb{R}^m$ are semi-algebraic, then the Cartesian product $A \times B$
is a semi-algebraic subset of $\mathbb{R}^{n+m}$. A much deeper result is that the class of
semi-algebraic sets is closed under projection as well:


THEOREM 13.13 (TARSKI-SEIDENBERG) Let A be a semi-algebraic subset
of $\mathbb{R}^{n+1}$ and let $\pi : \mathbb{R}^{n+1} \to \mathbb{R}^n$ be the projection on the first n coordinates.
Then $\pi(A)$ is a semi-algebraic subset of $\mathbb{R}^n$.
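
A small worked instance may help to see what closure under projection says. Take for A the unit circle in the plane (so n = 1) and project it on the first coordinate. The projection is defined with an existential quantifier, but it admits a quantifier-free description, since the equation y² = 1 − x² has a real solution y exactly when 1 − x² is non-negative:

```latex
A = \{(x, y) \in \mathbb{R}^2 \mid x^2 + y^2 = 1\}, \qquad
\pi(A) = \{x \in \mathbb{R} \mid \exists y\, (x^2 + y^2 = 1)\}
       = \{x \in \mathbb{R} \mid 1 - x^2 \ge 0\} = [-1, 1].
```

The projection is again described by a polynomial inequality, as the Tarski-Seidenberg theorem guarantees.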


One may wonder whether all subsets of $\mathbb{R}^n$ are semi-algebraic. Already for
n = 1, it can be shown that there are subsets that are not semi-algebraic. In 
fact, every semi-algebraic subset of R is known to be a finite union of open 
intervals (possibly unbounded) and points. Fig. 13.6 gives an example of a 
one-dimensional semi-algebraic set. It is the union of four open intervals (the 
leftmost being unbounded) and five points (three of which are isolated). 





Figure 13.6. An example of a semi-algebraic set in R. 


From this property it follows that the set of natural numbers N is not a
semi-algebraic subset of R. Similarly, it is easily verified that the zig-zag line in $\mathbb{R}^2$
shown in Fig. 13.7 is not semi-algebraic.

Figure 13.7. An example of a subset of $\mathbb{R}^2$ that is not semi-algebraic.


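Let A be a semi-algebraic set of $\mathbb{R}^n$ and let $f : A \to \mathbb{R}$ be a real-valued
function. Then f is called a semi-algebraic function if its graph

\[ \Gamma(f) = \{(\vec{x}, r) \in A \times \mathbb{R} \mid \vec{x} \in A \text{ and } r = f(\vec{x})\} \]

is a semi-algebraic set of $\mathbb{R}^{n+1}$.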


Curve selection. The following result says that any point on the border of 
a semi-algebraic set can be connected to the set via a continuous curve. 


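THEOREM 13.14 (CURVE SELECTION) Let A be a semi-algebraic set of
$\mathbb{R}^n$, and let $\vec{x} \in \overline{A} \setminus A$, where $\overline{A}$ denotes the topological closure of A. Then
there exists a continuous semi-algebraic function
$\gamma : [0, 1] \to \mathbb{R}^n$ such that $\gamma(0) = \vec{x}$ and $\gamma((0, 1]) \subseteq A$.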


A proof of this theorem can be found, e.g., in (Bochnak et al., 1987, Propo- 
sition 2.5.3). 


REMARK 13.15 A set A of $\mathbb{R}^n$ is connected if there exist no open sets U, V
of $\mathbb{R}^n$ such that $A \subseteq U \cup V$, $A \cap U \ne \emptyset$, $A \cap V \ne \emptyset$ and $A \cap U \cap V = \emptyset$. A set A is
semi-algebraically arc-connected if between any two points $\vec{s}, \vec{t} \in A$ there exists a
continuous semi-algebraic function $\gamma : [0, 1] \to \mathbb{R}^n$ such that $\gamma(0) = \vec{s}$, $\gamma(1) = \vec{t}$ and
$\gamma([0, 1]) \subseteq A$. It can be easily verified that, from the curve selection theorem, it
follows that for a semi-algebraic set A of $\mathbb{R}^n$ being connected coincides with being
semi-algebraically arc-connected.


We remark that the curve selection theorem also holds when semi-algebraic 
is replaced by semi-linear. 


3.2 Decompositions of semi-algebraic sets 


Topological decomposition. The semi-algebraic sets of $\mathbb{R}$ are characterized
above as being finite unions of open intervals and points. A similar
characterization exists for semi-algebraic sets of $\mathbb{R}^n$, which we state here. A
proof of this result can be found in (Bochnak et al., 1987, Theorem 2.3.6). We
first recall the definition of a homeomorphism: a homeomorphism h between
two sets X and Y is a continuous bijection which has a continuous inverse. Two
sets are called homeomorphic if there exists a homeomorphism between them.

THEOREM 13.16 Let A be a semi-algebraic subset of $\mathbb{R}^n$. Then A can be
written as a finite union

\[ A = \bigcup_{i=0}^{n} \bigcup_{j=1}^{m_i} A_{ij}, \]

where each $A_{ij}$ is homeomorphic to the open cube $(0, 1)^i$.


We remark that the dimension of $(0, 1)^i$ is i. So, this theorem states that any
semi-algebraic set of $\mathbb{R}^n$ can be decomposed into a finite union of objects that
are, from a topological point of view, open cubes of dimension at most n.


Cylindrical algebraic decomposition. In practice, more refined decompositions
of semi-algebraic sets are used that are also computable by more or
less efficient algorithms. One such decomposition is given by the cell
decomposition theorem for semi-algebraic sets. Before we can state this theorem, we
need the notion of cylindrical algebraic decomposition (CAD) of $\mathbb{R}^n$: a
CAD of $\mathbb{R}^n$ is a special partition of $\mathbb{R}^n$ into finitely many cells. The definition
is by induction on n:


(i) a CAD of $\mathbb{R}^1$ is a collection
\[ \{(-\infty, a_1), (a_1, a_2), \ldots, (a_k, +\infty), \{a_1\}, \ldots, \{a_k\}\} \]
of open intervals and points, where $a_1 < \cdots < a_k$ are points in $\mathbb{R}$.


(ii) a CAD of $\mathbb{R}^{n+1}$ is a finite partition of $\mathbb{R}^{n+1}$ into (semi-algebraic) cells
A such that the set of projections $\pi(A)$ is again a CAD of $\mathbb{R}^n$. Here,
$\pi : \mathbb{R}^{n+1} \to \mathbb{R}^n$ is again the usual projection map defined by
$\pi(x_1, \ldots, x_n, x_{n+1}) = (x_1, \ldots, x_n)$.

We still have to specify what a cell in $\mathbb{R}^{n+1}$ is. Let $(i_1, \ldots, i_m)$ be a sequence
of zeros and ones of length m. We define a cell inductively on m as follows:


(i) a (0)-cell is a one-element set $\{r\}$ of $\mathbb{R}$, a (1)-cell is an open interval
$(a, b) \subseteq \mathbb{R}$.


(ii) Suppose $(i_1, \ldots, i_m)$-cells are already defined. Then an $(i_1, \ldots, i_m, 0)$-cell
is the graph $\Gamma(f)$ of a continuous semi-algebraic function $f : X \to \mathbb{R}$,
where X is an $(i_1, \ldots, i_m)$-cell. Furthermore, an $(i_1, \ldots, i_m, 1)$-cell is
a set of the form

\[ (f, g)_X = \{(\vec{x}, r) \in X \times \mathbb{R} \mid \vec{x} \in X \text{ and } f(\vec{x}) < r < g(\vec{x})\}, \]

where X is an $(i_1, \ldots, i_m)$-cell and f, g are continuous semi-algebraic
functions on X, possibly equal to the constant functions $+\infty$ or $-\infty$.

Figure 13.8. An example of a CAD in $\mathbb{R}^2$, shown in panels (a) and (b).


A cell in $\mathbb{R}^n$ is an $(i_1, \ldots, i_n)$-cell for some sequence $(i_1, \ldots, i_n)$. A
semi-algebraic set A of $\mathbb{R}^n$ is said to be partitioned by a CAD D of $\mathbb{R}^n$ if each cell in D
is either contained in A or disjoint from A. In other words, A is a union of cells of D.


THEOREM 13.17 (FINITE CELL DECOMPOSITION) Given any semi-algebraic
sets $A_1, \ldots, A_k$ of $\mathbb{R}^n$, there is a CAD of $\mathbb{R}^n$ partitioning each of
$A_1, \ldots, A_k$.


EXAMPLE 13.18 Consider the semi-algebraic subset A of $\mathbb{R}^2$ given by the
formula

\[ (x^2 + y^2 \le 1) \vee ((y = 0) \wedge (1 < x) \wedge (x < 2)) \vee ((y \ne 0) \wedge (2 < x)). \]

This set is shown in part (a) of Fig. 13.8. In part (b) of this figure, a CAD of
$\mathbb{R}^2$ is given, consisting of 25 cells, which partitions A. This CAD induces a
CAD on the x-axis consisting of three (0)-cells and four (1)-cells (two of which
are unbounded). On top of these cells, (0,0)-, (0,1)-, (1,0)-, and (1,1)-cells
are built.


A proof of the finite cell decomposition theorem is given in (van den Dries, 
1998, Ch. 3, Theorem 2.11). A key ingredient in this proof is the so-called 
uniform finiteness property of semi-algebraic sets. This property is also useful 
to obtain inexpressibility results, as will be shown in Sec. 5. To state this 
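property, we need some definitions. A set $A \subseteq \mathbb{R}^{n+1}$ is called finite over $\mathbb{R}^n$
if for each $\vec{x} \in \mathbb{R}^n$ the fiber $A_{\vec{x}} = \{r \in \mathbb{R} \mid (\vec{x}, r) \in A\}$ is finite. We call
A uniformly finite over $\mathbb{R}^n$ if there is an $N \in \mathbb{N}$ such that $|A_{\vec{x}}| \le N$ for all
$\vec{x} \in \mathbb{R}^n$. We then have:


THEOREM 13.19 (UNIFORM FINITENESS PROPERTY) If $A \subseteq \mathbb{R}^{n+1}$ is a
semi-algebraic set which is finite over $\mathbb{R}^n$, then A is uniformly finite over $\mathbb{R}^n$.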
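
Two small sets in the plane (n = 1), chosen here purely for illustration, show how the uniform finiteness property is used. The first set, A, is semi-algebraic and finite over the real line, and indeed every fiber has at most two points. The second set, B, is also finite over the real line, but its fibers grow without bound, so by the theorem B cannot be semi-algebraic:

```latex
A = \{(x, r) \in \mathbb{R}^2 \mid r^2 = x\}, \qquad
A_x = \begin{cases}
  \emptyset & \text{if } x < 0,\\
  \{0\} & \text{if } x = 0,\\
  \{-\sqrt{x}, \sqrt{x}\} & \text{if } x > 0,
\end{cases}
\qquad |A_x| \le 2 .

B = \{(x, r) \in \mathbb{R}^2 \mid r \in \mathbb{Z},\ 0 \le r \le x\}, \qquad
|B_x| = \lfloor x \rfloor + 1 \ \text{ for } x \ge 0 .
```

Arguments of the second kind, where a set is shown to be finite but not uniformly finite, are a typical route to inexpressibility results.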
As we will see in the next section, CAD is the basic tool for eliminating
quantifiers. To be precise, we need a CAD that is adapted to a given set
of polynomials, in the sense that the sign of each of these polynomials is constant
on each cell of the CAD. Moreover, in the context of quantifier elimination,
CAD algorithms typically produce a sample point in each cell, which makes it
possible to determine the sign of the polynomials on that cell. In Fig. 13.8, we have
indicated a sample point for each cell in the CAD.
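
In dimension one, the role of sample points is easy to illustrate. The sketch below builds the cells of a CAD of the real line from a given list of break points, picks one sample point per cell, and records the sign of each polynomial at that sample. It is a simplified illustration, not a CAD algorithm: the function names are hypothetical, and the break points are assumed to be supplied as exact rationals that include all real roots of the polynomials, whereas actual algorithms have to compute and isolate these roots.

```python
from fractions import Fraction

def cad_1d(break_points):
    """Cells of the CAD of the real line induced by the given points.

    Each cell is a tuple (kind, low, high, sample); None stands for an
    infinite endpoint.
    """
    pts = sorted({Fraction(p) for p in break_points})
    if not pts:
        return [("interval", None, None, Fraction(0))]
    cells = [("interval", None, pts[0], pts[0] - 1)]
    for i, a in enumerate(pts):
        cells.append(("point", a, a, a))
        if i + 1 < len(pts):
            b = pts[i + 1]
            cells.append(("interval", a, b, (a + b) / 2))
    cells.append(("interval", pts[-1], None, pts[-1] + 1))
    return cells

def sign(value):
    """Return -1, 0 or 1 according to the sign of value."""
    return (value > 0) - (value < 0)

def sign_table(polys, break_points):
    """Sign of every polynomial at the sample point of every cell."""
    return [(cell, tuple(sign(p(cell[3])) for p in polys))
            for cell in cad_1d(break_points)]

# p(x) = x^2 - 1 has roots -1 and 1; q(x) = x - 2 has root 2.
polys = [lambda x: x * x - 1, lambda x: x - 2]
table = sign_table(polys, [-1, 1, 2])
```

With the three break points -1, 1 and 2, the decomposition has three (0)-cells and four (1)-cells. Since neither polynomial has a root inside an open cell, the sign computed at the sample point is the sign on the whole cell.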


Triviality. We remark that until now, all results hold when we replace 
semi-algebraic by semi-linear. However, for the following result to be true one 
needs to work in the semi-algebraic setting. 


EXAMPLE 13.20 Consider again the semi-algebraic set $A$ and CAD of
$\mathbb{R}^2$ of Example 13.18, shown in Fig. 13.8. Going from left to right,
let $C_1 = (-\infty, a_1)$, $C_2 = \{a_1\}$, $C_3 = (a_1, a_2)$,
$C_4 = \{a_2\}$, $C_5 = (a_2, a_3)$, $C_6 = \{a_3\}$ and
$C_7 = (a_3, +\infty)$ be the (0)- and (1)-cells on the x-axis. If one looks at
the intersections of the cylinders $C_i \times \mathbb{R}$ with $A$, then it is
clear that $A \cap (C_i \times \mathbb{R})$ is semi-algebraically homeomorphic
to a product $C_i \times F_i$, where $F_i$ is a semi-algebraic subset of
$\mathbb{R}$. In this example, $F_1 = F_6 = \emptyset$,
$F_2 = F_4 = F_5 = \{b_1\} \subset \mathbb{R}$,
$F_3 = [b_2, b_3] \subset \mathbb{R}$, and $F_7 = \mathbb{R} \setminus \{b_4\}$.
In other words, the x-axis is decomposed into cells, such that $A$ looks like a
constant set above any two points in the same cell. One then says that the
projection map $\pi : \mathbb{R}^2 \to \mathbb{R}$ on the x-axis is trivial over
each of the cells $C_1, \ldots, C_7$.


We now formalize the intuition behind the example above. Let
$A \subseteq \mathbb{R}^n$ and $B \subseteq \mathbb{R}^m$ be two semi-algebraic
sets and let $f : A \to B$ be a continuous semi-algebraic map. We can see $A$
as a family of sets (i.e., fibers) $\{f^{-1}(b) \mid b \in B\}$. A
semi-algebraic trivialization of $f$ is a pair $(F, \lambda)$ consisting of a
semi-algebraic set $F \subseteq \mathbb{R}^N$, for some $N$, and a
semi-algebraic map $\lambda : A \to F$ such that
$(f, \lambda) : A \to B \times F$ is a homeomorphism.

Let $f : A \to B$ and suppose that $f$ has a semi-algebraic trivialization.
Then it is easy to show that all fibers are semi-algebraically homeomorphic to
each other.

We call $f$ semi-algebraically trivial if $f$ has a semi-algebraic
trivialization. Moreover, given $B' \subseteq B$, we say that $f$ is
semi-algebraically trivial over $B'$ if the restriction of $f$ to $f^{-1}(B')$
is semi-algebraically trivial.


THEOREM 13.21 (TRIVIALITY THEOREM) Let $f : A \to B$ be a continuous
semi-algebraic map as above. Then there is a finite partition
$B = B_1 \cup \cdots \cup B_k$ such that each $B_i$ is semi-algebraic and $f$
is semi-algebraically trivial over each $B_i$.
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3.3 The local conical structure of semi-algebraic sets 


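Let $A$ be a semi-algebraic set of $\mathbb{R}^n$ and $p$ a point of the
closure of $A$. Let $B^n(p, \varepsilon)$ be the closed ball with center $p$
and radius $\varepsilon$ and let $S^{n-1}(p, \varepsilon)$ be the sphere with
center $p$ and radius $\varepsilon$.

We denote by $\mathrm{Cone}(p, S^{n-1}(p, \varepsilon) \cap A)$ the cone with
vertex $p$ and base $S^{n-1}(p, \varepsilon) \cap A$, i.e., the set of points
in $\mathbb{R}^n$ defined by $\lambda p + (1 - \lambda)\vec{x}$ with
$\lambda \in [0, 1]$ and $\vec{x} \in S^{n-1}(p, \varepsilon) \cap A$. Let
$\|\cdot\| : \mathbb{R}^n \to \mathbb{R}$ denote the standard Euclidean norm.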


THEOREM 13.22 (LOCAL CONICAL STRUCTURE) For any point $p$ of the closure of a
semi-algebraic set $A$, there exists a number $\varepsilon > 0$ and a
semi-algebraic homeomorphism

$$h : B^n(p, \varepsilon) \cap A \to \mathrm{Cone}(p, S^{n-1}(p, \varepsilon) \cap A)$$

such that $\|h(\vec{x}) - p\| = \|\vec{x} - p\|$ and
$h|_{S^{n-1}(p, \varepsilon) \cap A} = \mathrm{Id}$.

A radius $\varepsilon > 0$ given by the previous theorem is called a cone
radius of $A$ in $p$.

The local conical structure theorem is a direct consequence of the triviality
theorem, for $f : A \to \mathbb{R}$ defined as $f(\vec{x}) = \|\vec{x} - p\|$.

We examine the local conical structure of semi-algebraic sets of $\mathbb{R}^2$
in more detail. Consider the semi-algebraic set of $\mathbb{R}^2$ depicted in
Fig. 13.9. For the point $p$, we have indicated a cone radius $\varepsilon$ of
$A$ in $p$ by the dotted lines. The intersection $S^1(p, \varepsilon) \cap A$
consists of a finite number of points and open intervals. If we denote open
intervals that belong to $A$ by $R^+$ and intervals that belong to the
complement of $A$ by $R^-$, and if we similarly indicate points belonging to
$A$ by $L^+$ and points belonging to the complement by $L^-$, we can describe
the intersection $S^1(p, \varepsilon) \cap A$ by means of a circular list over
the alphabet $\{L^+, L^-, R^+, R^-\}$. We call this circular list the cone type
of $A$ in $p$. The symbols $L$ and $R$ refer to lines and regions that arrive
at $p$. There are two exceptions, however. For a point in the topological
interior of $A$, we have that
$S^1(p, \varepsilon) = S^1(p, \varepsilon) \cap A$, which we denote by $F$ (for
full). On the other hand, for an isolated point of $A$, we have
$S^1(p, \varepsilon) \cap A = \emptyset$, which we denote by $E$ (for empty).
For instance, the cone type of $A$ in $p$ in Fig. 13.9 is given by





(AR ERE RE RL R ER ER), 


A semi-algebraic set $A$ of $\mathbb{R}^2$ also has a local conical structure
at infinity. To see this, we embed $\mathbb{R}^2$ as the (x, y)-plane in
$\mathbb{R}^3$ and map $A$ from this embedded plane onto the sphere
$S^2((0, 0, 1), 1)$, which rests on the (x, y)-plane, in the direction of its
north pole $(0, 0, 2)$. If we then add the north pole to this set as the point
at infinity of the semi-algebraic set, rotate the sphere such that $(0, 0, 2)$
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Figure 13.9. A semi-algebraic set A of R? and the cone type of A in its points p given by the 





circular list (L+R Lt R7 L7 Rt L*R-LtR- Lt RIL R*). 

Figure 13.10. Illustration of the stereographic projection $h$ (the figure
labels the points $(0, 0, 2)$ and $(0, 0, 0)$ and the plane $e(\mathbb{R}^2)$).


becomes the origin, and stereographically project back on the (x, y)-plane,
then the local conical structure of $(0, 0)$ in the resulting semi-algebraic
set reveals the conical structure of the point at infinity in $A$.

This implies that there exists an $\varepsilon > 0$ such that
$\{(x, y) \mid x^2 + y^2 \ge \varepsilon^2\} \cap A$ is homeomorphic to
$\{(\lambda x, \lambda y) \mid (x, y) \in S^1((0, 0), \varepsilon) \cap A \land \lambda \ge 1\}$.
We can indeed view the latter set as the cone with top $\infty$ and base
$S^1((0, 0), \varepsilon) \cap A$.

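More formally, consider the embedding $e$ of $\mathbb{R}^2$ in $\mathbb{R}^3$
that maps $(x, y)$ to $(x, y, 0)$. Let $\sigma$ be the reflection of
$\mathbb{R}^3$ defined by $(x, y, z) \mapsto (x, y, 2 - z)$. Finally, let
$h : e(\mathbb{R}^2) \cup \{\infty\} \to S^2((0, 0, 1), 1)$ be the
homeomorphism that maps the Alexandrov one-point compactification of
$e(\mathbb{R}^2)$ stereographically onto the sphere $S^2((0, 0, 1), 1)$, i.e.,

$$h(x, y, 0) = \left(\frac{4x}{x^2 + y^2 + 4}, \frac{4y}{x^2 + y^2 + 4}, \frac{2(x^2 + y^2)}{x^2 + y^2 + 4}\right)$$

and $h(\infty) = (0, 0, 2)$.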
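The maps $e$, $h$ and $\sigma$ can be tried out on concrete points. The following Python sketch is only an illustration: the function names are ours, and the closed formula used for $h$ is the one obtained by intersecting the line through $(0, 0, 2)$ and $(x, y, 0)$ with the sphere. Exact rational arithmetic is assumed so that the identities can be checked without rounding.

```python
from fractions import Fraction

def h(x, y):
    """Image on S^2((0,0,1),1) of the plane point (x, y, 0),
    projecting stereographically from the north pole (0, 0, 2)."""
    x, y = Fraction(x), Fraction(y)
    d = x * x + y * y + 4
    return (4 * x / d, 4 * y / d, 2 * (x * x + y * y) / d)

def sigma(p):
    """Reflection (x, y, z) -> (x, y, 2 - z); it swaps the two poles."""
    x, y, z = p
    return (x, y, 2 - z)

def h_inverse(p):
    """Inverse of h for sphere points other than the north pole."""
    x, y, z = p
    if z == 2:
        raise ValueError("the north pole corresponds to the point at infinity")
    t = Fraction(2) / (2 - z)
    return (t * x, t * y)

def through_infinity(x, y):
    """Apply h, then sigma, then the inverse of h to (x, y) != (0, 0)."""
    return h_inverse(sigma(h(x, y)))
```

On points different from the origin, the composite map works out to $(x, y) \mapsto 4(x, y)/(x^2 + y^2)$, so far-away points of $A$ are brought close to $(0, 0)$, which is why the cone type at the origin of the transformed set describes $A$ near infinity.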

We define the cone type of $A$ in $\infty$ to be the cone type of the point
$(0, 0)$ in the set
$e^{-1}(h^{-1}(\sigma(\{(0, 0, 2)\} \cup h(e(A)))) \setminus \{\infty\})$. We
remark that the cone type of $A$ in $\infty$ is $(E)$ if and only if $A$ is a
bounded subset of $\mathbb{R}^2$.

Let $A$ be a semi-algebraic set in $\mathbb{R}^2$ and let $p$ be a point in the
closure of $A$. Then it is easily verified that for any two cone radii
$\varepsilon_1$ and $\varepsilon_2$ of $A$ in $p$ (and in $\infty$)
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p 



Figure 13.11. Types of regular points of a closed semi-algebraic set:
$\Pi(p) = (R^- L^+ R^+ L^+)$, $\Pi(q) = F$ and $\Pi(r) = (R^- L^+ R^- L^+)$.


we get the same cone type. In other words, the notion of the cone type of $A$
in $p$ (and in $\infty$) is well-defined.

Let $\mathcal{C}$ be the set of all possible cone types of semi-algebraic sets
of $\mathbb{R}^2$. We define:


DEFINITION 13.23 Let $A$ be a semi-algebraic set of $\mathbb{R}^2$. The
point-structure of $A$ is the function $\Pi(A)$ from
$\overline{A} \cup \{\infty\}$ to $\mathcal{C}$ that maps each point in the
closure $\overline{A}$ of $A$ to its cone type.


An important observation is the following: 


PROPOSITION 13.24 The number of points in the closure of $A$ with a cone type
different from $(R^- L^- R^+ L^-)$, $(R^- L^+ R^+ L^+)$, $(R^- L^+ R^- L^+)$,
$(R^+ L^- R^+ L^-)$ and $F$ is finite.

We call a point in the closure of $A$ singular if it has a cone type different
from $(R^- L^- R^+ L^-)$, $(R^- L^+ R^+ L^+)$, $(R^- L^+ R^- L^+)$,
$(R^+ L^- R^+ L^-)$ or $F$. Otherwise, we say that a point in the closure of
$A$ is regular. It can be shown that only a finite number of cone types appear
in $A$. Moreover, if $A$ contains a point of cone type $(R^- L^- R^+ L^-)$,
$(R^- L^+ R^+ L^+)$, $(R^- L^+ R^- L^+)$, $(R^+ L^- R^+ L^-)$ or $F$, then it
must have infinitely many points of this cone type.
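Since cone types are circular lists, two lists that differ only by a rotation describe the same cone type. The following Python sketch shows one way to test regularity under that convention. The representation of a cone type as a tuple of strings such as "R-" and "L+", and the function names, are assumptions made for illustration.

```python
# The four regular circular lists named above; "F" is handled separately.
REGULAR_LISTS = [
    ("R-", "L-", "R+", "L-"),
    ("R-", "L+", "R+", "L+"),
    ("R-", "L+", "R-", "L+"),
    ("R+", "L-", "R+", "L-"),
]

def rotations(seq):
    """All rotations of a sequence, i.e. all readings of a circular list."""
    seq = tuple(seq)
    return {seq[i:] + seq[:i] for i in range(len(seq))}

def is_regular(cone_type):
    """cone_type is "F", "E", or a sequence of symbols read around the circle."""
    if cone_type == "F":
        return True
    if cone_type == "E":
        return False  # isolated points are singular
    return any(tuple(cone_type) in rotations(r) for r in REGULAR_LISTS)
```

For these four lists, reading the circle in the opposite direction again yields a rotation of the same list, so only rotations need to be compared.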























EXAMPLE 13.25 Suppose that $A$ is a closed semi-algebraic set of
$\mathbb{R}^2$. By the observation above, there are infinitely many points in
which $A$ has one of the following three cone types: $(R^- L^+ R^+ L^+)$,
$(R^- L^+ R^- L^+)$, and $(F)$. Fig. 13.11 illustrates these different cone
types.


In Sec. 5, we will show the importance of the cone types and point structure 
for the expressibility of first-order logic over the reals. 


3.4 Triangulations 


An interesting question is whether semi-algebraic sets can exhibit more
topological properties than semi-linear sets. The following result shows that
this is not the case. Roughly speaking, the triangulation theorem states that
each semi-algebraic set is homeomorphic to a semi-linear one. To make this more
precise, we need the following notation.

Let $a_0, a_1, \ldots, a_k$ be $k + 1$ affinely independent points in
$\mathbb{R}^n$. A k-simplex $(a_0, a_1, \ldots, a_k)$ is the set of points

$$(a_0, a_1, \ldots, a_k) = \Big\{\sum_{i=0}^{k} t_i a_i \;\Big|\; \text{all } t_i > 0, \sum_{i=0}^{k} t_i = 1\Big\} \subset \mathbb{R}^n.$$

Note that a k-simplex is of dimension $k$. Let $\sigma$ be a k-simplex given by
$(a_0, a_1, \ldots, a_k)$. The closure of $\sigma$, denoted by
$\mathrm{cl}(\sigma)$, is the set of points

$$\mathrm{cl}(\sigma) = \Big\{\sum_{i=0}^{k} t_i a_i \;\Big|\; \text{all } t_i \ge 0, \sum_{i=0}^{k} t_i = 1\Big\}.$$

A face of $\sigma$ is a simplex corresponding to any nonempty subset of
$(a_0, a_1, \ldots, a_k)$. A complex $K$ in $\mathbb{R}^n$ is a finite
collection of simplices in $\mathbb{R}^n$, such that for all
$\sigma_1, \sigma_2 \in K$, either
$\mathrm{cl}(\sigma_1) \cap \mathrm{cl}(\sigma_2) = \emptyset$, or
$\mathrm{cl}(\sigma_1) \cap \mathrm{cl}(\sigma_2) = \mathrm{cl}(\tau)$, where
$\tau$ is a common face of $\sigma_1$ and $\sigma_2$. We denote by $|K|$ the
union of the simplices of $K$. From the definition it is clear that $|K|$ is a
bounded semi-linear set of $\mathbb{R}^n$.
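The difference between a simplex (all $t_i > 0$) and its closure (all $t_i \ge 0$) can be made concrete for a 2-simplex in the plane. The following Python sketch computes the coefficients $t_0, t_1, t_2$ of a point by Cramer's rule. The function names are ours, and the coordinates are assumed to be integers or `Fraction` values so that the comparisons with 0 are exact.

```python
from fractions import Fraction

def barycentric(p, a0, a1, a2):
    """Coefficients (t0, t1, t2) with p = t0*a0 + t1*a1 + t2*a2 and t0+t1+t2 = 1."""
    (x, y), (x0, y0), (x1, y1), (x2, y2) = p, a0, a1, a2
    det = (x1 - x0) * (y2 - y0) - (x2 - x0) * (y1 - y0)
    if det == 0:
        raise ValueError("the three points are not affinely independent")
    t1 = Fraction((x - x0) * (y2 - y0) - (x2 - x0) * (y - y0), det)
    t2 = Fraction((x1 - x0) * (y - y0) - (x - x0) * (y1 - y0), det)
    return (1 - t1 - t2, t1, t2)

def in_simplex(p, a0, a1, a2):
    """Membership in the (open) 2-simplex: all coefficients strictly positive."""
    return all(t > 0 for t in barycentric(p, a0, a1, a2))

def in_closure(p, a0, a1, a2):
    """Membership in the closure: all coefficients non-negative."""
    return all(t >= 0 for t in barycentric(p, a0, a1, a2))
```

A point on an edge of the triangle lies in the closure but not in the 2-simplex itself; it belongs to a 1-dimensional face instead.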


THEOREM 13.26 Let $A$ be a semi-algebraic set of $\mathbb{R}^n$. Then there
exists a complex $K$ in $\mathbb{R}^n$ and a semi-algebraic homeomorphism $h$
such that $h(A) = |K|$, i.e., $A$ is semi-algebraically homeomorphic to $|K|$.


4. Query evaluation through quantifier elimination 


In this section, we address in more detail how queries, expressible in the
logics FO(+, <, 0, 1) and FO(+, ×, <, 0, 1), may be evaluated.

When we have a query expressed by a formula $\varphi(x_1, \ldots, x_m)$ over
the vocabulary $(+, \times, <, 0, 1, S_1, \ldots, S_k)$ and we want to evaluate
this query on a concrete input database over the schema
$S = (S_1, \ldots, S_k)$, given by quantifier-free formulas
$\varphi_{S_1}(x_1, \ldots, x_{n_1}), \ldots, \varphi_{S_k}(x_1, \ldots, x_{n_k})$
in FO(+, ×, <, 0, 1) ($n_i$ is the arity of $S_i$, $i = 1, \ldots, k$), we can
proceed as follows:


■ we plug in the descriptions
$\varphi_{S_1}(x_1, \ldots, x_{n_1}), \ldots, \varphi_{S_k}(x_1, \ldots, x_{n_k})$
of the input relations into the query formula $\varphi(x_1, \ldots, x_m)$ (this
means that we replace each occurrence of some $S_i(v_1, \ldots, v_{n_i})$ in
the query formula by $\varphi_{S_i}(v_1, \ldots, v_{n_i})$);


■ this results in a formula over the vocabulary $(+, \times, <, 0, 1)$ that may
contain quantifiers introduced by the query formula;


■ next, we eliminate these quantifiers and obtain a quantifier-free
description in FO(+, ×, <, 0, 1) of the output relation.


We remark that the same query evaluation strategy may be applied when × is
omitted.


EXAMPLE 13.27 The formula

$$\exists \varepsilon\, (\varepsilon \ne 0 \land \forall x' \forall y'\, ((x - x')^2 + (y - y')^2 < \varepsilon^2 \to S(x', y')))$$

over the schema $(+, \times, <, 0, 1)$ expresses the topological interior of a
set $S$ in $\mathbb{R}^2$. When we want to evaluate the query expressed by this
formula on the disk given by $x^2 + y^2 < 4$, we first replace $S(x', y')$ in
the query formula by $(x')^2 + (y')^2 < 4$. This gives rise to the formula

$$\psi(x, y) = \exists \varepsilon\, (\varepsilon \ne 0 \land \forall x' \forall y'\, ((x - x')^2 + (y - y')^2 < \varepsilon^2 \to (x')^2 + (y')^2 < 4)).$$

The formula $\psi$ contains three quantifiers. When we eliminate the
quantifiers from $\psi$, we obtain as canonical quantifier-free description of
the output the formula $x^2 + y^2 < 4$.


The reader might wonder why we bother about eliminating quantifiers. In- 
deed, simply plugging in the FO(+, x, <,0, 1)-formulas of the input relations 
into the query formula yields a formula in FO(+, x, <, 0, 1) that also describes 
the output. And these formulas, even though containing quantifiers, may be 
used in turn to describe an input to further queries. Even without eliminating 
quantifiers, we would have a formalism that has this closure property. Closure 
is a much desired property in database theory where it is considered important 
that outputs of queries may serve as input for further queries (compositionality 
of queries). 

So, why is it so relevant to eliminate quantifiers? The answer lies in the
question of what we can do with these formulas that describe output relations.
Or rather, we should ask what we would like to do with these defining formulas.

Typical questions that are asked in database practice are the following:


■ Membership test: for example, does $(1, 2)$ belong to the output relation
given by
$\varphi(u, v) = \exists x \exists y\, (u = x + y \land ((x = 1 \lor x = 2) \land y = 3)) \lor u = v$?


■ Emptiness test: for example, is the set $S$ given by
$\exists x \exists y\, (z = x + y \land ((x = 1 \lor x = 2) \land y = 3))$
empty?


We observe that both questions, which are relevant to database practice,
amount to deciding the truth of sentences of FO(+, ×, <, 0, 1). Indeed, to
answer the membership test the truth of the sentence

$$\exists x \exists y\, (1 = x + y \land ((x = 1 \lor x = 2) \land y = 3)) \lor 1 = 2$$

has to be decided. For the second test, the truth of the sentence

$$\exists z \exists x \exists y\, (z = x + y \land ((x = 1 \lor x = 2) \land y = 3))$$

has to be determined.

Deciding the truth of sentences is possible in decidable theories like
$(\mathbb{R}, +, <, 0, 1)$ and $(\mathbb{R}, +, \times, <, 0, 1)$. We discuss
decision procedures for these theories in the subsections that follow.
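For the two sample questions above, the answers can be checked directly, because the subformula $(x = 1 \lor x = 2) \land y = 3$ admits only finitely many witnesses. The following Python sketch exploits exactly this special situation. It is not a decision procedure for the general case, where the quantifiers range over all reals, and the names `WITNESSES`, `phi` and `output_set` are ours.

```python
from itertools import product

# The only pairs (x, y) satisfying (x = 1 or x = 2) and y = 3.
WITNESSES = list(product((1, 2), (3,)))

def phi(u, v):
    """phi(u, v): (exists x, y: u = x + y and (x, y) is a witness) or u = v."""
    return any(u == x + y for x, y in WITNESSES) or u == v

def output_set():
    """The set of all z with z = x + y for some witness (x, y)."""
    return {x + y for x, y in WITNESSES}

membership = phi(1, 2)          # is (1, 2) in the output relation?
is_empty = not output_set()     # is the set S empty?
```

Here `membership` evaluates to `False` (neither 1 = 4, 1 = 5 nor 1 = 2 holds) and `is_empty` to `False`, since the set equals {4, 5}.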
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4.1 Quantifier elimination for FO(+, <, 0,1) 
The theory of $(\mathbb{R}, +, <, 0, 1)$ has the following quantifier
elimination property.


THEOREM 13.28 The theory of $(\mathbb{R}, +, <, 0, 1)$ admits quantifier
elimination. More specifically, this means that there is an algorithm that on
input of a formula $\varphi(x_1, \ldots, x_n)$ over $(+, <, 0, 1)$ returns a
quantifier-free formula $\psi(x_1, \ldots, x_n)$ that is equivalent to
$\varphi(x_1, \ldots, x_n)$ over $(\mathbb{R}, +, <, 0, 1)$.

We say that two formulas $\varphi(x_1, \ldots, x_n)$ and
$\psi(x_1, \ldots, x_n)$ are equivalent if they define the same n-ary relation
over $\mathbb{R}$.

For $(\mathbb{R}, +, <, 0, 1)$, there turns out to be a conceptually very
simple quantifier elimination procedure that goes back to Fourier in 1826 and
that was rediscovered by Motzkin in 1936 (Motzkin, 1936) and by several other
researchers, even as late as the second half of the 20th century. We sketch the
algorithm of Fourier now. Let us concentrate on the problem of eliminating a
single existential quantifier from a formula $\varphi(x_1, \ldots, x_{m-1})$ of
the form

$$\exists x_m\, \psi(x_1, \ldots, x_m)$$

where $\psi(x_1, \ldots, x_m)$ is a Boolean combination of atomic formulas of
FO(+, <, 0, 1). So, $\psi(x_1, \ldots, x_m)$ can be written as

$$\bigvee_{i=1}^{d} \bigwedge_{j=1}^{e_i} x_m \; \theta_{ij} \; c_{0ij} + \sum_{k=1}^{m-1} c_{kij} x_k$$

with $\theta_{ij} \in \{=, <, \le, >, \ge\}$. If we abbreviate the terms
$c_{0ij} + \sum_{k=1}^{m-1} c_{kij} x_k$ by $t_{ij}$, then we can remark that
for any values given to the variables $x_1, \ldots, x_{m-1}$, the terms
$t_{ij}$ can be ordered, let us say (after re-indexing) as
$t_1 \le t_2 \le \cdots \le t_k$. It is clear that for any two values of $x_m$
taken strictly between some $t_i$ and $t_{i+1}$ (see Fig. 13.12), the truth
value of $x_m \; \theta_{ij} \; c_{0ij} + \sum_{k=1}^{m-1} c_{kij} x_k$ is the
same. The same is true if we take any two values of $x_m$ strictly smaller than
$t_1$ or strictly larger than $t_k$.

Therefore, when the $x_1, \ldots, x_{m-1}$ vary, the existential quantifier in
$\exists x_m\, \psi(x_1, \ldots, x_m)$ is equivalently expressible by the
disjunction

$$\bigvee_{x_m = t_i \text{ or } x_m = \frac{1}{2}(t_i + t_j) \text{ or } x_m = \pm\infty} \psi(x_1, \ldots, x_{m-1}, x_m).$$

In this disjunction, for $x_m$ all values on and in between all of the $t_{ij}$
are considered. In the above formula, $x_m = \pm\infty$ can be achieved by
considering all values $t_i \pm 1$ for $x_m$. It is clear that the above
disjunction suffices to replace the quantifier.

Figure 13.12. The relevant values for $x_m$.


We remark that this procedure takes exponential space in the size of the input
formula and that the time complexity of this procedure is doubly exponential.


EXAMPLE 13.29 Suppose we want to eliminate the quantifier in

$$\varphi(x, y) = \exists z\, (x + z = y \land z < 2 + x).$$

To start with, we write this formula in the right form, namely
$\exists z\, (z = y - x \land z < 2 + x)$. When we apply the procedure
described above, we get the quantifier-free formula $\psi(x, y) =$

$$\begin{array}{l}
(y - x = y - x \land y - x < 2 + x) \;\lor \\
(2 + x = y - x \land 2 + x < 2 + x) \;\lor \\
(y - x - 1 = y - x \land y - x - 1 < 2 + x) \;\lor \\
(2 + x - 1 = y - x \land 2 + x - 1 < 2 + x) \;\lor \\
(y - x + 1 = y - x \land y - x + 1 < 2 + x) \;\lor \\
(2 + x + 1 = y - x \land 2 + x + 1 < 2 + x) \;\lor \\
(y - x + 2 + x = 2(y - x) \land y - x + 2 + x < 2(2 + x)).
\end{array}$$

As mentioned above, we remark that the $-\infty$ and $+\infty$ from the
algorithm are implemented by subtracting 1 from and adding 1 to the terms
$t_1 = y - x$ and $t_2 = 2 + x$. Also remark that many atomic formulas in this
expression for $\psi(x, y)$ are trivially true or false. So, $\psi(x, y)$ could
be further simplified.
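The test-point idea behind this example can be tried out pointwise: for fixed values of $x$ and $y$, the terms $t_1 = y - x$ and $t_2 = 2 + x$ become numbers, and it suffices to try the finitely many candidate values for $z$. The following Python sketch does this with exact rationals. It evaluates the disjunction for given parameter values rather than producing a symbolic formula, and the function names are ours.

```python
from fractions import Fraction
from itertools import combinations

def candidate_values(terms):
    """Candidate values for the eliminated variable: every term, every
    midpoint of two terms, and every term plus or minus 1."""
    ts = sorted(set(Fraction(t) for t in terms))
    points = set(ts)
    points.update((a + b) / 2 for a, b in combinations(ts, 2))
    points.update(t - 1 for t in ts)
    points.update(t + 1 for t in ts)
    return sorted(points)

def exists_z(x, y):
    """Truth of 'exists z: z = y - x and z < 2 + x', decided on candidates only."""
    t1, t2 = Fraction(y) - Fraction(x), 2 + Fraction(x)
    return any(z == t1 and z < t2 for z in candidate_values([t1, t2]))

def simplified(x, y):
    """What remains of psi(x, y) once trivially true or false atoms are dropped."""
    return y - x < 2 + x
```

For two terms this yields at most seven candidate values, matching the seven disjuncts of $\psi(x, y)$. Only the first disjunct can ever be true, so $\psi(x, y)$ simplifies to $y - x < 2 + x$.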


4.2 Quantifier elimination for FO(+, x, <, 0,1) 


In the 1930s, Alfred Tarski showed that FO(+, ×, <, 0, 1) has the algorithmic
quantifier elimination property too. Tarski published this result only in
1948 (Tarski, 1948).


THEOREM 13.30 There is an algorithm that on input of a
FO(+, ×, <, 0, 1)-formula $\varphi(x_1, \ldots, x_n)$ returns a quantifier-free
FO(+, ×, <, 0, 1)-formula $\psi(x_1, \ldots, x_n)$ that is equivalent to the
given formula $\varphi(x_1, \ldots, x_n)$ over
$(\mathbb{R}, +, \times, <, 0, 1)$.


The quantifier elimination procedure given by Tarski is based on a theorem 
by Sturm on real root counting and has a huge complexity (it is not elementary 
recursive), which makes it unsuitable for practical purposes. 


EXAMPLE 13.31 A well-known example of quantifier elimination is the following.
Consider the formula

$$\varphi(a, b, c) = a \ne 0 \land \exists x\, (ax^2 + bx + c = 0).$$

This formula describes triples $(a, b, c)$ for which the quadratic equation
$ax^2 + bx + c = 0$ has a real root. From high-school mathematics, we know that
$\varphi(a, b, c)$ is equivalent to the quantifier-free formula

$$\psi(a, b, c) = a \ne 0 \land (b^2 - 4ac \ge 0).$$
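Sturm's theorem on real root counting, on which Tarski's procedure is based, can be used to cross-check this equivalence on concrete coefficients. The following Python sketch counts the distinct real roots of a univariate polynomial with rational coefficients through the sign changes of its Sturm sequence at $-\infty$ and $+\infty$. It illustrates the root-counting ingredient only, not Tarski's elimination procedure; the function names and the coefficient-list representation (lowest degree first) are our assumptions.

```python
from fractions import Fraction

def trim(p):
    """Drop zero leading coefficients (coefficients are stored lowest degree first)."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def derivative(p):
    return trim([i * c for i, c in enumerate(p)][1:])

def remainder(a, b):
    """Remainder of the polynomial division of a by the nonzero polynomial b."""
    a, b = trim(a), trim(b)
    while a and len(a) >= len(b):
        q, shift = a[-1] / b[-1], len(a) - len(b)
        for i, c in enumerate(b):
            a[shift + i] -= q * c
        a = trim(a)  # the leading coefficient has been cancelled
    return a

def sturm_sequence(p):
    p = trim([Fraction(c) for c in p])
    if not p:
        raise ValueError("the zero polynomial has no Sturm sequence")
    seq = [p, derivative(p)]
    while seq[-1]:
        seq.append([-c for c in remainder(seq[-2], seq[-1])])
    return seq[:-1]  # drop the trailing zero polynomial

def sign_changes(signs):
    return sum(1 for s, t in zip(signs, signs[1:]) if s != t)

def count_real_roots(p):
    """Number of distinct real roots of p, by Sturm's theorem."""
    seq = sturm_sequence(p)
    at_plus = [1 if q[-1] > 0 else -1 for q in seq]
    at_minus = [s * (-1) ** (len(q) - 1) for s, q in zip(at_plus, seq)]
    return sign_changes(at_minus) - sign_changes(at_plus)
```

For $a \ne 0$, calling `count_real_roots([c, b, a])` returns a positive number exactly when $b^2 - 4ac \ge 0$. A double root is counted once.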


Improvements to Tarski's procedure were proposed by Seidenberg, 1954, but a
major breakthrough was achieved in Collins, 1975, which introduced the
cylindrical algebraic decomposition (CAD) of semi-algebraic sets. Collins'
algorithm takes as input a system of polynomial equalities and inequalities
that describe a semi-algebraic set in some $\mathbb{R}^n$. The algorithm
returns a partitioning of $\mathbb{R}^n$ into a finite number of cells that are
described by sign conditions on polynomials in $n$ variables. Each cell is
accompanied by a sample point, which allows us to determine the sign
conditions of these polynomials in that cell. In the worst case, the algorithm
of Collins to compute a CAD has a sequential time complexity that is doubly
exponential in the number of variables. It was, however, the first quantifier
elimination algorithm to be implemented. It has undergone numerous
improvements, resulting in the implementation QEPCAD (Quantifier Elimination by
Partial Cylindrical Algebraic Decomposition) by Hong, 1990. We refer to
Caviness and Johnson, 1998 for a description at length of the current state of
CAD.

A formal definition of a CAD was given in Sec. 3. It is beyond the scope of 
this chapter to give a full description of Collins’ CAD algorithm, but we want 
to give an idea of the major steps in the algorithm. 

Suppose the input of the CAD algorithm is an FO(+, ×, <, 0, 1)-formula in
prenex normal form

$$\varphi(u_1, \ldots, u_m) = \exists x_1 \cdots \exists x_n\, \psi(u_1, \ldots, u_m, x_1, \ldots, x_n).$$

Here, $\psi(u_1, \ldots, u_m, x_1, \ldots, x_n)$ is a Boolean combination of
expressions of the form $p = 0$, $p > 0$, $p \ge 0$ or $p \ne 0$, where $p$ is
a polynomial. It is customary in the quantifier elimination literature to
distinguish between the variables $(x_1, \ldots, x_n)$ and the parameters
$(u_1, \ldots, u_m)$ of the given formula
$\psi(u_1, \ldots, u_m, x_1, \ldots, x_n)$. The goal is to eliminate the
variables from the formula $\varphi(u_1, \ldots, u_m)$ via the computation of a
CAD of $\psi(u_1, \ldots, u_m, x_1, \ldots, x_n)$.

The main steps in the construction of a CAD of the set
$A = \{(u_1, \ldots, u_m, x_1, \ldots, x_n) \in \mathbb{R}^{m+n} \mid \psi(u_1, \ldots, u_m, x_1, \ldots, x_n)\}$
are:


■ the projection phase: here the (m + n)-dimensional semi-algebraic set $A$ is
iteratively projected onto lower dimensional spaces
($\mathbb{R}^{m+n} \to \mathbb{R}^{m+n-1} \to \cdots \to \mathbb{R}^1$);


■ the basis phase: here real roots are isolated in $\mathbb{R}^1$ and sample
points are computed (using numeric methods);


■ the extension phase: here, again in an iterative way
($\mathbb{R}^1 \to \mathbb{R}^2 \to \cdots \to \mathbb{R}^{m+n-1} \to \mathbb{R}^{m+n}$),
a lifting to higher dimensions takes place. Stacks of cells (sections and
sectors) are built, iteratively, together with sample points.


The output of the CAD algorithm is a sequence
$\mathcal{C}_1, \ldots, \mathcal{C}_{m+n}$, where each $\mathcal{C}_i$ is a
partition of $\mathbb{R}^i$ into cells. Each cell $C$ is given by means of a
quantifier-free FO(+, ×, <, 0, 1)-formula $\varphi_C$ and a sample point. In
particular, for each of the cells in the resulting decomposition of
$\mathbb{R}^{m+n}$ it is recorded whether it belongs to the given
semi-algebraic set $A$ or not.

By construction, the set
$\{(u_1, \ldots, u_m) \in \mathbb{R}^m \mid \varphi(u_1, \ldots, u_m)\}$
consists of all cells $C$ in $\mathcal{C}_m$ for which there exists a cell $C'$
in $\mathcal{C}_{m+n}$ which is in the stack $C \times \mathbb{R}^n$ of $C$ and
which belongs to $A$. Hence, a quantifier-free equivalent formula for
$\varphi(u_1, \ldots, u_m)$ is obtained as a disjunction of all formulas
$\varphi_C$ describing cells $C$ in $\mathcal{C}_m$ such that in the stack
above $C$ a cell of $\mathcal{C}_{m+n}$ belongs to $A$.
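The one-dimensional part of this construction is easy to illustrate. Given the finitely many special points on the real line, the cells are the points themselves and the open intervals between and around them, and one sample point per cell suffices to read off the sign of a polynomial on that cell. The following Python sketch assumes that the special points are already known and rational; in an actual CAD computation they are real algebraic numbers obtained by root isolation. The names `cells_1d` and `sign_at` are ours.

```python
from fractions import Fraction

def cells_1d(points):
    """Cells of the real line induced by the given points, as triples
    (kind, (lower, upper), sample); None stands for an infinite endpoint."""
    ps = sorted(set(Fraction(p) for p in points))
    if not ps:
        return [("interval", (None, None), Fraction(0))]
    cells = [("interval", (None, ps[0]), ps[0] - 1)]
    for i, p in enumerate(ps):
        cells.append(("point", (p, p), p))
        if i + 1 < len(ps):
            nxt = ps[i + 1]
            cells.append(("interval", (p, nxt), (p + nxt) / 2))
    cells.append(("interval", (ps[-1], None), ps[-1] + 1))
    return cells

def sign_at(poly, x):
    """Sign (-1, 0 or 1) at x of a polynomial given by its coefficients,
    lowest degree first."""
    value = sum(c * x ** i for i, c in enumerate(poly))
    return (value > 0) - (value < 0)
```

With $k$ special points this yields $k$ point cells and $k + 1$ intervals. For the points $-1$ and $1$ and the polynomial $x^2 - 1$, the signs at the five sample points are $+, 0, -, 0, +$.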


EXAMPLE 13.32 We illustrate the CAD algorithm using the three-dimensional set
given by the quantifier-free FO(+, ×, <, 0, 1)-formula

$$x^2 + y^2 + z^2 < 1 \;\lor\; (x^2 + y^2 + (z - 2)^2 = 1 \land z \le 5/2) \;\lor\; (x^2 + y^2 + (z - 3)^2 = 1 \land z > 5/2).$$

This set is depicted in Fig. 13.13.

In the projection phase this three-dimensional set is projected on the
(x, z)-plane and then this projected set is in turn projected on the z-axis
(details omitted). On the real line certain "special points" are determined. In
Fig. 13.14, these special points are coloured grey on the z-axis. These special
points are always finite in number and they partition the line $\mathbb{R}$
into a finite number of points and open intervals (two of which are unbounded).

Figure 13.13. An example of a semi-algebraic set in $\mathbb{R}^3$.

Figure 13.14. The 1- and 2-dimensional induced CADs of the semi-algebraic set of Fig. 13.13.


In the example of Fig. 13.13 and 13.14, there are 7 points and 8 intervals.
This partition is called the one-dimensional induced CAD of the given set.
Next, in the extension phase, stacks are built on the one-dimensional CAD.
The stack above the second interval, for instance, consists of two curves
(called sections) and three regions (called sectors), two of which are
unbounded. The cells in these stacks form the two-dimensional induced CAD of
the given set. Finally, stacks are built on these cells, resulting in the CAD
of the given set, or to be more precise, of the description of the given set.
For each of the cells in this decomposition of $\mathbb{R}^3$ it is recorded
whether it belongs to the given semi-algebraic set or not.

During the 1990s, more efficient quantifier elimination algorithms were
proposed. In 1990, Heintz, Roy and Solerno showed that the sequential time
complexity is doubly exponential in the number of quantifier alternations,
rather than in the number of quantifiers (Heintz et al., 1993). Later on,
Heintz et al. showed single exponential complexity when alternative data
structures, such as arithmetic Boolean circuits, are used to store systems of
polynomial equalities and inequalities (Heintz et al., 1993). In the TERA
project, the software Kronecker was developed, and it is for the moment the
most efficient software for quantifier elimination over the reals
(TERA-project, 1993). The Kronecker implementation is described in (Giusti et
al., 2001). We refer to (Basu et al., 2003b) for a detailed overview of
algorithms in real algebraic geometry.


5. Expressiveness results 


In this section, we discuss some results concerning the expressive power 
of FO(+, x,<,0,1) and FO(+, <,0,1) as query languages for constraint 
databases. 

The development of constraint databases has given rise to two directions of 
research. Firstly, classical relational database questions have been reconsid- 
ered. For instance, it is known that graph connectivity of finite relations is not 
expressible in first-order logic over relations (the same holds for other properties 
such as parity, majority, etc.). These expressiveness results can be re-addressed 
in the presence of arithmetical operations. Indeed, when we assume that the 
finite relations are embedded in the reals, we can ask whether connectivity, 
parity, majority, etc., are expressible when the vocabulary of first-order logic is
extended with +, ×, <, 0 and 1. Secondly, expressiveness questions related to
the possibility of representing infinite relations have been studied. In this sec- 
tion, we start by giving some results on finite relations and then show how they 
help to settle questions concerning the expressive power of FO(+, x, <, 0,1) 
on infinite relations. 


5.1 Expressiveness results for finite databases 


Here, we state a generic collapse result which allows us to reduce (or
collapse) expressiveness questions in the presence of arithmetic to the
arithmetic-free case (Benedikt et al., 1996). We illustrate its implications
for the first-order expressiveness of properties of finite databases over the
reals. We also give the dichotomy theorem, which gives a bound on the size of
the query result (in case it is finite) for first-order expressible queries
(Benedikt and Libkin, 2000). This bound can be used to show inexpressibility
results, as we shall illustrate.

Consider the following decision problems on finite relations:


■ The decision problem MAJORITY for two finite sets $S_1$ and $S_2$ is:
MAJORITY($S_1$, $S_2$) is true if and only if $S_2 \subseteq S_1$ and
$|S_1| < 2|S_2|$;


■ The decision problem PARITY for a finite set $S$ is: PARITY($S$) is true if
and only if $|S|$ is even.


The proof of the following lemma is a routine exercise in finite-model theory
(Ebbinghaus et al., 1984). It can, e.g., be proven using the well-known
technique of Ehrenfeucht-Fraïssé games. This lemma holds for arbitrary finite
structures.


LEMMA 13.33 On finite structures over the signature $(<, S_1, S_2)$, the
decision problem MAJORITY($S_1$, $S_2$) is not expressible in
FO($<, S_1, S_2$). Likewise, on finite structures over the signature $(<, S)$,
the decision problem PARITY($S$) is not expressible in FO($<, S$).


Benedikt, Dong, Libkin and Wong proved that any first-order formula over 
the reals that is invariant under monotone bijections from R to R is equiva- 
lently expressible on finite relations in the restriction of first-order logic that 
only uses order constraints (Benedikt et al., 1996). This collapse result was a 
breakthrough in the line of research towards understanding of the expressive 
power of first-order logic over the reals and related structures (Belegradek et al., 
1996; Benedikt and Libkin, 1996; Benedikt and Libkin, 1997; Grumbach and 
Su, 1995; Grumbach et al., 1995; Paredaens et al., 1995; Stolboushkin and 
Taitslin, 1996). 

Consider structures over the vocabulary
$(+, \times, <, 0, 1, S_1, \ldots, S_k)$ that are expansions of $\mathbb{R}$
with $k$ finite relations on $\mathbb{R}$. We call such structures finite
structures over the reals (to emphasize the difference with finite structures
in the sense of relational databases). A first-order formula over the
vocabulary $(+, \times, <, 0, 1, S_1, \ldots, S_k)$ is called order-generic if
on such structures, it is invariant under monotone bijections
$f : \mathbb{R} \to \mathbb{R}$. Benedikt, Dong, Libkin, and Wong showed the
following (Benedikt et al., 1996):


THEOREM 13.34 (COLLAPSE THEOREM) For each order-generic formula in
FO($+, \times, <, 0, 1, S_1, \ldots, S_k$), there exists a formula in
FO($<, S_1, \ldots, S_k$) that is equivalent to it on finite structures over
the reals. Furthermore, in the latter formula the quantifiers may be assumed to
range only over the constants actually occurring in the relations
$S_1, \ldots, S_k$.


The following lemma, which specializes Lemma 13.33 from general finite 
ordered structures to finite structures over the reals, now follows directly from 
Lemma 13.33, Theorem 13.34 and the observation that the properties PARITY 
and MAJORITY of finite structures over the reals are invariant under monotone 
bijections from R to R. 


LEMMA 13.35 On finite structures over the reals with signature
$(+, \times, <, 0, 1, S_1, S_2)$, the decision problem MAJORITY($S_1$, $S_2$)
is not first-order expressible. Similarly, on finite structures over the reals
with signature $(+, \times, <, 0, 1, S)$, the decision problem PARITY($S$) is
not first-order expressible.


Apart from the above collapse results, there are also other results that are 
useful for showing that the expressive power of FO(+, x, <,0,1) is rather 
limited on finite structures over the reals. 


THEOREM 13.36 (DICHOTOMY THEOREM) Let $\varphi$ be a
FO($+, \times, <, 0, 1, S_1, \ldots, S_k$)-formula. There exists a polynomial
$p_\varphi$ such that for any finite structure over the reals $D$, the query
result $\varphi(D)$ is either infinite or its size is bounded by
$p_\varphi(|D|)$, where $|D|$ denotes the number of elements in the structure
$D$.


We provide an example of how to use the dichotomy theorem for showing 
inexpressibility results in the next section. 


5.2 Inexpressibility results for infinite databases 


We now apply the dichotomy theorem to obtain an inexpressibility result for
infinite databases. The example that we want to discuss concerns the linear
$\varepsilon$-approximation of semi-algebraic sets.


DEFINITION 13.37 Let $A$ be a semi-algebraic set of $\mathbb{R}^2$. A (linear)
$\varepsilon$-approximation of $A$ is a semi-linear set $B$ of $\mathbb{R}^2$
which is homeomorphic to $A$ via a homeomorphism
$h : \mathbb{R}^2 \to \mathbb{R}^2$, and such that for any $p \in A$,

$$d(p, h(p)) < \varepsilon.$$


THEOREM 13.38 Let $\varepsilon > 0$ be a real number. There is no
FO($+, \times, <, 0, 1, S$)-formula that expresses a linear
$\varepsilon$-approximation of a relation $S$ in $\mathbb{R}^2$.


Proof Consider the query $Q$ that returns the empty set if the relation $S$
does not consist of three non-collinear points, and otherwise returns the
corner points of an $\varepsilon$-approximation of the circle determined by the
three points of $S$. Here, a corner point is a point in which two straight-line
segments make an angle different from 180 degrees.

Clearly, the construction of a circle through three points is expressible in
FO(+, ×, <, 0, 1) (the reader may want to verify this!). The same holds for
the selection of the corner points of a semi-linear set (the reader may want to
verify this as an exercise too). Hence, if we assume that the
$\varepsilon$-approximation query can be expressed in FO(+, ×, <, 0, 1, S),
then $Q$ is also expressible in FO(+, ×, <, 0, 1, S) by a formula $\varphi$.
However, the number of corner points, which is equal to $|\varphi(D)|$, can be
made arbitrarily large by choosing $D$ to consist of three points that are far
enough apart. This contradicts the dichotomy theorem, which guarantees the
existence of a polynomial $p_\varphi$ such that the size of the output of
$\varphi$, when applied to $D$, is bounded by $p_\varphi(|D|) = p_\varphi(3)$.
QED
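The claim that the circle through three non-collinear points is first-order expressible rests on the fact that its center is the solution of two linear equations (equal distance to the first and second point, and to the first and third point). The following Python sketch solves this system by Cramer's rule with exact rationals, using only addition, multiplication and division by a nonzero determinant. The function name `circle_through` is ours.

```python
from fractions import Fraction

def circle_through(p1, p2, p3):
    """Center and squared radius of the circle through three points,
    or None if the points are collinear."""
    (x1, y1), (x2, y2), (x3, y3) = [(Fraction(a), Fraction(b)) for a, b in (p1, p2, p3)]
    det = (x2 - x1) * (y3 - y1) - (x3 - x1) * (y2 - y1)
    if det == 0:
        return None  # collinear points determine no circle
    # (x2 - x1) * cx + (y2 - y1) * cy = b1 and (x3 - x1) * cx + (y3 - y1) * cy = b2
    b1 = (x2 ** 2 - x1 ** 2 + y2 ** 2 - y1 ** 2) / 2
    b2 = (x3 ** 2 - x1 ** 2 + y3 ** 2 - y1 ** 2) / 2
    cx = (b1 * (y3 - y1) - b2 * (y2 - y1)) / det
    cy = ((x2 - x1) * b2 - (x3 - x1) * b1) / det
    r2 = (x1 - cx) ** 2 + (y1 - cy) ** 2
    return (cx, cy), r2
```

Moving the three points further apart makes the squared radius arbitrarily large, which is what forces the number of corner points of any $\varepsilon$-approximation to grow in the proof above.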


We remark that an alternative proof of Theorem 13.38 can be obtained using 
the uniform finiteness property of semi-algebraic sets instead of the dichotomy 
theorem. 


5.3 Expressing topological properties of spatial databases 


In this section, we will show that topological connectivity of planar geometric
figures is not expressible in FO(+, ×, <, 0, 1). First, we remark that
topological connectivity of a set in the plane is a query that is invariant
under topological transformations of the plane, i.e., it is invariant under
isotopies $i : \mathbb{R}^2 \to \mathbb{R}^2$ (an isotopy is an
orientation-preserving homeomorphism, i.e., it can be seen as a stretching
transformation of the plane seen as a rubber sheet). Such queries are called
topological queries. Formally, a query $Q$ (over an input schema with one
binary relation $S$) is called topological if for any two instances $A$ and $B$
of $S$ for which there is an isotopy $i : \mathbb{R}^2 \to \mathbb{R}^2$ such
that $i(A) = B$, we have $i(Q(A)) = Q(B)$. This concept is illustrated in
Fig. 13.15.

Figure 13.15. The query $Q$ is topological.

We remark that deciding whether a query is topological is undecidable. For 
completeness we give the proof first presented in Paredaens et al., 1994. 


THEOREM 13.39 Testing whether a query expressible in FO(+, x, <,0,1) 
is topological is undecidable. 


Proof Let S be the database schema consisting of a single unary relation S.
For other schemas, the proof is similar. We will reduce the problem of deciding
the truth of sentences of the ∀*-fragment of number theory to the problem of
deciding whether a query is topological. The ∀*-fragment of number theory is
known to be undecidable since Hilbert's 10th problem can be reduced to it.

We encode a natural number n by the finite unary relation enc(n) = {0, 1, 2,
..., n}. A k-dimensional vector of natural numbers (n_1, n_2, ..., n_k) is encoded
by the relation

\[ \mathrm{enc}(n_1, n_2, \ldots, n_k) = \mathrm{enc}(n_1) \cup (\mathrm{enc}(n_2) + n_1 + 2) \cup \cdots \cup (\mathrm{enc}(n_k) + n_1 + 2 + \cdots + n_{k-1} + 2). \]

For a fixed k, the corresponding decoding is expressible in FO(+, ×, <, 0, 1)
(the reader might want to verify this).
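The encoding can be made concrete with a small sketch. The Python below is an illustration only: the function names enc, enc_vector and decode are hypothetical, and the decoder works directly on finite sets of integers rather than through a first-order formula. It relies on the observation that consecutive blocks in the encoding are separated by exactly one missing integer.

```python
def enc(n):
    """Encode a natural number n as the finite unary relation {0, 1, ..., n}."""
    return set(range(n + 1))


def enc_vector(ns):
    """Encode (n_1, ..., n_k): block j is enc(n_j) shifted by the sum of n_i + 2 over i < j."""
    encoded, offset = set(), 0
    for n in ns:
        encoded |= {offset + v for v in enc(n)}
        offset += n + 2
    return encoded


def decode(relation):
    """Return the encoded vector as a tuple, or None if the set is not a valid encoding."""
    values = sorted(relation)
    if not values or values[0] != 0:
        return None
    vector, run_length = [], 1
    for prev, cur in zip(values, values[1:]):
        gap = cur - prev
        if gap == 1:        # still inside the current block
            run_length += 1
        elif gap == 2:      # exactly one missing integer: a new block starts
            vector.append(run_length - 1)
            run_length = 1
        else:               # any other gap cannot occur in an encoding
            return None
    vector.append(run_length - 1)
    return tuple(vector)
```

For instance, enc_vector((3, 0, 2)) is the set {0, 1, 2, 3, 5, 7, 8, 9}, and decode recovers (3, 0, 2) from it, while a set such as {0, 1, 4} is rejected because it contains a gap of size three.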
We now associate with each first-order sentence

\[ \forall x_1\, \forall x_2 \cdots \forall x_k\, \varphi(x_1, \ldots, x_k) \]

of number theory the following query Q_φ, expressed by an FO(+, ×, <, 0, 1)
formula over S:

Q_φ := if S encodes a vector (n_1, ..., n_k) ∈ N^k then
           if φ(n_1, ..., n_k) then return ∅
           else return {0}
       else return ∅.

It is easily verified that Q_φ is topological if and only if the sentence
∀x_1 ∀x_2 ··· ∀x_k φ(x_1, ..., x_k) is true. Therefore, if testing whether a query
expressible in FO(+, ×, <, 0, 1) is topological were decidable, so would be the
∀*-fragment of number theory. QED


In the remainder of this section we investigate which databases can be
distinguished by means of topological queries expressible in FO(+, ×, <, 0, 1, S).
By definition, isotopic databases cannot be distinguished
in such a way. It turns out that the reverse direction does not hold. In general, it
is not known when two databases are distinguishable by topological queries.
There are two exceptions, however. The first exception is for databases consisting
of a single closed semi-algebraic set; the second exception is for databases
consisting of, in general, many semi-algebraic sets, but in which only points
of "regular" cone types are allowed. The latter case is discussed in (Grohe
and Segoufin, 2002). We will only describe the case of closed databases. The
following results are taken from (Kuijpers et al., 2000).


Cut- and glue transformations and the first-order inexpressibility of
connectivity. Here we describe two transformations on closed semi-algebraic
sets in R². We call these transformations the cut- and the glue transformation.
To apply the cut transformation to a set A ⊆ R², one first needs
to create locally (via a rubber-sheet transformation of the plane) a rectangular
strip in A and then perform a cut as illustrated by the left-to-right direction in
Fig. 13.16. So, this cut removes a rectangular part of the strip and perforates
one of the remaining ends. The glue transformation is the inverse of the cut
(illustrated by the right-to-left arrow in Fig. 13.16).

We will show that whenever two planar sets differ from each other by a cut-
or glue transformation, they cannot be distinguished by a topological query
expressed by a sentence in FO(+, ×, <, 0, 1, S).
Figure 13.16. The cut and glue transformations on figures in R².


THEOREM 13.40 Let A and B be closed semi-algebraic sets in R². If B is
obtained from A by a cut- or glue transformation, then A and B are
indistinguishable by an FO(+, ×, <, 0, 1, S)-sentence that expresses a topological
query.


Proof Assume, for the sake of contradiction, that there exist closed semi-algebraic
sets A and B that differ by one cut- or glue transformation but which
can be distinguished by a first-order expressible topological sentence. Hence,
there exists a first-order sentence φ, which expresses a topological query, such
that φ(A) = TRUE and φ(B) = FALSE.

Consider the decision problem MAJORITY about two finite sets of reals
S_1 and S_2 (see above). We will prove the existence of a formula ψ(x, y)
in FO(+, ×, <, 0, 1, S_1, S_2) such that for any finite database D = (D_1, D_2)
over (S_1, S_2), we have that for E_A(D) = {(x, y) ∈ R² | (R, D) ⊨ ψ(x, y)},
φ(E_A(D)) = TRUE if and only if MAJORITY(D_1, D_2) is FALSE.

By Lemma 13.35, this then yields the desired contradiction. This reduction
technique is inspired by (Grumbach and Su, 1995).

Obviously, for any finite D = (D_1, D_2), the part D_1 ⊆ D_2 can be tested in
first-order logic. For given D_1 = {r_1, ..., r_n} and D_2 = {a_1, ..., a_m} with
0 < r_1 < ··· < r_n and 0 < a_1 < ··· < a_m, we construct within the fixed
rectangular part α of R², where the cut-or-glue transformation takes place, a
closed semi-algebraic set E(D) consisting of interconnected strips.

This construction is similar to constructions in (Grumbach and Su, 1995) and
is illustrated in Fig. 13.17 for n = 6 and m = 4. The construction is as follows.
Take a rectangular subarea α' of α. Let (b_0, s_0) be the bottom-left corner of
α' and let h and w be its height and width. Then the sets D_1' = {s_0, ..., s_n} and
D_2' = {b_0, b_1, ..., b_m, b_{m+1}, ..., b_{2m}}, with s_i = s_0 + r_i h / r_n (0 < i ≤ n),
b_i = b_0 + a_i w / (2 a_m) and b_{m+i} = b_i + w/2 (0 < i ≤ m), are constructed. Then,
the following closed strips of E(D) are constructed:





1. the filled convex quadrangle with corners (b_i, s_j), ((b_i + b_{i+1})/2, s_j),
(b_{i+1}, s_{j+1}), ((b_{i+1} + b_{i+2})/2, s_{j+1}) for 0 < i < 2m − 1 and 0 < j < n,
and for i = j = 0,

2. the filled convex quadrangle with corners (b_{2m−1}, s_j), ((b_{2m−1} + b_{2m})/2, s_j),
(b_{2m}, s_{j+1}), (b_{2m}, (s_j + s_{j+1})/2) for 0 < j < n,

3. the filled convex quadrangle with corners (b_0, (s_{j+1} + s_{j+2})/2), ((b_1 +
b_2)/2, s_{j+2}), (b_1, s_{j+2}), (b_0, s_{j+1}) for 0 < j < n − 1.

Figure 13.17. Construction of E(D) for D = (D_1, D_2) with D_1 = {1, 3, 5, 6, 7, 9} and
D_2 = {1, 3, 6, 7} in the rectangular area α.


Finally, a number of additional closed strips are added in the area α \ α' (as
illustrated in Fig. 13.17) to complete the construction of E(D). Note that the
complete construction of E(D), as described above, starting from D_1 and D_2,
can be expressed for any D = (D_1, D_2) by a formula in FO(+, ×, <, 0, 1, S_1, S_2).

We then glue E(D) to the part of A outside the cut-or-glue transformation
area α and denote the resulting set by E_A(D). Note that outside this area,
E_A(D) and B are identical.

Hence, there exists a formula in FO(+, ×, <, 0, 1, S_1, S_2) such that for any
D, we obtain a semi-algebraic set E_A(D). Moreover, by construction E(D)
will be homeomorphic to the right part of Fig. 13.16 if MAJORITY(D_1, D_2) is
true, and homeomorphic to the left part of Fig. 13.16 otherwise.

Hence, in case of majority, E_A(D) is homeomorphic to B, and in the other
case it is homeomorphic to A. Since φ expresses a topological query which
distinguishes between A and B, we can use φ to express MAJORITY. QED


With some more work, additional cut and glue transformations can be proved
to produce results that are indistinguishable from the original spatial figure.
Three such transformations are shown in Fig. 13.18. In (a), we have a strip that
can be cut (this time without producing a perforation in one of the sides). In
(b), we have the reverse transformation of (a), and in (c) we see how parallel
lines can be rewired, even when some data D is in between them.

Figure 13.18. Three more cut and glue transformations on figures in R².

Figure 13.19. One and two balls cannot be distinguished by a topological FO(+, ×, <, 0, 1)-
query.


REMARK 13.41 A direct consequence of Theorem 13.40 is that the connec- 
tivity query is not expressible in FO(+, x, <,0,1). Indeed, this follows im- 
mediately from the observation that two disks can be transformed into a single 
disk using a cut and glue transformation (see Fig. 13.19). 


We remark that Theorem 13.40 also holds when semi-algebraic is replaced 
by semi-linear. In this case, linear versions of the cut and glue transformations 
have to be considered. 


The point structure of a closed semi-algebraic set in R². In the previous
section, we have shown that there are non-homeomorphic sets that nevertheless
are indistinguishable by FO(+, ×, <, 0, 1)-sentences expressing topological
properties. More specifically, the cut- or glue transformations relate some
indistinguishable databases. However, there are a few other transformations
with the same property and the question now arises as to which databases can
be transformed into each other using one of these transformations. For closed
databases, we can characterize this exactly.
First, recall Definition 13.23 of the point structure of a database.


DEFINITION 13.42 Let A and B be closed semi-algebraic sets in R². We
say that Π(A) is isomorphic to Π(B) (denoted by Π(A) ≅ Π(B)) if there
is a bijection f from A ∪ {∞} to B ∪ {∞} with f(∞) = ∞, such that
Π(A) = Π(B) ∘ f.

The main result in this context is that indistinguishability by topological 
queries can be expressed in terms of point-structure isomorphism. 


THEOREM 13.43 Let A and B be closed semi-algebraic sets in R². The sets
A and B are indistinguishable by topological FO(+, ×, <, 0, 1)-queries if and
only if Π(A) ≅ Π(B).


Proof We briefly sketch the proof of this theorem. If two closed semi-algebraic
sets have different point structures, then they can be distinguished by a
topological query that expresses that one set contains a different number of points
with a specific cone than the other set. In Sec. 5.4, we will discuss in more
detail how a cone of a point can be expressed in FO(+, ×, <, 0, 1).

If two closed semi-algebraic sets in R² have the same point structure, they
can be transformed into the same canonical semi-algebraic set using the cut
and glue transformations shown in Fig. 13.18. First, two-dimensional lobes
are cut around singular points, then the lines are cut into lobes. This produces
"flowers" that may be connected by stems. This rewriting process is illustrated
in Fig. 13.20. QED


One may wonder whether this characterization holds for arbitrary semi-algebraic
sets in R² too. This is not the case, as it can be shown that the two sets
shown in Fig. 13.21 can be distinguished by a topological FO(+, ×, <, 0, 1)-
sentence even though they have isomorphic cone structures (Grohe and Segoufin,
2002).

Theorem 13.43 gives us an idea of which closed semi-algebraic sets in
the plane are distinguishable by topological first-order queries, but it doesn't
give us a full picture of the expressive power of the topological fragment of
FO(+, ×, <, 0, 1). There are results that characterize this expressive power,
however. It has been shown that the topological fragment of FO(+, ×, <, 0, 1)
just allows us to formulate queries about the types of cones appearing
in a semi-algebraic set and about the number of points having particular cones
(Benedikt et al., 2006).


5.4 Expressing the cone radius 


Figure 13.20. The transformation of two closed semi-algebraic sets (top left and top
right) into their canonical form (bottom left).

Figure 13.21. Semi-open annuli with the opposite open sides can be distinguished by a
topological FO(+, ×, <, 0, 1)-query.

Figure 13.22. Illustration of how the intersections S¹(p, r) ∩ A change in terms of r.

As we have seen, the local conical property of semi-algebraic sets plays a
prominent role in the study of first-order expressiveness of topological
properties. In this section, we will show that FO(+, ×, <, 0, 1) is expressive
enough to find a cone radius for each point in the database.
More specifically, the following result holds (Geerts, 2003).


THEOREM 13.44 There exists an FO(+, ×, <, 0, 1, S_1, S_2)-formula φ(r),
with S_1 and S_2 of arity n, that for a semi-algebraic set A in R^n and a point p in
R^n, defines one r in R for which (A, {p}) ⊨ φ(r) and which is a cone radius
for A in p.





To give the complete proof of this theorem would lead us too far afield.
Instead, we provide the intuition behind the proof. Moreover, we only consider
the case when n = 2 and assume that A is a semi-algebraic set consisting only
of points of cone type (LL).

Consider the semi-algebraic set depicted on the left of Fig. 13.22. On the
right of this picture, we have shown the intersections of circles of various radii,
centered around p, with A. As can be seen, each time there exists a point q
in A which has a tangent line perpendicular to the line going through p and q
(depicted by the dashed circles), the topological type of the intersection, which
in this case is nothing else than the number of points, changes.

This observation can be formalized using a variant of the triviality theorem
which states that the topological types of S¹(p, r_1) ∩ A and S¹(p, r_2) ∩ A are the
same (meaning that there exists a homeomorphism between these two sets) if
for any r ∈ [r_1, r_2] there exists no point q such that d(p, q) = r and the tangent
line in q is perpendicular to q − p.

It can be shown that there exists an FO(+, ×, <, 0, 1, S_1, S_2)-formula ψ
which returns for any pair (A, p) all points q which have a tangent line
perpendicular to the vector q − p. Using ψ we define

\[ r_p = \tfrac{1}{2} \min \{\, d(p, q) \mid (A, p) \models \psi(q) \,\}. \]

It is clear that for any (A, p), r_p is expressible by a first-order formula. By
definition, there is no point q with d(p, q) = r and a tangent line perpendicular
to q − p for any 0 < r < r_p. In other words, for all 0 < r < r_p, all intersections
S¹(p, r) ∩ A are homeomorphic, and it can be shown that r_p is indeed a cone
radius of A in p.
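The planar intuition above can be tried out numerically. The following Python sketch is an illustration under explicit assumptions and not the first-order construction itself: A is assumed to be given as a smooth parametrized curve, the names tangent_parameters and critical_distance are hypothetical, and the points q with tangent perpendicular to q − p are located by bisection on sign changes of the inner product of the velocity with q − p. The function returns the smallest positive distance from p to such a point, so that every circle around p with a strictly smaller radius avoids all of them.

```python
import math


def tangent_parameters(curve, velocity, p, lo, hi, steps=4000):
    """Parameters t in [lo, hi] at which velocity(t) is perpendicular to curve(t) - p."""

    def g(t):
        qx, qy = curve(t)
        vx, vy = velocity(t)
        return vx * (qx - p[0]) + vy * (qy - p[1])

    roots = []
    h = (hi - lo) / steps
    for k in range(steps):
        a, b = lo + k * h, lo + (k + 1) * h
        ga, gb = g(a), g(b)
        if ga == 0.0:
            roots.append(a)
        elif ga * gb < 0.0:
            for _ in range(60):  # plain bisection on the sign change
                m = 0.5 * (a + b)
                gm = g(m)
                if ga * gm <= 0.0:
                    b = m
                else:
                    a, ga = m, gm
            roots.append(0.5 * (a + b))
    if g(hi) == 0.0:
        roots.append(hi)
    return roots


def critical_distance(curve, velocity, p, lo, hi, tol=1e-9):
    """Smallest positive distance from p to a point q whose tangent is perpendicular to q - p."""
    params = tangent_parameters(curve, velocity, p, lo, hi)
    distances = [math.dist(curve(t), p) for t in params]
    positive = [d for d in distances if d > tol]  # ignore q = p itself
    return min(positive) if positive else math.inf


# Example: the parabola y = x^2 on the parameter range [-2, 2].
parabola = lambda t: (t, t * t)
parabola_velocity = lambda t: (1.0, 2.0 * t)
```

For the parabola and p = (0, 1), the tangent condition holds at x = 0 and at x = ±1/√2, and the closest of these points lies at distance √3/2 from p. For p = (0, 0), the vertex, no such point other than p itself exists on the sampled range, and the sketch returns infinity.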

We briefly mention that for the general case, we have to decompose A first
into parts on which a tangent space can be defined. Moreover, this has to be
shown to be first-order expressible. Next, similar to the case above, a cone
radius of a point p can then be defined as a radius smaller than the distance to
any point q with a tangent space perpendicular to q − p, where the tangent space
is taken relative to the decomposition of A.





5.5 The expressiveness of FO(+, x, <, 0,1) and 
FO(+, <, 0, 1) 


We end this section with a remark on the comparison of the expressive 
powers of FO(+, x, <,0,1) and of FO(+, <, 0,1). We will illustrate that the 
expressive power of FO(+, <, 0, 1) is less than that of FO(+, x,<,0, 1). 

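The topological interior of a two-dimensional set S can be expressed in
FO(+, ×, <, 0, 1) by the formula

\[ \exists \varepsilon\, \bigl(\varepsilon > 0 \wedge \forall x'\, \forall y'\, ((x - x')^2 + (y - y')^2 < \varepsilon^2 \rightarrow S(x', y'))\bigr). \]

Since the topology of R² based on open discs is equivalent to the one based
on open rectangles, we can equivalently express the topological interior of a
semi-algebraic subset of R² in FO(+, <, 0, 1) by the formula

\[ \exists \varepsilon\, \bigl(\varepsilon > 0 \wedge \forall x'\, \forall y'\, ((|x - x'| < \varepsilon \wedge |y - y'| < \varepsilon) \rightarrow S(x', y'))\bigr). \]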

But there are other queries for which multiplication seems to be really
necessary to express them in first-order logic. If we want to express that a
two-dimensional semi-linear set is convex, for instance, then we can do this in
FO(+, ×, <, 0, 1) with the formula

\[ \forall \bar{x}\, \forall \bar{y}\, \bigl((S(\bar{x}) \wedge S(\bar{y})) \rightarrow \forall \lambda\, (0 \le \lambda \le 1 \rightarrow S(\lambda \bar{x} + (1 - \lambda) \bar{y}))\bigr). \]

Clearly, in the subexpression λx̄ + (1 − λ)ȳ multiplication is used and it may
seem difficult to imagine that convexity of semi-linear sets might be expressible
without multiplication.

But it turns out that we have the following property (Vandeurzen et al., 1996). 


PROPOSITION 13.45 A semi-linear set of R” is convex if and only if it is 
closed under taking midpoints. 


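We remark that this property does not hold for subsets of R^n that are not
semi-linear, e.g., think of Q^n.

We can therefore express convexity of semi-linear sets by the FO(+, <, 0, 1)-
formula

\[ \forall \bar{x}\, \forall \bar{y}\, \bigl((S(\bar{x}) \wedge S(\bar{y})) \rightarrow \exists \bar{z}\, (2\bar{z} = \bar{x} + \bar{y} \wedge S(\bar{z}))\bigr). \]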
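The midpoint criterion can be illustrated in the simplest possible setting. The Python sketch below is an assumption-laden toy and not a general decision procedure: it only handles closed semi-linear subsets of R, given as finite unions of closed intervals [a, b] with a ≤ b and rational endpoints, and the names merge, contains and midpoint_witness are hypothetical. For such a set, failing to be a single interval always produces two points of the set whose midpoint falls into a gap.

```python
from fractions import Fraction


def merge(intervals):
    """Merge overlapping or touching closed intervals into sorted connected components."""
    merged = []
    for a, b in sorted(intervals):
        if merged and a <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], b))
        else:
            merged.append((a, b))
    return merged


def contains(intervals, x):
    """Membership test for the union of the closed intervals."""
    return any(a <= x <= b for a, b in intervals)


def midpoint_witness(intervals):
    """Two points of the set whose midpoint lies outside it, or None if the set is one interval."""
    components = merge(intervals)
    if len(components) < 2:
        return None
    right_end_of_first = components[0][1]
    left_end_of_second = components[1][0]
    # The midpoint of these two endpoints lies strictly inside the gap between them.
    return right_end_of_first, left_end_of_second
```

For the union of [0, 1] and [3, 4], the witness is the pair (1, 3), whose midpoint 2 is not in the set; for the union of [0, 2] and [1, 3], which is the single interval [0, 3], no witness exists.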

It is not true, however, that all queries expressible in FO(+, ×, <, 0, 1) are
also expressible in FO(+, <, 0, 1). Indeed, FO(+, ×, <, 0, 1) is clearly more
expressive than FO(+, <, 0, 1) as far as queries that return some n-dimensional
result are concerned. For example, the constant query that returns on any input
the n-dimensional unit sphere is not expressible in FO(+, <, 0, 1).
But what about Boolean queries? It turns out that FO(+, ×, <, 0, 1) is again
more expressive than FO(+, <, 0, 1) as far as Boolean queries are concerned,
as is illustrated by the following theorems. The first theorem is from (Afrati
et al., 1994) and was proved using Ehrenfeucht-Fraïssé games.


THEOREM 13.46 The Boolean query deciding whether a semi-linear set S
contains real numbers u and v satisfying u² + v² = 1 is expressible in
FO(+, ×, <, 0, 1, S), but not in FO(+, <, 0, 1, S).


Another example of a property that is not expressible in FO(+, <,0, 1) is 
from (Benedikt and Keisler, 2000). 


THEOREM 13.47 The Boolean query deciding whether a semi-linear sub- 
set S of R? contains a line is expressible in FO(+, x, <,0,1, S), but not in 
FO(+, <, 0,1, S). 


6. Extensions of logical query languages 


In this section we will extend first-order logic with operators in order to 
increase its expressive power. We begin with introducing transitive closure 
logics (Geerts and Kuijpers, 2000; Geerts and Kuijpers, 2005) and (Kreutzer, 
2001). Next, we extend first-order logic with a while-loop (Gyssens et al., 
1999). We conclude the section by extending first-order logic with specific 
operators for topological properties (Benedikt et al., 2003). 
6.1 First-order logic with all-purpose operators


Let us first make a small digression to the relational database setting. Here,
a database D is a finite model of the signature S, where S consists of a binary
relation name S. The transitive closure of D is usually computed in stages X_i,
i = 0, 1, 2, .... Initially, X_0 = D and for n ≥ 0 we have

\[ X_{n+1} = X_n \cup \{\, (x, y) \mid \exists z\, (X_n(x, z) \wedge D(z, y)) \,\}. \]

Since the number of pairs in D is finite and, by construction, X_n ⊆ X_{n+1},
after finitely many steps we end up with the situation that X_{n+1} = X_n. The
fixed point X_n is then the transitive closure of D.

It is well-known that the transitive closure of D cannot be computed by a 
query expressible in first-order logic over S. In order to be able to express the 
transitive closure of D one has studied the extension of first-order logic with 
transitive closure (Ebbinghaus and Flum, 1995). 


EXAMPLE 13.48 We show that transitive closure logics can express that a
graph G = (V, E) is connected. Indeed, let D be the (binary) database containing
the edge relation E of G. If we interpret the formula

\[ [\mathrm{TC}_{x;y}\, S(x, y)] \]

as an expression which evaluates on D to the transitive closure of D, then the
expression

\[ \forall s\, \forall t\, [\mathrm{TC}_{x;y}\, S(x, y)](s, t) \]

evaluates to true on D if and only if the corresponding graph G is connected,
i.e., if for any two s, t in the domain of D, i.e., the set of vertices V of G,
there exists a sequence (v_0, v_1), (v_1, v_2), ..., (v_{n−1}, v_n) such that D(v_i, v_{i+1})
for i = 0, 1, ..., n − 1 and s = v_0 and t = v_n.
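In the finite relational setting, the stage-wise computation and the connectivity test of Example 13.48 can be sketched directly. The Python below is a minimal illustration, with hypothetical function names; a database is assumed to be a finite set of pairs, and an undirected graph is assumed to be stored with both orientations of every edge.

```python
def transitive_closure(d):
    """Iterate X_0 = D, X_{n+1} = X_n ∪ {(x, y) | ∃z: X_n(x, z) and D(z, y)} to the fixed point."""
    stage = set(d)
    while True:
        next_stage = stage | {(x, y) for (x, z) in stage for (z2, y) in d if z == z2}
        if next_stage == stage:  # X_{n+1} = X_n: the fixed point is reached
            return stage
        stage = next_stage


def is_connected(d):
    """Evaluate 'for all s, t in the domain of D: (s, t) is in the transitive closure of D'."""
    vertices = {v for edge in d for v in edge}
    closure = transitive_closure(d)
    return all((s, t) in closure for s in vertices for t in vertices)


path = {(1, 2), (2, 1), (2, 3), (3, 2)}           # path 1 - 2 - 3
two_parts = {(1, 2), (2, 1), (3, 4), (4, 3)}      # two disjoint edges
```

Here is_connected(path) holds, while is_connected(two_parts) fails because, for example, the pair (1, 3) never enters any stage. Termination is guaranteed because the stages form an increasing chain of subsets of a finite set of pairs.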


Motivated by this relational example, we start by adding the transitive closure 
operator to first-order logic in the constraint database setting. The resulting 
query language will be denoted by FO(+, x, <,0,1)+TC. 


First-order logic with transitive-closure operator. A formula in
FO(+, ×, <, 0, 1)+TC is a formula built in the same way as an FO(+, ×, <, 0, 1)
formula, but with the following extra formation rule: if ψ(x̄, ȳ) is a formula
with x̄ and ȳ k-tuples of real variables, with all free variables of ψ among x̄, ȳ,
and if s̄, t̄ are k-tuples of real variables, then

\[ [\mathrm{TC}_{\bar{x};\bar{y}}\, \psi(\bar{x}, \bar{y})](\bar{s}, \bar{t}) \tag{13.1} \]

is also a formula, which has as free variables those in s̄ and t̄.

We will distinguish between FO(+, <, 0, 1)+TC and FO(+, ×, <, 0, 1)+TC
depending on whether only linear (i.e., × is not allowed) or also non-linear
constraints are allowed.

To explain the semantics of a subformula of the above form (13.1), we
compute again the following stages

\[ X_0 = \psi(D), \qquad X_{n+1} = X_n \cup \{\, (\bar{x}, \bar{y}) \mid \exists \bar{z}\, (X_n(\bar{x}, \bar{z}) \wedge X_0(\bar{z}, \bar{y})) \,\}, \]

until the fixed point, which we denote by X_∞, is reached. Then the semantics of
[TC_{x̄;ȳ} ψ(x̄, ȳ)](s̄, t̄) is defined as (s̄, t̄) belonging to the 2k-ary relation X_∞.

The question is now how a formula in FO(+, ×, <, 0, 1)+TC is evaluated.
Assume that ψ is an FO(+, ×, <, 0, 1)-formula. Then we can simply compute
the quantifier-free description of X_{n+1} recursively by evaluating the
corresponding FO(+, ×, <, 0, 1)-expressions. After each computation we then test
whether X_n = X_{n+1}. If this holds, we have obtained the fixed point and test
whether (s̄, t̄) is in X_n.

In general, a formula φ in FO(+, ×, <, 0, 1)+TC is evaluated
in the standard bottom-up fashion. The result of the evaluation of subformulas
is passed on to formulas that are higher up in the parse tree of φ.

However, in this context, we face the well-known fact that recursion involving
arithmetic over an unbounded domain, such as the polynomial inequalities
over the reals in our setting, is no longer guaranteed to terminate. In other words,
the computation of X_∞ might not terminate. Hence, the property of effective
computability of queries expressible in FO(+, ×, <, 0, 1) or FO(+, <, 0, 1) is
lost when extending these logics with recursion. In case the computation of
X_∞ does not terminate, the semantics of the formula (13.1) (and of any other
formula in which it occurs as a subformula) is undefined.

We therefore have to distinguish between formulas for which the evaluation
terminates, which we call terminating, and formulas for which the evaluation
does not terminate, i.e., the non-terminating formulas. To illustrate this
difference, we now provide an example of a terminating and of a non-terminating
formula in FO(+, ×, <, 0, 1)+TC.


EXAMPLE 13.49 Let S consist of the binary relation S. Consider the
FO(+, ×, <, 0, 1)+TC formula

\[ [\mathrm{TC}_{x;y}\, S(x, y)](s, t) \]

and let D = {(x, y) | y = 2x}. We have for each n ≥ 0,

\[ X_n = \{\, (x, y) \mid \exists i \in \mathbb{N},\ 1 \le i \le n + 1,\ y = 2^i x \,\}. \]

It is clear that for all n ≥ 0, X_n ≠ X_{n+1}, and hence the fixed point X_∞
would only be reached after computing infinitely many stages. We have depicted
this example in Fig. 13.23. It is easy to see that X_∞ is not a semi-algebraic set.
Figure 13.23. Computation of [TC_{x;y} S(x, y)](s, t) for D = {(x, y) | y = 2x} (thickest
line). Consecutive stages are drawn in decreasingly finer lines.


On the other hand, even when X_∞ is semi-algebraic, its computation may still
be non-terminating. Moreover, one may wonder whether the non-termination
of the FO(+, ×, <, 0, 1)+TC formula in Example 13.49 is caused by the
unboundedness of the input database. However, it is easy to see that even bounded
input databases may cause non-termination.

Since FO(+, x, <,0, 1) is a sub-language of FO(+, x, <,0,1)+TC, there 
are infinitely many terminating formulas. We now give an example of a termi- 
nating formula which is not in FO(+, x, <,0,1). 


EXAMPLE 13.50 Let the database schema S consist of the unary relation
name S. Consider the formula

\[ [\mathrm{TC}_{x;y}\, \varphi(r, x, y) \wedge S(r)](s, t) \]

where φ(r, x, y) defines the graph of the continuous piecewise affine function
that maps x to





0 if0<x<}i, 
y= <x 1 ift<gz<]l, 
1-1 ifsr= 


Then, for D = {p} and p ∈ N we have for each n > 0,


i=1 


It is clear that X_{p+1} = X_p and hence this FO(+, ×, <, 0, 1)+TC formula
is terminating. Moreover, it expresses the query Q_nat which tests whether
the number stored in a database is a natural number. We have illustrated this
example in Fig. 13.24. It is easily verified that the query Q_nat cannot be
expressed in the logic FO(+, ×, <, 0, 1).
Figure 13.24. Example of the terminating FO(+, ×, <, 0, 1)+TC formula of Example 13.50
for D = {4}.


As a result of the above example and Example 13.50, we have that
FO(+, ×, <, 0, 1) is strictly less expressive than FO(+, ×, <, 0, 1)+TC.

As explained in Example 13.48, transitive closure logic can express the
connectivity of finite graphs. Similarly, FO(+, ×, <, 0, 1)+TC can express
the connectivity of constraint databases. We first consider the case of linear
constraints.

Let S be a schema with one relation name S of arity n. Consider the following
FO(+, ×, <, 0, 1)+TC formula connected(S):


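\[ \forall \bar{s}\, \forall \bar{t}\, \bigl((S(\bar{s}) \wedge S(\bar{t})) \rightarrow [\mathrm{TC}_{\bar{x};\bar{y}}\, \mathrm{lineconn}](\bar{s}, \bar{t})\bigr), \]

where lineconn(x̄, ȳ) is the formula

\[ \forall \lambda\, \bigl(0 \le \lambda \le 1 \rightarrow \forall \bar{z}\, (\bar{z} = \lambda \bar{x} + (1 - \lambda) \bar{y} \rightarrow S(\bar{z}))\bigr). \]

We now claim that a pair of points (p, q) belongs to the transitive closure of
lineconn(D) (with D semi-linear) if and only if p and q belong to the same
connected component. Hence, if we can show that connected is a terminating
formula, then this implies that connected expresses connectivity of semi-linear
sets.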


THEOREM 13.51 The formula connected terminates on all linear constraint 
databases D over S and correctly expresses connectivity of semi-linear sets. 


Proof Since D is semi-linear, two points p and q belong to the same connected
component of D if and only if there exists a piecewise linear path from p to
q lying entirely in D. This follows directly from the semi-linear version of
Theorem 13.14 and Remark 13.15. So, indeed [TC_{x̄;ȳ} lineconn](s̄, t̄) holds
if and only if s̄ and t̄ belong to the same connected component of D.

To conclude that the evaluation of the transitive closure in the formula
connected ends in finitely many steps, we need to show that there exists
an upper bound on the number of line segments in S needed to connect any
two points in the same connected component of S. Now, any semi-linear set
can be decomposed into a finite number of convex sets (van den Dries, 1998,
Ch. 8, Exercise 2.14 (2)). The finiteness of this decomposition yields the desired
bound since any two points in a convex set are connected by a single straight
line segment. We have illustrated this in Fig. 13.25 for n = 2. QED





Figure 13.25. A semi-linear set decomposed into convex sets. The boundaries of the convex sets
are shown as dotted lines. We have depicted a piecewise linear path (dashed line) between points
s and t.


The above FO(+, ×, <, 0, 1)+TC-formula connected cannot be applied to
arbitrary semi-algebraic inputs while still guaranteeing termination. Indeed,
the evaluation of connected on the binary relation depicted in Fig. 13.26,
consisting of the points lying strictly between the parabola y = x² and the
translated one y = x² + 1/2, will not terminate. Although any two points in
this set can be connected by a finite number of line segments, there is no upper
bound on the number of segments needed to connect two points.

There are also examples of semi-algebraic sets for which there exist two
points in a connected component that cannot be connected by any finite
piecewise linear path. Here we take as example the set defined by
(y³ − x² > 0 ∧ x < 0 ∧ y < 1) ∨ (x = 0 ∧ y = 0), depicted in Fig. 13.27. The cusp
point (0, 0) at the bottom cannot be connected by a line segment to any point in
the interior of this set. So, the query expressed by connected terminates after two
iterations, but it does not contain all pairs of points that are in the connected
component. This is obviously because piecewise-linear connectivity does not
correspond to connectivity for arbitrary semi-algebraic sets.

Figure 13.26. A semi-algebraic set, consisting of the points lying strictly between the
parabola y = x² and the translated parabola y = x² + 1/2, on which termination is not satisfied.

Figure 13.27. A semi-algebraic set with a cusp point.

Basically, Fig. 13.26 and 13.27 illustrate the only two cases where the
connected query does not work correctly (i.e., cusp points and cusp points
towards ∞).

Nevertheless, we can also express connectivity of arbitrary semi-algebraic
sets. The proof of this result is a reduction to the linear case. This reduction
is possible by the following result.


THEOREM 13.52 There exists a terminating FO(+, x, <, 0, 1)+TC formula 
which expresses the linearization query “Return a semi-linear set which is 
homeomorphic to the database.” 


We remark that by the triangulation theorem (Theorem 13.26), the existence
of such a semi-linear set is guaranteed.

So, given a database D, we first apply the linearization query and then apply 
the connected query. This clearly results in the desired FO(+, x, <,0,1)+TC 
formula expressing connectivity. 

So far, we have shown individual queries which can be expressed in 
FO(+, x,<,0,1)+TC. Moreover, we have seen that not all formulas in 
FO(+, x,<,0,1)+TC are terminating. We now describe two ways of con- 
trolling this termination and its effect on the expressiveness. 


Transitive Closure with stop conditions. A formula in FO(+, ×, <, 0, 1)
+TCS is built in the same way as an FO(+, ×, <, 0, 1) formula, but with the
following extra formation rule: if ψ(x̄, ȳ) is a formula with x̄ and ȳ k-tuples of
real variables, σ is an FO(+, ×, <, 0, 1) sentence over the input database and
a special 2k-ary relation name X, and s̄, t̄ are k-tuples of real variables, then

\[ [\mathrm{TC}_{\bar{x};\bar{y}}\, \psi(\bar{x}, \bar{y}) \mid \sigma](\bar{s}, \bar{t}) \tag{13.2} \]

is also a formula, which has as free variables those in s̄ and t̄. We call σ the stop
condition of this formula.

The semantics of a subformula of the above form (13.2) evaluated on a database
D is defined in the same manner as in the case without stop condition,
but now we stop not only in case an i is found such that X_i = X_{i+1}, but also
when an i is found such that (D, X_{i+1}) ⊨ σ, whichever case occurs first. As
above, we also consider the restriction FO(+, <, 0, 1)+TCS.





EXAMPLE 13.53 As an example of an FO(+, ×, <, 0, 1)+TCS formula over
a two-dimensional input database S, we take

\[ [\mathrm{TC}_{x;y}\, S(x, y) \mid \exists x\, \exists y\, (X(x, y) \wedge y = 1 \wedge 10x < 1)](s, t). \]

Here the stop condition is σ = ∃x∃y (X(x, y) ∧ y = 1 ∧ 10x < 1). When
applied to the graph of the function shown in Fig. 13.23, we see that X_3 satisfies
the sentence in the stop condition since, for instance, (1/16, 1) belongs to it. The
evaluation has become terminating (as opposed to the expression without stop
condition in Example 13.49). On input the graph of the function shown in
Fig. 13.24, this expression still terminates after four iterations (since X_5 = X_4,
not because the stop condition is satisfied) and returns the same result as in the
case without stop condition.
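The effect of the stop condition on the input D = {(x, y) | y = 2x} can be traced with a small symbolic sketch. The Python below rests on assumptions that are not part of the text: each stage is a finite union of lines y = c·x through the origin and is represented only by its set of slopes c, composing two such lines multiplies their slopes, and the names tc_with_stop and sigma as well as the cap max_stages are hypothetical. The cap merely keeps the sketch from looping forever; in the logic itself, a non-terminating evaluation has undefined semantics.

```python
from fractions import Fraction


def tc_with_stop(slopes, stop, max_stages=50):
    """Stages of the transitive closure on unions of lines y = c*x, represented by slopes.

    Returns (i, X_i) for the first stage X_i, i >= 1, that equals the previous stage or
    satisfies the stop condition, or None if no such stage is found within max_stages.
    """
    x0 = frozenset(slopes)
    stage, index = x0, 0
    while index < max_stages:
        # (x, z) on y = a*x and (z, y) on y = b*x compose to a point on y = (a*b)*x.
        next_stage = stage | {a * b for a in stage for b in x0}
        index += 1
        if next_stage == stage or stop(next_stage):
            return index, next_stage
        stage = next_stage
    return None


def sigma(stage):
    """Stop condition: some (x, 1) with 10x < 1 lies in the stage; on y = c*x this is x = 1/c."""
    return any(10 * (1 / c) < 1 for c in stage)


result = tc_with_stop({Fraction(2)}, sigma)
```

With these conventions, result is the pair consisting of the stage index 3 and the slope set {2, 4, 8, 16}: the line with slope 16 contains the point (1/16, 1), which satisfies the stop condition, whereas the slopes 2, 4 and 8 do not. Replacing sigma by a condition that never holds makes the sketch run into the cap, which mirrors the non-termination of Example 13.49.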





Transitive Closure with start points and parameters. We can also allow
parameters in the transitive closure and restrict the computation of the transitive
closure to certain paths, after specifying starting points. We denote the resulting
logic by FO(+, ×, <, 0, 1)+KTC. A formula in FO(+, ×, <, 0, 1)+KTC
is built exactly as in FO(+, ×, <, 0, 1)+TC with the exception that parameters
ū are allowed, i.e.,

\[ [\mathrm{TC}_{\bar{x};\bar{y}}\, \psi(\bar{x}, \bar{y}, \bar{u})](\bar{s}, \bar{t}) \tag{13.3} \]

has as free variables ū, s̄ and t̄.

The semantics of a subformula of the form (13.3), with s̄ = (s_1, ..., s_k),
evaluated on a database D is defined in the following operational manner: Let
I be the set of indices i for which s_i is a constant. Then we start computing the
following iterative sequence of (2k + ℓ)-ary relations:

\[ X_0 := \psi(D) \wedge \bigwedge_{i \in I} (s_i = x_i) \]

and

\[ X_{i+1} := X_i \cup \{\, (\bar{x}, \bar{y}, \bar{u}) \in \mathbb{R}^{2k+\ell} \mid \exists \bar{z}\, (X_i(\bar{x}, \bar{z}, \bar{u}) \wedge \psi(\bar{z}, \bar{y}, \bar{u})) \,\} \]

and stop as soon as X_i = X_{i+1}. The semantics of

\[ [\mathrm{TC}_{\bar{x};\bar{y}}\, \psi(\bar{x}, \bar{y}, \bar{u})](\bar{s}, \bar{t}) \]

is then defined as (s̄, t̄, ū) belonging to the (2k + ℓ)-ary relation X_i.


EXAMPLE 13.54 As an example of an FO(+, ×, <, 0, 1)+KTC formula over
a two-dimensional input database D, we take

$[\mathrm{TC}_{x;y}\, S(x, y)](4, t)$.

When applied to the graph of the function shown in Fig. 13.23, we see that
$X_0 = D \cap \{(x, y) \mid x = 4\}$ and this set is just {(4, 5)}. Next, $X_1$ is computed
to be {(4, 5)} ∪ {(4, 1)}. In subsequent iterations, no further tuples are added
(i.e., $X_2 = X_1$). This example shows that in FO(+, ×, <, 0, 1)+KTC, the
evaluation can be restricted to the computation of certain paths in the transitive
closure, and this gives control over the termination.
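The iteration with a start point can be sketched on a finite relation. The following is only a toy illustration under stated assumptions: real constraint databases are infinite sets given by formulas, whereas here ψ(D) is replaced by a finite Python set of pairs. The function name `ktc_fixpoint` and the sample relation are hypothetical; only the pairs (4, 5) and (5, 1) come from the example above.

```python
def ktc_fixpoint(psi, start):
    """Iterate X_{i+1} = X_i ∪ {(x, y) | ∃z: (x, z) ∈ X_i and (z, y) ∈ psi}.

    psi   : finite binary relation (set of pairs), a toy stand-in for ψ(D)
    start : constant start point, so X_0 = {(x, y) ∈ psi | x == start}
    """
    current = {(x, y) for (x, y) in psi if x == start}
    while True:
        # Extend every path (x, z) already found by one psi-step (z, y).
        step = {(x, y) for (x, z) in current for (z2, y) in psi if z2 == z}
        nxt = current | step
        if nxt == current:          # stop as soon as X_i = X_{i+1}
            return current
        current = nxt

# Hypothetical finite sample; (2, 3) and (3, 2) are never visited from start 4.
D = {(4, 5), (5, 1), (2, 3), (3, 2)}
result = ktc_fixpoint(D, 4)
```

Only paths starting at 4 are ever explored, which is the restriction that gives control over termination in the infinite case.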


We will also consider the fragment FO(+, <,0,1)+KTC of this language. 


6.2 Computational Completeness 


One may wonder what the expressive power of the transitive-closure logics
is. It turns out that adding stop conditions or allowing start points and
parameters drastically increases the expressive power. More specifically, both
FO(+, <, 0, 1)+TCS and FO(+, <, 0, 1)+KTC are computationally
complete on semi-linear constraint databases.

This means that for every partial computable (in the sense of Turing
computable) query Q on semi-linear databases, there exists a formula $\varphi$ in
FO(+, <, 0, 1)+TCS (respectively in FO(+, <, 0, 1)+KTC) such that for
each semi-linear database D, $\varphi(D)$ is defined if and only if Q(D) is defined,
and in this case $\varphi(D)$ and Q(D) are equal.


THEOREM 13.55 Both FO(+, <, 0, 1)+TCS and FO(+, <, 0, 1)+KTC are
computationally complete on linear constraint databases.


We remark that this holds only for linear databases. However, a similar result
holds on arbitrary databases for the extension FO(+, ×, <, 0, 1)+WHILE of
FO(+, ×, <, 0, 1) with a while-loop.


An extension of FO(+, ×, <, 0, 1) with a while-loop. A program in
FO(+, ×, <, 0, 1)+WHILE is a finite sequence of assignment statements and
while-loops. Each assignment statement has the form

$R := \{(x_1, \ldots, x_k) \mid \varphi(x_1, \ldots, x_k)\}$,

where $\varphi$ is an FO(+, ×, <, 0, 1) formula that uses the relation names $S_i$ (of the
input database schema $S = \{S_1, \ldots, S_n\}$) and previously introduced relation
names. Each while-loop has the form


WHILE $\varphi$ DO P OD,


where P is an FO(+, ×, <, 0, 1)+WHILE program and $\varphi$ an FO(+, ×, <, 0, 1)
formula that uses relation names $S_i$ from the input schema and previously
introduced relation names.

The semantics of a program applied to a spatial database is the operational,
step-by-step execution. So, the effect of an assignment statement is to evaluate
the FO(+, ×, <, 0, 1) formula on the right-hand side on the constraint database
D augmented with the previously assigned relation variables, and to assign
the result of the evaluation to the relation variable on the left-hand side. The
effect of a while-loop is to execute the body as long as the condition $\varphi$
evaluates to true. One relation name $R_{out}$ is designated as the output relation
and when the FO(+, ×, <, 0, 1)+WHILE program terminates, the current value
of $R_{out}$ is considered to be the output.

Again, we have the problem of non-terminating while loops, as the following 
example shows. 


EXAMPLE 13.56 Let D = [0, 1] and consider the following
FO(+, ×, <, 0, 1)+WHILE program P over the input schema S = {S}, with
S unary:


R := {(x) | S(x)};
Y := ∅;
WHILE R ≠ Y DO
    Y := {(x) | $\varphi$(x, R)};
    R := R \ Y;
OD


where $\varphi$ is an FO(+, ×, <, 0, 1) formula such that when it is evaluated on any finite
sequence of closed non-overlapping intervals $\{[a_i, b_i]\}$ it results in the sequence
of closed non-overlapping intervals $\{[a_i, a_i + \frac{b_i - a_i}{3}], [a_i + \frac{2(b_i - a_i)}{3}, b_i]\}$.
Obviously, P is non-terminating on D and P(D) is undefined. However, if we
allowed infinitely long computations, then P(D) would be the Cantor set, which
is depicted in Fig. 13.28.
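To make the non-termination concrete, here is a minimal sketch of the interval refinement behind the Cantor construction. It assumes the standard step in which every closed interval is replaced by its two outer closed thirds, and it cuts the loop off after a chosen number of rounds, which the while-program itself never does. The names `outer_thirds` and `cantor_steps` are illustrative only.

```python
from fractions import Fraction

def outer_thirds(intervals):
    """Replace each closed interval [a, b] by [a, a+(b-a)/3] and [a+2(b-a)/3, b]."""
    out = []
    for a, b in intervals:
        third = (b - a) / 3
        out.append((a, a + third))
        out.append((a + 2 * third, b))
    return out

def cantor_steps(n_steps):
    """Apply the refinement n_steps times, starting from D = [0, 1]."""
    current = [(Fraction(0), Fraction(1))]
    for _ in range(n_steps):
        current = outer_thirds(current)
    return current

level2 = cantor_steps(2)
# Each round doubles the number of intervals, so the relation never stabilises.
counts = [len(cantor_steps(n)) for n in range(5)]
```

Exact rationals (`Fraction`) are used so that the endpoints stay exact; after n rounds there are 2^n intervals of total length (2/3)^n.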





Figure 13.28. The iterative construction of the Cantor set.

The previous example shows that, in contrast to the transitive-closure logics,
an FO(+, ×, <, 0, 1)+WHILE program can recursively reduce the size of
relations. This already hints that FO(+, ×, <, 0, 1)+WHILE is more powerful.
Indeed, we have the following result.


THEOREM 13.57 The language FO(+, ×, <, 0, 1)+WHILE is computationally
complete on constraint databases.


The only result known for transitive-closure logics on arbitrary databases
follows from the completeness on linear databases (Theorem 13.55) and the fact that the
linearization query is expressible in FO(+, ×, <, 0, 1)+TC
(Theorem 13.52):


THEOREM 13.58 The languages FO(+, ×, <, 0, 1)+TCS and
FO(+, ×, <, 0, 1)+KTC are computationally complete on constraint
databases as far as Boolean topological queries are concerned.


Indeed, given a partial computable Boolean topological query Q on a database
D, we can apply the linearization query to get a database $\hat{D}$ that contains
linearizations of all relations in D. By definition of a topological query, Q(D) and
$Q(\hat{D})$ are equal. Since there exists an FO(+, ×, <, 0, 1)+TCS (respectively
FO(+, ×, <, 0, 1)+KTC) formula $\varphi$ expressing Q on linear databases, we can
express Q on D in FO(+, ×, <, 0, 1)+TCS (respectively
FO(+, ×, <, 0, 1)+KTC) by $\varphi(\hat{D})$.

There are many open problems related to these recursive extensions of
FO(+, ×, <, 0, 1). For example, it is not known whether FO(+, ×, <, 0, 1)+TC
is less expressive than FO(+, ×, <, 0, 1)+TCS or FO(+, ×, <, 0, 1)+KTC,
and it is also unknown whether FO(+, ×, <, 0, 1)+TCS and
FO(+, ×, <, 0, 1)+KTC are complete on arbitrary databases.

So far, we have been extending FO(+, ×, <, 0, 1) with generic operators
expressing many (in some cases all) queries inexpressible in FO(+, ×, <, 0, 1).
As a result, queries could be undefined because of non-terminating
computations.

In the next section we will show how FO(+, ×, <, 0, 1) can be extended
with specific operators aimed at expressing specific queries while at the same time
guaranteeing closure.








6.3 Extensions of FO(+, ×, <, 0, 1) with topological operators


We have seen that connectivity of a database is a property which one
would like to be expressible in a query language. We have also seen that
FO(+, ×, <, 0, 1) lacks the power to express this query, while
FO(+, ×, <, 0, 1)+TC provides enough power to do so.




However, it is possible to extend FO(+, ×, <, 0, 1) directly with a connectivity
operator. More generally, define a topological property Top as a collection
$\{\mathcal T_1, \ldots, \mathcal T_n, \ldots\}$ where $\mathcal T_n$ is a family of sets in $\mathbb R^n$ such that if $X \in \mathcal T_n$, then
for each homeomorphism h of $\mathbb R^n$, $h(X) \in \mathcal T_n$.


EXAMPLE 13.59 Of special interest is when Top expresses the property of 
being connected. Other examples of Top are the property of being closed, being 
of dimension k, containing exactly two holes and so on. 


Let T be a set of topological properties. We then define the language
FO(+, ×, <, 0, 1) + T by extending FO(+, ×, <, 0, 1) with the following rule:
if $\varphi(\vec x, \vec y)$ is a query then

$\psi(\vec x) \equiv \mathrm{Top}\,\vec y \cdot \varphi(\vec x, \vec y)$

is also a query for any Top ∈ T. The semantics of such a formula is as follows:
$D \models \psi(\vec a)$ iff $\varphi(\vec a, D) = \{\vec b \mid D \models \varphi(\vec a, \vec b)\} \in \mathrm{Top}$.


EXAMPLE 13.60 The query “Is the intersection of regions S and T connected?”
can be written as $\mathrm{Conn}\,\vec x \cdot S(\vec x) \wedge T(\vec x)$.
The query $\psi(x) \equiv \mathrm{Conn}\,(y, z) \cdot S(x, y, z)$
returns the set of all c ∈ ℝ for which the intersection of S with the
plane x = c is a connected set.


THEOREM 13.61 The logic FO(+, ×, <, 0, 1) + T is closed on constraint
databases.


Proof The result follows from a simple induction on the formulas. The only
case to prove is $\psi(\vec x) \equiv \mathrm{Top}\,\vec y \cdot \varphi(\vec x, \vec y)$ for Top ∈ T. Let $\vec x$ and $\vec y$ be of
length n and m, respectively. By induction, $\varphi(D)$ results in a semi-algebraic
set in $\mathbb R^n \times \mathbb R^m$. By the triviality theorem for semi-algebraic sets, there exists
a decomposition $\mathcal C$ of $\mathbb R^{n+m}$ into finitely many cells which is trivial over $\mathbb R^n$
and such that $\varphi(D)$ is the union of cells of $\mathcal C$. Let $\mathcal C'$ be the projection of $\mathcal C$ onto
$\mathbb R^n$, and let C be a cell in $\mathcal C'$. By triviality, for every $\vec a, \vec b \in C$, it is the case that
$(\varphi(D))_{\vec a}$ and $(\varphi(D))_{\vec b}$ are homeomorphic, and thus agree on Top. Therefore,
the output of $\psi$ on D is a union of finitely many cells in $\mathcal C'$. Moreover, since
each cell is semi-algebraic, so is the output $\psi(D)$. QED


The linear analog of the previous theorem also holds for the language 
FO(+,<,0,1) + T. However, since the triviality theorem does not hold in 
this case, one first needs to develop an alternative decomposition theorem 
(see Benedikt et al., 2003 for details).

1. Introduction


Mathematical morphology (MM) is a branch of image processing, which 
arose in 1964. It is associated with the names of Georges Matheron and Jean 
Serra, who developed its main concepts and tools, expounded it in several books 
(Matheron, 1975; Serra, 1982; Serra, 1988), and created a team at the Centre 
de Morphologie Mathématique on the Fontainebleau site of the Paris School 
of Mines. 

MM truly deserves the adjective “mathematical”, as it is heavily mathema- 
tized. In this respect, it contrasts with the various heuristic or experimental 
approaches to image processing that one sees in the literature. It stands also as 
an alternative to another strongly mathematized branch of image processing, the 
one that bases itself on signal processing and information theory, following the 
works of prestigious pioneers named Wiener, Shannon, Gabor, etc. Indeed, 
these classical approaches proved their value in telecommunications. However,
MM claims that analysing the information of an image is not like transmitting
a signal on a channel, that an image should not be considered as a combination 
of sinusoidal frequencies, nor as the result of a Markov process on individual 
points. It considers that the purpose of image analysis is to find spatial ob- 
jects, therefore images contain geometrical shapes with luminance (or colour) 
profiles, which can be investigated by their interactions with other shapes and 
luminance profiles. This makes the morphological approach especially relevant 
in situations where image grey-levels (or colours) correspond directly to sig- 
nificant material data, as in medical imaging, microscopy, industrial inspection 
and remote sensing. 

In its development, MM has borrowed concepts and tools from various 
branches of mathematics: algebra (lattice theory), topology, discrete geom- 
etry, integral geometry, geometrical probability, partial differential equations, 
etc.; in fact any mathematical theory that deals with shapes, their combinations 
or their evolution, can be brought to contribute to morphological theory. 

MM started by analysing binary images (sets of points) with the use of 
set-theoretical operations. In order to apply it to other types of images, for 
example grey-level ones (numerical functions), it was necessary to generalize 
set-theoretical notions, such as the relation of inclusion and the operations of 
union and intersection. This was done by using the lattice-theoretical notions of 
a partial order relation between images, for which the operations of supremum 
(least upper bound) and infimum (greatest lower bound) are defined. Therefore 
the central structure in MM is that of a complete lattice, and the basic morpho- 
logical operators (dilation, erosion, opening and closing) can be characterized 
in this framework. 

When analysing sets, one considers their topology: is the set in one or 
several pieces, how many holes has it, etc. Some topological notions, in partic- 
ular connectedness, have been generalized in the framework of complete lat- 
tices. Nowadays, most morphological techniques combine lattice-theoretical 
and topological methods. 

The computer processing of pictures quickly led to digital models of geometry. 
The pioneering work in this field is that of Azriel Rosenfeld, who died in 
2004 after having contributed to digital geometry and image processing for 
40 years. Thanks to its algebraic formalism, mathematical morphology is per- 
fectly adapted to the digital framework. Moreover, the topology of digital 
figures can be studied in the framework of combinatorial topology, a field that 
was developed in the first half of the 20th century by mathematicians like Paul 
Alexandroff (Alexandroff, 1937; Alexandroff, 1956; Alexandroff and Hopf, 
1935). In particular the latter proposed in 1935 to subdivide the Euclidean plane 
into rectangular cells, in such a way that cell interiors, sides, and corners are 
considered as points in an abstract space, whose combinatorial relations provide 
the topology. This idea prefigured the notion of pixels, and the corresponding
Alexandroff topology was formally developed by Efim Khalimsky and
popularized by Vladimir Kovalevsky; it has been shown that many “paradoxes” of
digital geometry (like non-parallel lines which do not intersect) find a natural 
solution in that topology. 

MM has also borrowed tools from integral geometry in order to measure some 
parameters on images. However these measurements are usually preceded by 
some image processing operations, in order to restrict the measure to some 
appropriate features: for example, to estimate the average length of particles 
whose width is at least w, we apply first an operator eliminating all particles 
narrower than w, then we make a length measurement on the remaining ones. 

MM also has a probabilistic aspect, where images and shapes can be
considered as random events. Suppose for example that one asks n experts to
extract a certain set S from an image, say n anatomists have to extract the left
half of the liver from an X-ray scanner hepatic image; they will disagree, and
extract n different sets $S_1, \ldots, S_n$; now, how does one derive the “average”
of these n sets, or their “standard deviation”? Furthermore, if one designs a
computer algorithm for extracting that set, which produces the set $S_{auto}$, how
does one evaluate the statistical significance of $S_{auto}$ w.r.t. the distribution
$S_1, \ldots, S_n$? Such problems are studied in geometric probability, through the
theory of random sets and functions (Matheron, 1975; Serra, 1982; Serra, 1988).
This should not be confused with Markov field models for image processing:
there the random variable is the grey-level of an individual pixel, and it evolves
in space by a Markov process.

Image analysis has considered the varying scales at which things are seen. 
This has been formalized by multi-scale models governed by partial differential 
equations (PDEs). This has happened also for morphological operators, for 
which new PDEs have been given, leading to a new understanding of their 
functioning. 

The theory of morphological operators relies on the formalism of lattice
theory, and the latter also underlies several theoretical aspects of computer science:
fuzzy sets, formal concept analysis and abstract interpretation of programs.
In fact, the lattice-theoretical tools developed in each speciality can be used for
the other ones. For example, research on fuzzy morphology has been under way
for several years. Also, the tools of MM, developed for the purpose of
filtering and segmenting images, have found applications for modelling spatial
concepts, like “close to” or “between”.

The link between logic and lattice theory is obvious. Boole’s logic is the first 
example of a Boolean algebra, while non-classical logics have been modeled 
as non-Boolean lattices. As MM analyses spatial shapes by means of lattice- 
theoretical operations, it is adapted to the logical analysis of spatial relations, 
while its abstract mathematical tools can be used in order to illuminate some
aspects of logic, for example modal logic, and to build new operations in such
a framework.

The purpose of this chapter is to present the basic theory of MM (Sec. 2), 
then to show how its tools can be applied to various specialities dealing with the 
analysis of spatial shapes and spatial relations, such as formal concept analysis, 
rough sets and fuzzy sets (Sec. 3), and finally to show its relevance in logic 
(Sec. 4). 

Let us now describe the basic operations of mathematical morphology, first
in the case of sets (or binary images), and next in the case of numerical functions
(or grey-level images). We must warn the reader that in several works (including
important ones, for instance Serra, 1982; Soille, 2003), the definitions given
for the basic operations (Minkowski addition and subtraction, dilation, erosion,
opening and closing) differ from ours in that in some cases the structuring
element must be replaced by its symmetrical; also the notation can be different
(in particular Serra, 1982). The definitions given here for morphological
operations are standard (Heijmans, 1994), in the sense that they are consistent with
the original definitions given by Minkowski, 1903 for the Minkowski addition
and Hadwiger, 1950 for the Minkowski subtraction, and that they follow the
algebraic theory (see Sec. 2), which allows a unified treatment (Heijmans
and Ronse, 1990; Ronse and Heijmans, 1991) of such operators in the case of
sets, numerical functions, and many other structures.


1.1 Morphology on sets 


Consider the space $E = \mathbb R^n$ or $\mathbb Z^n$, with origin o = (0, ..., 0). Given
$X \subseteq E$, the complement of X is $X^c = E \setminus X$, and the transpose or
symmetrical of X is $\check X = \{-x \mid x \in X\}$. For every p ∈ E, the translation
by p is the map $E \to E : x \mapsto x + p$; it transforms any subset X of E into its
translate by p, $X_p = \{x + p \mid x \in X\}$.

Most morphological operations on sets can be obtained by combining
set-theoretical operations with two basic operators, dilation and erosion. The
latter arise from two set-theoretical operations, the Minkowski addition ⊕
(Minkowski, 1903) and subtraction ⊖ (Hadwiger, 1950), defined as follows
for any X, B ∈ P(E):

(14.1)    $X \oplus B = \bigcup_{b \in B} X_b = \bigcup_{x \in X} B_x = \{x + b \mid x \in X,\ b \in B\}$ ;
          $X \ominus B = \bigcap_{b \in B} X_{-b} = \{p \in E \mid B_p \subseteq X\}$ .

Formally speaking, X and B play similar roles as binary operands. However,
in real situations, X will stand for the image (which is big, and given by
the problem), and B for the structuring element (a small shape chosen by
the user), so that X ⊕ B and X ⊖ B will be transformed images. We define
the dilation by B, $\delta_B : P(E) \to P(E) : X \mapsto X \oplus B$, and the erosion by
B, $\varepsilon_B : P(E) \to P(E) : X \mapsto X \ominus B$. It should be noted that dilation and
erosion are dual by complementation; in other words, dilating a set is equivalent
to eroding its complement with the symmetrical structuring element:

(14.2)    $(X \oplus B)^c = X^c \ominus \check B$ ,    $(X \ominus B)^c = X^c \oplus \check B$ .


Therefore the properties of erosion are derived from those of dilation by duality: 
dilation inflates the object, deflates the background and deforms convex corners 
of the object; thus erosion deflates the object, inflates the background and 
deforms concave corners of the object. By Equation (14.2), we can also obtain 
alternate formulations for Minkowski addition and subtraction: 


(14.3)    $X \oplus B = \{p \in E \mid (\check B)_p \cap X \neq \emptyset\}$ ;
          $X \ominus B = \{p \in E \mid \forall z \notin X,\ p \notin (\check B)_z\}$ .


We illustrate in Fig. 14.1 the dilation and erosion of a cross by a triangular 
structuring element. 
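As a minimal sketch (not taken from the text), Equation (14.1) can be tried out on finite subsets of $\mathbb Z^2$ represented as Python sets of integer pairs. The function names `dilate` and `erode` and the sample sets are illustrative assumptions.

```python
def dilate(X, B):
    """Minkowski addition X ⊕ B = {x + b | x in X, b in B}."""
    return {(x[0] + b[0], x[1] + b[1]) for x in X for b in B}

def erode(X, B):
    """Minkowski subtraction X ⊖ B: intersection of the translates X_{-b}, b in B."""
    if not B:
        raise ValueError("B must be non-empty (X ⊖ ∅ would be the whole space)")
    translates = [{(x[0] - b[0], x[1] - b[1]) for x in X} for b in B]
    return set.intersection(*translates)

# Hypothetical sample data: a 3x3 square and a two-pixel horizontal segment.
X = {(i, j) for i in range(3) for j in range(3)}
B = {(0, 0), (1, 0)}

dilated = dilate(X, B)
eroded = erode(X, B)

# Second form in (14.1): p is kept iff the translate B_p fits inside X.
# Searching p inside X is enough here because B contains the origin.
fits = {p for p in X if all((p[0] + b[0], p[1] + b[1]) in X for b in B)}
```

On this sample the dilation widens the square by one column, while the erosion removes the rightmost column; both descriptions of the erosion give the same set.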

Dilation and erosion are the basic elements from which most morphological 
operators are built. The first example is the hit-or-miss transform, which uses 
a pair of structuring elements. Let A and B be two disjoint subsets of E; A 
will be the foreground structuring element and B the background structuring 
element; we then define: 


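$X \otimes (A, B) = \{p \in E \mid A_p \subseteq X \text{ and } B_p \subseteq X^c\}$
$\qquad = (X \ominus A) \cap (X^c \ominus B) = (X \ominus A) \setminus (X \oplus \check B)$ .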


This will give the locus of all points where A fits the foreground and B fits 
the background. This operation corresponds to what is usually called template 
matching. 
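The hit-or-miss transform can be sketched directly from its first definition, again on finite subsets of $\mathbb Z^2$ stored as Python sets. The function name `hit_or_miss` and the sample data (a template that detects isolated pixels with respect to the four axis neighbours) are assumptions made for illustration.

```python
def hit_or_miss(X, A, B):
    """Points p such that A_p lies inside X and B_p lies inside the complement of X.

    A must be non-empty; X, A, B are finite sets of integer pairs.
    """
    a0 = next(iter(A))
    # If A_p ⊆ X then p + a0 ∈ X, so every solution has the form x - a0 with x in X.
    candidates = {(x[0] - a0[0], x[1] - a0[1]) for x in X}
    return {
        p for p in candidates
        if all((p[0] + a[0], p[1] + a[1]) in X for a in A)
        and all((p[0] + b[0], p[1] + b[1]) not in X for b in B)
    }

# Foreground template: the origin. Background template: its four axis neighbours.
A = {(0, 0)}
B = {(1, 0), (-1, 0), (0, 1), (0, -1)}

# Hypothetical image: one isolated pixel and one two-pixel component.
X = {(0, 0), (5, 5), (5, 6)}
isolated = hit_or_miss(X, A, B)
```

Only the pixel (0, 0) matches, since each pixel of the two-pixel component has a foreground neighbour.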

Figure 14.1. Top: The figure X is the cross, and the structuring element B is the triangle; the
position of the origin is indicated by a thick dot. Bottom: The dilation X ⊕ B of X by B. Right:
The erosion X ⊖ B of X by B is obtained as the complement of the dilation $X^c \oplus \check B$ of the
complement $X^c$ by the symmetrical structuring element $\check B$.


The main operators derived from dilation and erosion are opening and closing.
We define the binary operations ∘ and • by setting for any X, B ∈ P(E):

(14.4)    $X \circ B = (X \ominus B) \oplus B = \bigcup \{B_p \mid p \in E \text{ and } B_p \subseteq X\}$ ;
          $X \bullet B = (X \oplus B) \ominus B$ .

The operator $\gamma_B : P(E) \to P(E) : X \mapsto X \circ B$ is called the opening by B; it
is the composition of the erosion $\varepsilon_B$, followed by the dilation $\delta_B$. On the other
hand, the operator $\varphi_B : P(E) \to P(E) : X \mapsto X \bullet B$ is called the closing
by B; it is the composition of the dilation $\delta_B$, followed by the erosion $\varepsilon_B$. The
two are dual by complementation:

(14.5)    $(X \circ B)^c = X^c \bullet \check B$ ,    $(X \bullet B)^c = X^c \circ \check B$ .


Hence the properties of closing are derived from those of opening by duality: 
opening removes narrow parts of the object and deforms convex corners of the 
object; thus closing fills narrow parts of the background and deforms concave 
corners of the object. We illustrate in Fig. 14.2 the opening and closing of a 
cross by a triangular structuring element. 

Figure 14.2. We use the same figure X and structuring element B as in Fig. 14.1. Top left: The
opening X ∘ B of X by B is the union of all translates of B which are included in X. Bottom
right: The closing X • B of X by B is obtained as the complement of the opening $X^c \circ \check B$ of
the complement $X^c$ by the symmetrical structuring element $\check B$.


Given a family $\mathcal B$ of structuring elements, the opening by $\mathcal B$, written $\gamma_{\mathcal B}$, is
the union of openings by elements of $\mathcal B$, while the closing by $\mathcal B$, written $\varphi_{\mathcal B}$, is
the intersection of closings by elements of $\mathcal B$:

$\gamma_{\mathcal B}(X) = \bigcup_{B \in \mathcal B} (X \circ B)$ ,
$\varphi_{\mathcal B}(X) = \bigcap_{B \in \mathcal B} (X \bullet B)$ .

For example, if H and V are respectively a horizontal and a vertical line segment
of length a, $\gamma_{\{H,V\}}$ will extract from a line drawing all horizontal and vertical
lines of length at least a (as well as all blobs whose height or width is at least a).

The most interesting property of the opening and closing (by one or several
structuring elements) is that they are idempotent: $\gamma_{\mathcal B}(\gamma_{\mathcal B}(X)) = \gamma_{\mathcal B}(X)$ and
$\varphi_{\mathcal B}(\varphi_{\mathcal B}(X)) = \varphi_{\mathcal B}(X)$. This means that if we consider them as filters, they do
their job completely, and there is no need to repeat them. This contrasts with
the behaviour of other image processing operators, like the median filter, where
repeated applications can further modify the image, without a guarantee that it
will reach a stable result after a finite number of iterations (indeed, the median
filter can produce oscillations). The opening can be used to filter out positive
noise, that is, to remove noisy parts of the object, typically small components;
on the other hand, the closing can be used to remove negative noise, that is,
to add to the object noisy parts of the background, typically small holes. By
repeated composition of an opening and a closing, one can obtain four new
filters:


▪ opening followed by closing;

▪ closing followed by opening;

▪ opening followed by closing, then by opening;

▪ closing followed by opening, then by closing.


All four are idempotent, and no other operator can be obtained by further com- 
position (Serra, 1988). They can be used as filters to remove both positive and 
negative noise; for example, they constitute an alternative to median filtering 
for removing speckle noise. 
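The filtering behaviour described above can be illustrated with a small sketch on finite subsets of $\mathbb Z^2$. The helper names, the 2×2 structuring element and the noisy sample image are assumptions chosen for illustration; the opening and closing follow Equation (14.4).

```python
def dilate(X, B):
    """X ⊕ B = {x + b | x in X, b in B}."""
    return {(x[0] + b[0], x[1] + b[1]) for x in X for b in B}

def erode(X, B):
    """X ⊖ B as the intersection of the translates X_{-b}, b in B (B non-empty)."""
    translates = [{(x[0] - b[0], x[1] - b[1]) for x in X} for b in B]
    return set.intersection(*translates)

def opening(X, B):
    """Erosion followed by dilation."""
    return dilate(erode(X, B), B)

def closing(X, B):
    """Dilation followed by erosion."""
    return erode(dilate(X, B), B)

B = {(0, 0), (1, 0), (0, 1), (1, 1)}          # 2x2 square
square = {(i, j) for i in range(5) for j in range(5)}

# Hypothetical noisy image: a 5x5 square with a one-pixel hole (negative noise)
# plus one isolated pixel far away (positive noise).
noisy = (square - {(2, 2)}) | {(10, 10)}

opened = opening(noisy, B)                    # removes the isolated pixel
filtered = closing(opened, B)                 # then fills the one-pixel hole
```

Applying `opening` or `closing` a second time changes nothing, in line with idempotence. The opening-then-closing filter recovers the clean square on this sample, which need not happen for other shapes or structuring elements.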

These operators have a drawback: they deform the frontier between the object 
and background. Typically, if one uses a disk-shaped structuring element, they 
will round the corners of objects. However, one may want to filter out small 
components or holes of the object, without modifying the shape of the other 
components and holes. In other words, we look for filters which do not act 
at the level of pixels, but of connected components of the foreground (called 
grains) and of the background (called pores). 

The basic operation for this purpose is shown in Fig. 14.3: from a figure F,
we extract the union of all connected components of F (grains) that intersect a
marker R.


Figure 14.3. Left: We have a figure F (shown hatched) and a marker R (grey). Right: All
connected components of F that intersect R are shown hatched.


We can formalize this operation as follows. We assume that E is a digital
space ($E = \mathbb Z^n$ or a bounded grid in $\mathbb Z^n$), and that the connectivity arises from




an adjacency graph on E, for example, the 4- or 8-adjacency on $\mathbb Z^2$, the 6- or
26-adjacency on $\mathbb Z^3$ (Rosenfeld and Kak, 1976). Let V be the structuring element
comprising the origin o and the pixels adjacent to it, so that for any pixel p, the
set comprising p and its neighbours is $V_p$; note that V is symmetrical ($\check V = V$).
Given a set F (called the mask) and a subset R of F (called the marker), we
define the geodesical reconstruction by dilation (from marker R in the mask F)
as the limit

$\mathrm{rec}_\oplus(F, R) = \bigcup_{n \in \mathbb N} R_n$

of the increasing sequence of sets $R_n$, n ∈ ℕ, defined recursively as follows:

$R_0 = R \cap F$ and $\forall n \in \mathbb N,\ R_{n+1} = (R_n \oplus V) \cap F$ .

This will indeed give the union of all grains of F marked by (i.e., intersecting) 
the marker R. 
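The recursion for the reconstruction by dilation translates almost literally into code when F is finite. The sketch below assumes the 4-adjacency on $\mathbb Z^2$; the function name `reconstruct_by_dilation` and the sample mask and marker are illustrative.

```python
# V: the origin together with its 4-adjacent pixels (symmetrical by construction).
V4 = {(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)}

def reconstruct_by_dilation(F, R, V=V4):
    """Limit of R_0 = R ∩ F, R_{n+1} = (R_n ⊕ V) ∩ F for a finite mask F."""
    current = R & F
    while True:
        grown = {(x + dx, y + dy) for (x, y) in current for (dx, dy) in V} & F
        if grown == current:        # the increasing sequence has stabilised
            return current
        current = grown

# Hypothetical mask with two grains; the marker touches only the first one.
grain_a = {(0, 0), (1, 0), (2, 0)}
grain_b = {(5, 5), (5, 6)}
F = grain_a | grain_b
R = {(2, 0), (9, 9)}

marked = reconstruct_by_dilation(F, R)
```

Because V contains the origin, each step can only add pixels, and since F is finite the loop stops after finitely many steps.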

The dual operation is the geodesical reconstruction by erosion; here the
marker R is a superset of the mask F (F ⊆ R), and it is defined as

$\mathrm{rec}_\ominus(F, R) = [\mathrm{rec}_\oplus(F^c, R^c)]^c$ .

This is in fact the limit $\bigcap_{n \in \mathbb N} R_n$ of the decreasing sequence of sets $R_n$, n ∈ ℕ,
defined recursively by

$R_0 = R \cup F$ and $\forall n \in \mathbb N,\ R_{n+1} = (R_n \ominus V) \cup F$ .

The behaviour of $\mathrm{rec}_\ominus$ is to reconstruct all pores of F which are not completely
covered by the marker R; in other words, all connected components of the
background $F^c$ which are included in R are added to F. We illustrate this
operation in Fig. 14.4.

Given an opening $\gamma$, we define the opening by reconstruction $\gamma_{\mathrm{rec}}$ as the
geodesical reconstruction by dilation using the opening as marker:

$\gamma_{\mathrm{rec}}(X) = \mathrm{rec}_\oplus(X, \gamma(X))$ .

Similarly for a closing $\varphi$, we define the closing by reconstruction $\varphi_{\mathrm{rec}}$ as the
geodesical reconstruction by erosion using the closing as marker:

$\varphi_{\mathrm{rec}}(X) = \mathrm{rec}_\ominus(X, \varphi(X))$ .

Note that for a connected structuring element B containing the origin, we have

$\mathrm{rec}_\oplus(X, X \circ B) = \mathrm{rec}_\oplus(X, X \ominus B)$
and $\mathrm{rec}_\ominus(X, X \bullet B) = \mathrm{rec}_\ominus(X, X \oplus B)$ .


Figure 14.4. Top: We have a mask figure F (hatched) contained in a marker R (R \ F is shown
in grey). Bottom: $\mathrm{rec}_\ominus(F, R)$ (hatched) is made of F and all connected components of $F^c$
which are completely covered by the marker R.


The opening and closing by reconstruction are again idempotent operators; they
respectively remove small grains and fill small pores, but they do not deform the
remaining boundaries between foreground and background. They can then be
composed (as explained above: opening followed by closing, closing followed
by opening, etc.) in order to provide idempotent filters that remove grains and
pores on the basis of their width, without distorting the contours of objects.
Other idempotent filters can be built that act directly on grains and pores,
for example, the area opening (which removes all grains whose area is below
a threshold) and the area closing (filling all pores whose area is below a threshold).
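The difference between an ordinary opening and an opening by reconstruction can be seen on a small example. The sketch below works on finite subsets of $\mathbb Z^2$ with the 4-adjacency; all function names and the sample image (a square with a thin tail, plus an isolated pixel) are assumptions made for illustration.

```python
def dilate(X, B):
    """X ⊕ B = {x + b | x in X, b in B}."""
    return {(x[0] + b[0], x[1] + b[1]) for x in X for b in B}

def erode(X, B):
    """X ⊖ B as the intersection of the translates X_{-b}, b in B (B non-empty)."""
    return set.intersection(*[{(x[0] - b[0], x[1] - b[1]) for x in X} for b in B])

def opening(X, B):
    return dilate(erode(X, B), B)

V4 = {(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)}

def reconstruct_by_dilation(F, R, V=V4):
    """Grow R inside the finite mask F until nothing changes."""
    current = R & F
    while True:
        grown = dilate(current, V) & F
        if grown == current:
            return current
        current = grown

def opening_by_reconstruction(X, B):
    """Reconstruction by dilation of X from the marker X ∘ B."""
    return reconstruct_by_dilation(X, opening(X, B))

B = {(0, 0), (1, 0), (0, 1), (1, 1)}                    # 2x2 square
square = {(i, j) for i in range(4) for j in range(4)}
tail = {(4, 0), (5, 0)}                                 # thin part of the same grain
X = square | tail | {(10, 10)}                          # plus one small grain

plain = opening(X, B)
by_rec = opening_by_reconstruction(X, B)
```

The plain opening cuts off the thin tail, whereas the opening by reconstruction keeps the whole grain intact and only discards the isolated pixel.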


1.2 Morphology on functions 


In computer imaging, grey-levels are coded by numerical values, the low
ones corresponding to dark pixels, and the high ones corresponding to bright
ones. Hence in mathematical morphology (Heijmans, 1994), grey-level images
are usually considered as numerical functions E → T, where E is the space
of points and T is the set of grey-levels; it is always a subset of
$\overline{\mathbb R} = \mathbb R \cup \{-\infty, +\infty\}$. The grey-levels are numerically ordered, and morphological
operations usually compute at each point in E a combination of numerical suprema
and infima of grey-level values. Thus one supposes that T is closed under the
operations of non-empty numerical supremum and infimum; in the terminology
that we will introduce in Sec. 2, T is a complete lattice. Usually one takes for
T one of the sets $\overline{\mathbb R}$, $\overline{\mathbb Z} = \mathbb Z \cup \{-\infty, +\infty\}$, $[a, b] = \{x \in \mathbb R \mid a \le x \le b\}$
(with a, b ∈ ℝ and a < b), or $[a \ldots b] = [a, b] \cap \mathbb Z$ (with a, b ∈ ℤ and a < b).
We write $t_0$ and $t_1$ respectively for the least and greatest element of T (thus
$t_0 = -\infty$ and $t_1 = +\infty$ for $T = \overline{\mathbb R}$ or $\overline{\mathbb Z}$, while $t_0 = a$ and $t_1 = b$ for T = [a, b]
or [a…b]).

The set $T^E$ of functions E → T inherits the numerical order on T by the
pointwise ordering of functions:

(14.7)    $F \le G \iff \forall p \in E,\ F(p) \le G(p)$ .

This is the analogue for functions of the inclusion relation for sets. Now the
analogues for functions of the union and intersection operations for sets are the
supremum (least upper bound) and infimum (greatest lower bound), obtained
by pointwise supremum and infimum operations:

(14.8)    $\bigvee_{i \in I} F_i : E \to T : p \mapsto \sup_{i \in I} F_i(p)$ ,    $\bigwedge_{i \in I} F_i : E \to T : p \mapsto \inf_{i \in I} F_i(p)$ .

We write F ∨ G and F ∧ G for the supremum and infimum of two functions (cf.
the union and intersection of two sets); as the two binary operations ∨ and ∧ are
commutative and associative, we can write $F_1 \vee \cdots \vee F_n$ and $F_1 \wedge \cdots \wedge F_n$, which
are in fact respectively equal to $\bigvee_{i \in \{1, \ldots, n\}} F_i$ and $\bigwedge_{i \in \{1, \ldots, n\}} F_i$. The least and
greatest functions are the ones with constant values $t_0$ and $t_1$ respectively; they
are the analogues of the empty set ∅ and the whole space E.

Given a function F : E → T and a point p ∈ E, the translate of F by p is
the function $F_p$ whose graph is obtained by translating the graph
{(x, F(x)) | x ∈ E} by p in the first coordinate, that is,

$\{(y, F_p(y)) \mid y \in E\} = \{(x + p, F(x)) \mid x \in E\}$ ,

in other words

$\forall y \in E,\ F_p(y) = F(y - p)$ .


We have thus the analogues for functions of the union, intersection and
translation operations for sets. It is then possible to define the dilation, erosion,
opening and closing of a function by a structuring element, by making
analogues of Eqs. (14.1, 14.4).

There is however a systematic method for extending operators on sets to
operators on functions (Heijmans, 1991; Heijmans, 1994; Ronse, 2003). It
relies on the notions of thresholding and stacking. Given a function F : E → T,
the umbra (or hypograph) of F is the set

$U(F) = \{(p, t) \mid p \in E,\ t \in T,\ F(p) \ge t\}$ ,

and for any value t ∈ T, consider the threshold set

$X_t(F) = \{p \in E \mid F(p) \ge t\}$ ;

thus (p, t) ∈ U(F) iff p ∈ $X_t(F)$. We illustrate these notions in Fig. 14.5.








Figure 14.5. The graph of F, and below it the umbra U(F) (in grey). For t ∈ T, the horizontal
line at level t crosses the umbra in a section whose projection in E is the threshold set $X_t(F)$.


Given an operator $\psi : P(E) \to P(E)$, the flat operator corresponding to $\psi$
(or flat extension of $\psi$) is the operator $\psi^T : T^E \to T^E$ constructed as follows:


1 Thresholding: For every t ∈ T, we take the horizontal cross-section of
the umbra U(F) at level t, that is the set $X_t(F) \times \{t\}$.


2 Horizontal operation: We apply $\psi$ horizontally to every such
cross-section, that is, for every t ∈ T we obtain the set $\psi(X_t(F)) \times \{t\}$.


3 Stacking: The upper envelope of these sets $\psi(X_t(F)) \times \{t\}$, t ∈ T,
defines a function which gives $\psi^T(F)$.


We illustrate this construction in Fig. 14.6, in the case where $\psi = \delta_B$, the
dilation by a structuring element B. In fact, the values taken by $\psi^T(F)$ are
given by the following formula:

(14.9)    $\forall p \in E,\ \psi^T(F)(p) = \bigvee \{t \in T \mid p \in \psi(X_t(F))\}$ .
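The thresholding and stacking construction can be sketched for functions on $E = \mathbb Z$ with grey-levels T = {0, ..., t_max}. Several simplifying assumptions are made here: F is stored as a dict and takes the least value 0 wherever no entry is stored, and the level t = 0 is skipped, which presumes that ψ maps the whole space to itself (true for a dilation by a non-empty B). The names `threshold_set`, `flat_extension` and `dilate_1d` are illustrative.

```python
def threshold_set(F, t):
    """X_t(F) = {p | F(p) >= t}, for t >= 1 and F stored as a dict (default value 0)."""
    return {p for p, v in F.items() if v >= t}

def flat_extension(psi, F, t_max):
    """Stack psi(X_t(F)) for t = 1..t_max, following Equation (14.9).

    Points that are never reached keep the least grey-level 0 and are not stored.
    """
    out = {}
    for t in range(1, t_max + 1):
        for p in psi(threshold_set(F, t)):
            out[p] = t      # t only increases, so the last write is the supremum
    return out

def dilate_1d(X, B=frozenset({0, 1})):
    """Set dilation on Z by the structuring element B."""
    return {x + b for x in X for b in B}

# Hypothetical grey-level signal with values in {0, ..., 3}.
F = {0: 1, 1: 3, 2: 2}
dilated_F = flat_extension(dilate_1d, F, t_max=3)
```

On this signal the stacked result takes the values 1, 3, 3, 2 at the points 0, 1, 2, 3, which is the pointwise maximum of F(p) and F(p − 1).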


Figure 14.6. Top left: The graph of F, the umbra U(F) (in grey), and horizontal cross-sections
$X_t(F) \times \{t\}$ of the umbra. Top right: The structuring element B (the position of the origin
is indicated by a dot). Bottom left: We apply $\delta_B$, the dilation by B, horizontally to the
cross-sections $X_t(F) \times \{t\}$, obtaining the sets $(X_t(F) \oplus B) \times \{t\}$. Bottom right: The upper envelope
of the dilated cross-sections gives the dilated function $\delta_B^T(F)$, also written F ⊕ B.


Rather than using Equation (14.9) to compute the values $\psi^T(F)(p)$, we can
rely on the fact that the flat extension of operators transforms the operations on
sets into the corresponding ones on functions, as follows from the properties
listed below (for the sake of brevity, in the formulas we omit the quantifications
∀X ∈ P(E) and ∀F ∈ $T^E$):


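■ Identity: If $\psi(X) = X$, then $\psi^T(F) = F$.

■ Translation: If $\psi(X) = X_p$, then $\psi^T(F) = F_p$.

■ Union: If $\psi(X) = \bigcup_{i \in I} \xi_i(X)$, then $\psi^T(F) = \bigvee_{i \in I} \xi_i^T(F)$.

■ Intersection: If $\psi(X) = \bigcap_{i \in I} \xi_i(X)$, then $\psi^T(F) = \bigwedge_{i \in I} \xi_i^T(F)$.

■ Composition: If $\psi(X) = \eta(\zeta(X))$, then $\psi^T(F) = \eta^T(\zeta^T(F))$.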


These properties can for example be used to give formulas for the flat extensions
of dilation and erosion. As $\delta_B(X) = \bigcup_{b \in B} X_b$ and $\varepsilon_B(X) = \bigcap_{b \in B} X_{-b}$ (see
Equation (14.1)), we obtain for every $F \in T^E$:


\[ \delta_B^T(F) = \bigvee_{b \in B} F_b \qquad\text{and}\qquad \varepsilon_B^T(F) = \bigwedge_{b \in B} F_{-b} . \tag{14.10} \]

We get then for every $p \in E$:


\[ \delta_B^T(F)(p) = \sup_{b \in B} F(p - b) = \sup_{q \in \check{B}_p} F(q) \]
\[ \text{and}\qquad \varepsilon_B^T(F)(p) = \inf_{b \in B} F(p + b) = \inf_{q \in B_p} F(q) . \]


It is customary to write $F \oplus B$ and $F \ominus B$ for $\delta_B^T(F)$ and $\varepsilon_B^T(F)$. Following
Equation (14.4), we define $F \circ B = (F \ominus B) \oplus B$ and $F \bullet B = (F \oplus B) \ominus B$;
clearly $F \circ B = \gamma_B^T(F)$ and $F \bullet B = \varphi_B^T(F)$. Note that here the operations
$\oplus$, $\ominus$, $\circ$ and $\bullet$ have a function as first operand, a set as second, and a function
again as result.
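As a small illustration of Equation (14.10) and of the derived opening and closing, here is a Python sketch on a one-dimensional image stored as a list. The function names are hypothetical, and the border handling is an assumption of this example: samples outside the list are simply ignored, which amounts to padding with the least grey-level for the dilation and with the greatest one for the erosion. The structuring element is assumed to contain the origin.

```python
def flat_dilate(f, B):
    """(F ⊕ B)(p) = sup of F(p - b) over b in B (out-of-range samples ignored)."""
    n = len(f)
    return [max(f[p - b] for b in B if 0 <= p - b < n) for p in range(n)]

def flat_erode(f, B):
    """(F ⊖ B)(p) = inf of F(p + b) over b in B (out-of-range samples ignored)."""
    n = len(f)
    return [min(f[p + b] for b in B if 0 <= p + b < n) for p in range(n)]

def flat_open(f, B):
    """F ∘ B = (F ⊖ B) ⊕ B."""
    return flat_dilate(flat_erode(f, B), B)

def flat_close(f, B):
    """F • B = (F ⊕ B) ⊖ B."""
    return flat_erode(flat_dilate(f, B), B)

f = [1, 1, 5, 1, 1, 4, 4, 4, 1]
B = [-1, 0, 1]
print(flat_open(f, B))   # the isolated peak of width 1 is flattened, the plateau of width 3 is kept
```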

All set operators built by combining dilations and erosions through unions, 
intersections and translations, extend thus naturally as flat operators. Then 
the properties of the set operators translate directly to their flat extensions; for 
example, openings and closings are idempotent, and composing them leads to 
idempotent filters. In practice, flat operators behave on bright and dark parts 
of a grey-level image in the same way as the corresponding set operators do 
on foreground and background. For example, dilation inflates bright areas and 
deflates dark ones, while erosion does the contrary; opening darkens narrow 
bright zones, while closing brightens narrow dark zones; dilation and opening 
deform corners which are convex on the bright side, while erosion and closing 
deform corners which are convex on the dark side. In particular, filters obtained 
by composing opening and closing can be used to remove small defects in an 
image, such as speckle noise. 

There is still a duality between erosion and dilation, and between opening
and closing. Let $n$ be an inversion of $T$, that is a bijection $T \to T$ which
reverses the order: $t \le t' \iff n(t) \ge n(t')$; for example, if $T = [a \ldots b]$, we
have $n(t) = a + b - t$; we extend it to an inversion $N$ on functions, by setting
$N(F) : p \mapsto n(F(p))$ (here $n$ and $N$ stand for negative, in the photographic
sense). Then:


\[ N(F \oplus B) = N(F) \ominus \check{B}, \qquad N(F \ominus B) = N(F) \oplus \check{B}, \]
\[ N(F \circ B) = N(F) \bullet \check{B}, \qquad N(F \bullet B) = N(F) \circ \check{B} . \]


This expresses formally the fact that the behaviour of erosions and closings is
derived from that of dilations and openings, by exchanging the roles of bright and
dark points or zones in the grey-level image.

It is also possible to give flat extensions of geodesical reconstruction by
dilation or erosion. For a mask function $F$ and a marker function $R$, such that
$R \le F$, we define the geodesical reconstruction by dilation


\[ \mathrm{rec}_{\oplus}(F, R) = \bigvee_{n \in \mathbb{N}} R_n , \tag{14.11} \]


where the functions $R_n$, $n \in \mathbb{N}$, are defined recursively by


\[ R_0 = R \wedge F \qquad\text{and}\qquad \forall n \in \mathbb{N}, \quad R_{n+1} = (R_n \oplus V) \wedge F . \]

(Here $V$ is the neighbourhood of the origin.) For $R \ge F$, we have the geodesical
reconstruction by erosion


\[ \mathrm{rec}_{\ominus}(F, R) = \bigwedge_{n \in \mathbb{N}} R_n , \]


where
\[ R_0 = R \vee F \qquad\text{and}\qquad \forall n \in \mathbb{N}, \quad R_{n+1} = (R_n \ominus V) \vee F . \]
In fact, the two are dual:
\[ \mathrm{rec}_{\ominus}(F, R) = N[\mathrm{rec}_{\oplus}(N(F), N(R))] . \]


In the same way as the geodesical reconstructions on sets acted on grains and 
pores (connected components of the foreground and background), here these 
operators will act on flat zones, that is, maximal connected sets having a con- 
stant grey-level value. In particular, we can design openings and closings by 
reconstruction, as in the case of sets, and these filters will remove some bright 
or dark objects, and simplify the grey-levels of remaining objects, but they will 
not deform the contours between objects. They are thus very interesting image 
filters. 
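The recursion defining the reconstruction by dilation can be sketched directly in Python for a one-dimensional image. This is a naive illustration, not an efficient algorithm: the function name is hypothetical, the neighbourhood $V$ is assumed to be $\{-1, 0, 1\}$, and the iteration stops at the first stable $R_n$, which on a finite image equals the supremum of the increasing sequence $(R_n)$.

```python
def reconstruct_by_dilation(mask, marker):
    """Iterate R_{n+1} = (R_n ⊕ V) ∧ F from R_0 = R ∧ F until stability."""
    n = len(mask)
    r = [min(m, f) for m, f in zip(marker, mask)]            # R_0 = R ∧ F
    while True:
        # dilation by V = {-1, 0, 1}; the slice is clipped at the borders
        dilated = [max(r[max(p - 1, 0):p + 2]) for p in range(n)]
        nxt = [min(d, f) for d, f in zip(dilated, mask)]     # infimum with the mask F
        if nxt == r:
            return r
        r = nxt

mask = [0, 3, 3, 0, 5, 5, 0, 2, 0]
marker = [0, 0, 3, 0, 1, 0, 0, 0, 0]
print(reconstruct_by_dilation(mask, marker))
```

In this example the first bright zone is fully recovered, the second one is kept but flattened to the marker level, and the unmarked third one disappears; the positions of the contours are unchanged.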

The extension of morphology on sets that we have described, is called flat 
morphology. This terminology arises from the fact that we work on the “hori- 
zontal” structure of functions (see Fig. 14.6). We will now see morphological 
operators on functions that act both “horizontally” and “vertically” on them. 

As the operators will combine grey-levels by arithmetical additions and subtractions,
it will no longer be possible to take a bounded interval for the grey-level
set $T$, otherwise the grey-levels resulting from these operations might
overflow out of this interval. Thus $T$ must extend from $-\infty$ to $+\infty$. Let
$T' = T \setminus \{-\infty, +\infty\}$; formally we have the following two requirements:


■ $T$ is closed under the operations of non-empty numerical supremum and
infimum (thus $T$ is a complete lattice);


■ $T'$ is closed under the operations of addition and subtraction (in other
words, $T'$ is a subgroup of $\mathbb{R}$).


It is then easily seen that either $T = \overline{\mathbb{R}}$ and $T' = \mathbb{R}$, or there is some $a > 0$
such that $T' = a\mathbb{Z} = \{az \mid z \in \mathbb{Z}\}$ and $T = a\overline{\mathbb{Z}} = a\mathbb{Z} \cup \{-\infty, +\infty\}$; in
the second case, we can make a scaling of grey-levels by $1/a$, so here we can
suppose without loss of generality that $T = \overline{\mathbb{Z}}$ and $T' = \mathbb{Z}$.

We gave above grey-level analogues of some set-theoretical operations. We
have to extend this analogy further. First we redefine the umbra or hypograph
of a function $F : E \to T$: it is the set


\[ U(F) = \{ (p,t) \in E \times T' \mid t \le F(p) \} . \]
The difference with the previous definition is that we restrict $t$ to $T'$, while
before we had $t \in T$. The points $(p,t)$ of the umbra $U(F)$ are the analogues of
the points $x \in X$ for a set $X$. We have now to give the analogue of a singleton,
namely a set $\{p\}$ verifying $\{p\} \subseteq X \iff p \in X$; it is the impulse $i_{(p,t)}$, for
$(p,t) \in E \times T'$, defined as follows:


\[ \forall x \in E, \qquad i_{(p,t)}(x) = \begin{cases} t & \text{if } x = p, \\ -\infty & \text{if } x \ne p. \end{cases} \]


We verify indeed that for a function $F$ and an impulse $i_{(p,t)}$, we have
$i_{(p,t)} \le F \iff (p,t) \in U(F)$.
We call the support of a function $F$ the set
\[ \mathrm{supp}(F) = \{ p \in E \mid F(p) > -\infty \} . \]

Note that $p \in \mathrm{supp}(F)$ iff there exists some $t \in T'$ with $(p,t) \in U(F)$. We
will see below that points outside the support are redundant in calculations; in
fact, we can assume that $F$ is defined only on its support; conversely if $F$ is
defined only on a subset $S$ of $E$, we extend it to a function on $E$ by setting
$F(p) = -\infty$ for all $p \in E \setminus S$.

We defined above the translation of a function by a point. We extend it
to the translation by a pair $(p,t)$. Given a function $F : E \to T$ and a pair
$(p,t) \in E \times T'$, the translate of $F$ by $(p,t)$ is the function $F_{(p,t)}$ whose graph
is obtained by translating the graph $\{(x, F(x)) \mid x \in E\}$ by $p$ in the first
coordinate and by $t$ in the second, that is,


\[ \{ (y, F_{(p,t)}(y)) \mid y \in E \} = \{ (x + p, F(x) + t) \mid x \in E \} , \]
in other words


\[ \forall y \in E, \qquad F_{(p,t)}(y) = F(y - p) + t . \]
We can now define the Minkowski addition and subtraction of two functions
$E \to T$, by analogy with Equation (14.1). Such an analogy already appeared
partially in the definition of the dilation and erosion of a function by a set,
Equation (14.10), but we have to extend it further. Given two functions $F, G :
E \to T$, we define their Minkowski addition $F \oplus G$ and subtraction $F \ominus G$ as
follows:


\[
\begin{aligned}
F \oplus G &= \bigvee_{(p,t) \in U(G)} F_{(p,t)} = \bigvee_{(p,t) \in U(F)} G_{(p,t)} \\
&= \bigvee \{ i_{(p+p',\,t+t')} \mid (p,t) \in U(F),\ (p',t') \in U(G) \} ; \\
F \ominus G &= \bigwedge_{(p,t) \in U(G)} F_{(-p,-t)} \\
&= \bigvee \{ i_{(p,t)} \mid (p,t) \in E \times T',\ G_{(p,t)} \le F \} .
\end{aligned}
\tag{14.12}
\]
Figure 14.7. Top left: The two grey-level functions $F$ and $G$, both having value $-\infty$ outside a
bounded support. (In the next three illustrations, $F$ and $G$ are shown dotted.) Top right: $F \oplus G$
is the supremum of translates of $F$ by all points of $U(G)$. Bottom left: It is also the supremum
of translates of $G$ by all points of $U(F)$. Bottom right: $F \ominus G$ is the supremum of all impulses
$i_{(p,t)}$ such that $G_{(p,t)} \le F$; in fact there is a unique point $p$ for which this is possible, so $F \ominus G$
is an impulse.


These two operations are illustrated in Fig. 14.7. Usually, F plays the role of a 
grey-level image, while G is the grey-level analogue of a structuring element, 
and we call it then a structuring function. 

We can give a numerical expression for the values of $F \oplus G$ and $F \ominus G$; for
all $p \in E$ we have


\[
\begin{aligned}
(F \oplus G)(p) &= \sup_{h \in E} \bigl( F(p - h) + G(h) \bigr) = \sup_{h \in \mathrm{supp}(G)} \bigl( F(p - h) + G(h) \bigr) , \\
(F \ominus G)(p) &= \inf_{h \in E} \bigl( F(p + h) - G(h) \bigr) = \inf_{h \in \mathrm{supp}(G)} \bigl( F(p + h) - G(h) \bigr) ,
\end{aligned}
\tag{14.13}
\]


with the following convention for dealing with expressions of the form $+\infty - \infty$
inside the parentheses: if $F(p - h) + G(h)$ takes the form $+\infty - \infty$, we set
it equal to $-\infty$, while if $F(p + h) - G(h)$ takes the form $+\infty - \infty$, we set it
equal to $+\infty$.
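Equation (14.13) can be illustrated by a short Python sketch. The representation is an assumption of this example: a function is stored as a dictionary mapping the points of its support to finite grey-levels, every other point implicitly having value $-\infty$; with finite values only, the ambiguous form $+\infty - \infty$ never arises. The names `dilate` and `erode` are hypothetical.

```python
NEG_INF = float("-inf")

def dilate(f, g):
    """(F ⊕ G)(p) = sup over h in supp(G) of F(p - h) + G(h)."""
    out = {}
    for x, fx in f.items():
        for h, gh in g.items():
            p = x + h
            out[p] = max(out.get(p, NEG_INF), fx + gh)
    return out

def erode(f, g):
    """(F ⊖ G)(p) = inf over h in supp(G) of F(p + h) - G(h).

    The value is -inf (p omitted from the result) as soon as some p + h
    falls outside supp(F).
    """
    out = {}
    h0 = next(iter(g))
    for x in f:
        p = x - h0                      # candidate points: p + h0 must lie in supp(F)
        if all(p + h in f for h in g):
            out[p] = min(f[p + h] - g[h] for h in g)
    return out

F = {0: 1, 1: 3, 2: 2}
G = {-1: 0, 0: 1, 1: 0}                 # a small structuring function
print(dilate(F, G))
print(erode(F, G))
print(dilate(erode(F, G), G))           # F ⊖ G followed by ⊕ G
```

Taking a structuring function with value 0 on a set and $-\infty$ elsewhere, for instance `{-1: 0, 0: 0, 1: 0}`, reduces these formulas to the flat dilation and erosion by that set.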

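The operators $\delta_G : T^E \to T^E : F \mapsto F \oplus G$ and $\varepsilon_G : T^E \to T^E : F \mapsto
F \ominus G$ are called dilation and erosion by $G$. We can now define the binary
operations $\circ$ and $\bullet$ as for sets, by $F \circ G = (F \ominus G) \oplus G$ and $F \bullet G = (F \oplus G) \ominus G$,
leading thus to the opening by $G$, $\gamma_G : T^E \to T^E : F \mapsto F \circ G$, and the closing
by $G$, $\varphi_G : T^E \to T^E : F \mapsto F \bullet G$; note that


\[ F \circ G = \bigvee \{ G_{(p,t)} \mid (p,t) \in E \times T',\ G_{(p,t)} \le F \} , \]


which is analogous to the second line in Equation (14.4). We still have the
duality by inversion. Define the transpose or symmetrical $\check{G}$ of $G$ by $\check{G}(x) =
G(-x)$; we have the grey-level inversion $N$ on functions (given by $N(F)(p) =
-F(p)$); we get then


\[
\begin{aligned}
N(F \oplus G) &= N(F) \ominus \check{G} ; & N(F \ominus G) &= N(F) \oplus \check{G} ; \\
N(F \circ G) &= N(F) \bullet \check{G} ; & N(F \bullet G) &= N(F) \circ \check{G} .
\end{aligned}
\tag{14.14}
\]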


All properties of these operations $\oplus$, $\ominus$, $\circ$ and $\bullet$ in the case of sets, extend to the
case of functions. For example, the opening and closing by $G$ are idempotent.
Operators on functions $E \to T$ built from these operations, together with the
supremum and infimum, constitute what is called grey-level morphology or
functional morphology.

Note finally that the dilation and erosion of functions by a set structuring
element, Eqs. (14.10,14.11), are a particular case of dilation and erosion by
a structuring function, Eqs. (14.12,14.13). Given a set $B \subseteq E$, define the
function $B_0 : E \to T$ having value 0 on $B$, and $-\infty$ elsewhere:


\[ \forall x \in E, \qquad B_0(x) = \begin{cases} 0 & \text{if } x \in B, \\ -\infty & \text{if } x \notin B. \end{cases} \]


Then for every function $F : E \to T$, we have $F \oplus B = F \oplus B_0$ and $F \ominus B =
F \ominus B_0$. The function $B_0$ is thus called a flat structuring function. Hence flat
morphology is a particular case of grey-level morphology, with a restriction of
structuring functions to flat ones.

We explained above that flat operators behave on bright and dark parts of 
a grey-level image in the same way as the corresponding set operators do on 
foreground and background. This remains true here, but now the action is 
not only on the shape of these parts, but also on their grey-level profiles. For 
example, dilation and opening deform the grey-level profile on peaks, while 
erosion and closing do it on valleys. This is illustrated in Fig. 14.8; we see that 
the opening removes narrow peaks and the closing removes narrow valleys (as 
expected), but also the slope of jumps is reduced at the top with the opening, 
and at the bottom with the closing. 
Figure 14.8. Top left: The grey-level function $F$. Top right: The structuring function $G$.
Bottom left: $F \circ G$, the opening of $F$ (dashed) by $G$. Bottom right: $F \bullet G$, the closing of $F$
(dashed) by $G$.


For most practical problems concerning grey-level images, flat morpholog- 
ical operators are applied, instead of functional ones. Indeed, their expression 
is simpler (as it does not involve adding or subtracting grey-levels), and they 
work correctly for bounded grey-levels (functional ones can lead to overflow). 
In fact, flat operators have the same potential as functional ones for dealing with 
spatial shapes of objects in a grey-level image. However, there are sometimes 
situations where the grey-level profile of objects matters as much as their shape, 
and in such situations one will use functional morphological operators. 

Let us say a few words about the computational complexity of morphological
operations. Without any optimization, the complexity of the Minkowski
operations is in $O(N \times S)$, where $N$ is the size of the image and $S$ is the size
of the structuring element. However, thanks to various approaches, such as the
decomposition of structuring elements, or the use of redundancies, it is possible
for some particular types of structuring elements (say, rectangles), to have
a complexity in $O(N \times \sqrt{S})$, $O(N \times \log S)$, or even $O(N)$. In digital grids
using the usual connectivities based on neighbourhoods, geodesical reconstruction
has a complexity in $O(N)$, thanks to the use of queues. In the binary case,
pixels are inserted in the queue as soon as they receive a connected component
label, and leave the queue when they transmit the label to their neighbours. For
grey-level reconstruction, one uses a set of queues, one for each grey-level,
with a priority order corresponding to the grey-level (e.g., for reconstruction
by dilation, priority is given to the highest grey-levels).
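To make the idea of decomposing a structuring element concrete, the following Python sketch compares a direct flat dilation by a rectangle with the composition of two one-dimensional dilations, by a horizontal and by a vertical segment. The function names and the clipped border handling are assumptions of this example. The decomposed version costs roughly $N \times (S_x + S_y)$ operations instead of $N \times S_x \times S_y$; it does not reach the $O(N)$ bound mentioned above, which requires further techniques.

```python
def dilate_rows(img, r):
    """Flat dilation of every row by the segment {-r, ..., r} (borders clipped)."""
    w = len(img[0])
    return [[max(row[max(x - r, 0):x + r + 1]) for x in range(w)] for row in img]

def transpose(img):
    return [list(col) for col in zip(*img)]

def dilate_rect_separable(img, rx, ry):
    """Dilation by a (2*rx+1) x (2*ry+1) rectangle as two 1-D dilations."""
    tmp = dilate_rows(img, rx)
    return transpose(dilate_rows(transpose(tmp), ry))

def dilate_rect_direct(img, rx, ry):
    """Same dilation, scanning the whole rectangle at every pixel."""
    h, w = len(img), len(img[0])
    return [[max(img[j][i]
                 for j in range(max(y - ry, 0), min(y + ry, h - 1) + 1)
                 for i in range(max(x - rx, 0), min(x + rx, w - 1) + 1))
             for x in range(w)] for y in range(h)]

img = [[0, 0, 0, 0],
       [0, 9, 0, 0],
       [0, 0, 0, 3]]
print(dilate_rect_separable(img, 1, 1) == dilate_rect_direct(img, 1, 1))
```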
2. Algebra


We saw in the Introduction how to define morphological operations on sets by 
combinations of unions, intersections and translations, and how these operations 
can be adapted to numerical functions by translating union and intersection into 
supremum and infimum. For many practical applications, such a framework 
resting on the analogy between sets and numerical functions, where foreground 
and background correspond to bright and dark image areas, is sufficient. How- 
ever, if one wants to deepen the understanding of morphology, two questions 
come forward: 


■ Instead of extending the morphology on sets to the one on functions
“by analogy”, is there not a systematic approach that would give both
as particular cases? Indeed, the early studies of grey-level morphology
analysed the latter in terms of umbras of functions; they even attempted to
make grey-level morphology a particular case of set morphology applied
to umbras; however the correspondence between operations on functions
and those on umbras is exact only for discrete grey-levels (Ronse, 1990).


■ Can we define similarly morphological operations on other types of objects?
For example, on the family $\mathcal{F}(\mathbb{R}^n)$ of closed subsets of $\mathbb{R}^n$: here
an intersection of closed sets is closed, while a union of closed sets is
not necessarily closed, but one could take instead the closure of their union; can we
adapt Minkowski addition and subtraction in order to obtain all other
morphological operators?


The answer to both questions is yes. Morphology on sets, on functions, and 
on several other types of objects (closed sets, convex sets, etc.) can be seen 
as particular cases of a general framework based on complete lattices. This 
was first introduced by Serra, 1988, then developed by Heijmans and Ronse, 
1990, Ronse and Heijmans, 1991, Heijmans, 1991 and Heijmans, 1994. In this 
section, we present the algebraic fundamentals of mathematical morphology. 


2.1 Complete lattice framework for images and operators 


The basic idea is to generalize the notions of inclusion, union and intersection 
of sets, to other objects. 


DEFINITION 14.1 A partial order is a relation $\le$ that is reflexive, anti-symmetrical
and transitive. Write $\ge$ for the inverse of $\le$ ($x \ge y$ iff $y \le x$); it
is also a partial order. A partially ordered set or poset is a pair $(X, \le)$, where
$X$ is a set and $\le$ a partial order on $X$.

A complete lattice is a poset $(X, \le)$ in which every non-void part $Y$ of $X$ has
a least upper bound or supremum $\bigvee Y$, and a greatest lower bound or infimum
$\bigwedge Y$.

It follows in particular that a complete lattice $(X, \le)$ has always a greatest
element, namely $\bigvee X$, and a least element, namely $\bigwedge X$. By analogy with
Boolean algebras, the greatest (resp., least) element is also called the one (resp.,
zero), and it is written 1 or $\top$ (resp., 0 or $\bot$). Note also that every $x \in X$ is
both lower bound and upper bound of the empty set. Hence:


\[ 0 = \bigwedge X = \bigvee \emptyset \qquad\text{and}\qquad 1 = \bigvee X = \bigwedge \emptyset . \]


A complete sublattice of $X$ is given by a subset $Y$ of $X$, such that with the
restriction to $Y$ of the order $\le$ on $X$, $(Y, \le)$ is a complete lattice in which the
supremum and infimum operations, as well as the zero and one, are identical
to those in $X$; equivalently, it is a subset $Y$ of $X$ such that for every $Z \subseteq Y$,
$\bigvee Z, \bigwedge Z \in Y$ (also for $Z = \emptyset$, i.e., $0, 1 \in Y$).

Some examples of complete lattices are particularly useful for mathematical 
morphology: 


■ The power set $\mathcal{P}(E)$, ordered by the set inclusion; here the supremum
and infimum are the union and intersection. It represents the family of
binary images.


■ The grey-level sets $\overline{\mathbb{R}}$, $\overline{\mathbb{Z}}$, $[a,b]$ and $[a \ldots b]$ considered in Sec. 1.2 are
complete lattices, and $\overline{\mathbb{Z}}$ is a complete sublattice of $\overline{\mathbb{R}}$.


■ Given $T$ one of the above complete lattices, and a space $E$, consider the
set $T^E$ of numerical functions $E \to T$. It is a complete lattice, in fact a
power lattice of $T$, in the sense that its ordering, supremum and infimum
derive from those on $T$ by pointwise application, see Eqs. (14.7, 14.8).
It represents the family of grey-level images.


■ The family $\mathcal{F}(\mathbb{R}^n)$ of closed subsets of $\mathbb{R}^n$ is a complete lattice for the
ordering by inclusion; here the infimum of a family of closed sets is its
intersection, while its supremum is the closure of its union. Despite the
same ordering as in $\mathcal{P}(E)$, it is not a complete sublattice of $\mathcal{P}(E)$, because
the supremum operation is not the same. Many metrics and topologies on
sets are defined properly only for closed sets (Ronse and Tajine, 2004).


■ We can represent RGB colours as triples $(r, g, b)$ of numerical values, so
$T^3$ is the complete lattice of RGB colours, with componentwise ordering;
now we represent a RGB colour image as a function $E \to T^3$ associating
to each point $p \in E$ a triple $(r(p), g(p), b(p))$ coding the RGB colour of
$p$; thus the family of RGB colour images constitutes the complete lattice
$(T^3)^E$, with the componentwise ordering.


■ Given a set $E$, the set $\Pi(E)$ of partitions of $E$ is ordered as follows:
given two partitions $\pi_1$ and $\pi_2$, we write $\pi_1 \le \pi_2$ if $\pi_1$ is finer than $\pi_2$,
or equivalently $\pi_2$ is coarser than $\pi_1$; this means that every class of $\pi_1$ is
included in a class of $\pi_2$. Then $\Pi(E)$ is a complete lattice. In fact, there
is a one-to-one correspondence between partitions of $E$ and equivalence
relations on $E$; then the lattice structure of $\Pi(E)$ corresponds by bijection
to the one of the family $\mathrm{Equiv}(E)$ of equivalence relations on $E$, considered
as a subset of $E^2$: the ordering on partitions corresponds to the
inclusion order between equivalences, and the infimum and supremum
of a family of partitions correspond respectively to the intersection and to
the transitive closure of the union, of the associated equivalence relations.
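The lattice of partitions is perhaps the least familiar of these examples, so here is a small Python sketch of its order, infimum and supremum for two partitions of a finite set. A partition is represented, as an assumption of this example, by a set of frozensets; the function names are hypothetical. The infimum collects the non-empty pairwise intersections of classes, and the supremum merges all classes that overlap, which implements the transitive closure of the union of the two equivalence relations.

```python
def is_finer(p1, p2):
    """p1 <= p2: every class of p1 is included in a class of p2."""
    return all(any(c1 <= c2 for c2 in p2) for c1 in p1)

def inf_partition(p1, p2):
    """Infimum: non-empty intersections of a class of p1 with a class of p2."""
    return {c1 & c2 for c1 in p1 for c2 in p2 if c1 & c2}

def sup_partition(p1, p2):
    """Supremum: repeatedly merge the classes of p1 touched by each class of p2."""
    classes = [set(c) for c in p1]
    for c in p2:
        touching = [k for k in classes if k & c]
        merged = set(c).union(*touching)
        classes = [k for k in classes if not (k & c)] + [merged]
    return {frozenset(k) for k in classes}

p1 = {frozenset({1, 2}), frozenset({3, 4}), frozenset({5}), frozenset({6})}
p2 = {frozenset({1}), frozenset({2, 3}), frozenset({4}), frozenset({5, 6})}
print(sorted(map(sorted, inf_partition(p1, p2))))
print(sorted(map(sorted, sup_partition(p1, p2))))
```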


Image processing operations can then be viewed as mappings $L \to L$, where
$L$ is the complete lattice of images under consideration; we can also consider
mappings from one complete lattice to another, for example, $T^E \to \mathcal{P}(E)$
(binarization of grey-level images), or $T^E \to \Pi(E)$ (segmentation). Such
mappings are usually written by Greek letters, and are called operators; an
operator is said to be on $L$ when it is a mapping $L \to L$. Given a set $L$ (which can be
a complete lattice or not) and a complete lattice $M$, the set $M^L$ of operators $L \to
M$ is a complete lattice, which inherits the order and complete lattice structure
of $M$ “componentwise”, as happened for functions, see Eqs. (14.7,14.8): for
two operators $\eta, \zeta : L \to M$, we have


\[ \eta \le \zeta \iff \forall x \in L, \quad \eta(x) \le \zeta(x) , \]


and for a family $\psi_i$ ($i \in I$) of operators $L \to M$, their supremum and infimum
are given by:


\[ \bigvee_{i \in I} \psi_i : L \to M : x \mapsto \bigvee_{i \in I} \psi_i(x) \qquad\text{and}\qquad \bigwedge_{i \in I} \psi_i : L \to M : x \mapsto \bigwedge_{i \in I} \psi_i(x) . \]

There is another operation on operators, composition; given $\eta : L \to M$ and
$\zeta : M \to N$, the composition of $\eta$ followed by $\zeta$ is the operator $\zeta\eta : L \to
N : x \mapsto \zeta(\eta(x))$. Of particular interest is the composition of operators
on $L$: the composition of two operators $\zeta, \eta : L \to L$ is always defined,
and this gives an associative operation having as neutral element the identity
$\mathrm{id} : L \to L : x \mapsto x$, in other words the set $L^L$ of operators on $L$ is what one
calls a monoid. Given an operator $\psi$ on $L$, we define recursively the power $\psi^n$
for every $n \in \mathbb{N}$: $\psi^0 = \mathrm{id}$, $\psi^{n+1} = \psi\psi^n$. Let us recall some morphological
terminology (Serra, 1988; Heijmans, 1994):


DEFINITION 14.2 Given two posets $L$ and $M$, an operator $\psi : L \to M$ is


■ increasing (or isotone, Birkhoff, 1995) if for all $x, y \in L$, we have
$x \le y \implies \psi(x) \le \psi(y)$.

■ decreasing (or antitone, Birkhoff, 1995) if for all $x, y \in L$, we have
$x \le y \implies \psi(x) \ge \psi(y)$.

■ an isomorphism if $\psi$ is an increasing bijection, whose inverse $\psi^{-1}$ is
increasing.


■ a dual isomorphism if $\psi$ is a decreasing bijection, whose inverse $\psi^{-1}$ is
decreasing.


Given a poset $L$, an operator $\psi$ on $L$ is
■ extensive if $\psi \ge \mathrm{id}$, that is, for every $x \in L$ we have $\psi(x) \ge x$.
■ anti-extensive if $\psi \le \mathrm{id}$, that is, for every $x \in L$ we have $\psi(x) \le x$.
■ an automorphism of $L$ if $\psi$ is an isomorphism $L \to L$.
■ a dual automorphism of $L$ if $\psi$ is a dual isomorphism $L \to L$.


Given a set $L$, an operator $\psi$ on $L$ is idempotent if $\psi\psi = \psi$, that is, for every
$x \in L$ we have $\psi(\psi(x)) = \psi(x)$.
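On a small finite lattice, the properties of Definition 14.2 can be checked by brute force. The Python sketch below does so in the power set of a three-element set, ordered by inclusion; the helper names and the three example operators are assumptions chosen for illustration.

```python
from itertools import combinations

def powerset(E):
    """All subsets of E as frozensets; (P(E), ⊆) is a complete lattice."""
    items = sorted(E)
    return [frozenset(c) for r in range(len(items) + 1) for c in combinations(items, r)]

def is_increasing(psi, L):
    return all(psi(x) <= psi(y) for x in L for y in L if x <= y)

def is_decreasing(psi, L):
    return all(psi(x) >= psi(y) for x in L for y in L if x <= y)

def is_extensive(psi, L):
    return all(x <= psi(x) for x in L)

def is_anti_extensive(psi, L):
    return all(psi(x) <= x for x in L)

def is_idempotent(psi, L):
    return all(psi(psi(x)) == psi(x) for x in L)

E = frozenset({0, 1, 2})
L = powerset(E)
add_zero = lambda X: X | {0}     # increasing, extensive, idempotent
drop_zero = lambda X: X - {0}    # increasing, anti-extensive, idempotent
complement = lambda X: E - X     # decreasing bijection, equal to its own inverse

for name, psi in [("add_zero", add_zero), ("drop_zero", drop_zero), ("complement", complement)]:
    print(name, is_increasing(psi, L), is_extensive(psi, L), is_idempotent(psi, L))
```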


Note that if $L$ is a complete lattice and $\psi$ is an increasing operator on $L$, then
we have (Heijmans and Ronse, 1990):


\[ \forall (x_i,\ i \in I) \subseteq L, \qquad \psi\Bigl(\bigvee_{i \in I} x_i\Bigr) \ge \bigvee_{i \in I} \psi(x_i) \quad\text{and}\quad \psi\Bigl(\bigwedge_{i \in I} x_i\Bigr) \le \bigwedge_{i \in I} \psi(x_i) . \tag{14.15} \]


Some other properties and specific families of operators (in particular, dilations,
erosions, openings and closings) will be defined in the following subsections.

Given an operator $\psi$ on a set $L$, the invariance domain of $\psi$ is the set
$\mathrm{Inv}(\psi) = \{ x \in L \mid \psi(x) = x \}$. Given an operator $\psi : L \to M$, the
range (or image) of $\psi$ is the set of $\psi(x)$ for $x \in L$; we write it $\psi(L)$.

In the lattice $\mathcal{P}(E)$ of parts of a Euclidean or digital space $E = \mathbb{R}^n$ or $\mathbb{Z}^n$, the
dilation, erosion, opening and closing by a structuring element, Eqs. (14.1,14.4),
are translation-invariant, in other words they commute with any translation of
$E$. We can generalize this notion as follows. Let $T$ be a group of automorphisms
of the complete lattice $L$; in other words for every $\tau \in T$, $\tau$ is an automorphism
of $L$ and $\tau^{-1} \in T$, and for every $\tau_1, \tau_2 \in T$, $\tau_1\tau_2 \in T$. An operator $\psi$ on
$L$ is said to be $T$-invariant if it commutes with every element of $T$: $\forall \tau \in T$,
$\tau\psi = \psi\tau$.

There is an important principle: duality. We saw above that the inverse $\ge$ of
a partial order $\le$ is a partial order. Therefore every notion concerning posets
and complete lattices admits a dual, which is the same notion expressed w.r.t.
the inverse order $\ge$; as the inverse of $\ge$ is again $\le$, the duality is symmetrical.
For example, the dual of the supremum operation is the infimum operation (and
vice versa); for an operator, being extensive and being anti-extensive are dual
properties. Note that all notions relying only on composition of operators, and
not on order, are auto-dual; this is for example the case for the identity operator
and for the property of idempotence.

An inversion of a poset $L$ is a dual automorphism of $L$ which is its own
inverse, in other words a decreasing operator $\nu$ on $L$ such that $\nu^2 = \mathrm{id}$. Then
every operator $\psi$ on $L$ has a dual by inversion, $\psi^* = \nu\psi\nu$, whose properties
are dual to those of $\psi$. For example, the set complementation in $\mathcal{P}(E)$, and the
grey-level inversion (image negative) $N$ in $T^E$, are inversions; in $\mathcal{P}(E)$ the
dilation (resp., opening) by $B$ is the dual by complementation of the erosion
(resp., closing) by $\check{B}$, Eqs. (14.2,14.5), and in $T^E$ the dilation and opening by
$G$ are the duals by inversion of the erosion and closing by $\check{G}$, Equation (14.14).


2.2 Moore families, algebraic closings and openings 


There are many mathematical situations where an object is “closed” under 
some operation: a closed set in a topological space, a convex set in $\mathbb{R}^n$, a
subgroup of a group, a transitive relation. The interesting thing is that when an 
object is not closed, one can close it in a unique smallest possible way. From 
the algebraic point of view, it is thus fundamental to describe both the structure 
of the family of closed sets, and the properties of the closure operator. 


DEFINITION 14.3 Let $L$ be a poset.


1 A subset $M$ of $L$ is a Moore family if every element of $L$ has a least upper
bound in $M$:


\[ \forall x \in L, \quad \bigl( \exists y \in M,\ y \ge x \text{ and } [\forall z \in M,\ (z \ge x \implies z \ge y)] \bigr) . \]


2 A closing (or closure operator) on $L$ is an increasing, extensive and
idempotent operator $L \to L$.


The Moore family stands for the family of closed objects. The equivalence
between the two concepts of closed object and closing an object, is expressed
as follows:


PROPOSITION 14.4 Let $L$ be a poset. There is a one-to-one correspondence
between Moore families in $L$ and closings on $L$, given as follows:


■ To a Moore family $M$ we associate the closing $\varphi$ defined by setting for
every $x \in L$: $\varphi(x)$ is equal to the least $y \in M$ such that $y \ge x$.


■ To a closing $\varphi$ one associates the Moore family $M$ which is the invariance
domain of $\varphi$: $M = \mathrm{Inv}(\varphi)$.


Note that $M = \{ \varphi(x) \mid x \in L \}$. Let us now consider the case where $L$ is a
complete lattice.




THEOREM 14.5 Let $L$ be a complete lattice. A subset $M$ of $L$ is a Moore
family iff $M$ is closed under the infimum operation:


\[ \forall S \subseteq M, \qquad \bigwedge S \in M . \]


In particular, $\bigwedge \emptyset = 1 \in M$. Given a Moore family $M$ corresponding to a
closing $\varphi$, $(M, \le)$ is a complete lattice with greatest element 1 and least element
$\varphi(0) = \bigwedge M$, and where the supremum and infimum of a family $N \subseteq M$ are
given by $\varphi(\bigvee N)$ and $\bigwedge N$, respectively.


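Note that $\varphi(1) = 1$ and $\varphi(\bigwedge N) = \bigwedge N$. Let us mention also the following
property:


\[ \forall X \subseteq L, \qquad \varphi\Bigl( \bigvee_{x \in X} \varphi(x) \Bigr) = \varphi\Bigl( \bigvee X \Bigr) . \tag{14.16} \]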
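The correspondence between Moore families and closings can be verified by brute force on a small example. In the Python sketch below, which is only an illustration, the lattice is the power set of $\{0, 1, 2, 3\}$ and the Moore family is assumed to consist of the "intervals" (runs of consecutive integers) together with the empty set; this family is closed under intersection and contains the whole set. The names `closing` and `sup_in_M` are hypothetical.

```python
from itertools import combinations

E = frozenset(range(4))
L = [frozenset(c) for r in range(len(E) + 1) for c in combinations(sorted(E), r)]

# Assumed Moore family: the empty set and the sets of consecutive integers.
M = [x for x in L if not x or max(x) - min(x) + 1 == len(x)]

def closing(x):
    """phi(x): least element of M containing x, i.e. the intersection of all of them."""
    return frozenset.intersection(*[y for y in M if x <= y])

def sup_in_M(family):
    """Supremum in (M, ⊆): the closing of the union."""
    return closing(frozenset().union(*family))

print(sorted(closing(frozenset({0, 2}))))                      # interval hull of {0, 2}
print({x for x in L if closing(x) == x} == set(M))             # Inv(phi) = M
print(all(closing(closing(a) | closing(b)) == closing(a | b)   # Equation (14.16), for pairs
          for a in L for b in L))
```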


Let us illustrate the above results with the family $\mathcal{F}$ of closed sets in a topological
space $E$. Clearly $\mathcal{F}$ is a Moore family of $\mathcal{P}(E)$ (ordered by inclusion),
which means that $\mathcal{F}$ is closed under arbitrary intersections, and contains the
empty intersection $\bigcap \emptyset = E$; now $\mathcal{F}$ corresponds to a closing, which is the
topological closure operator $\mathrm{cl}$, where for $X \subseteq E$, $\mathrm{cl}(X)$ is the least element
of $\mathcal{F}$ containing $X$. However the Moore family $\mathcal{F}$ has two further properties:


1 $\emptyset \in \mathcal{F}$; by Theorem 14.5, this is equivalent to $\mathrm{cl}(\emptyset) = \emptyset$.


2 $\mathcal{F}$ is closed under binary union: for $C_1, C_2 \in \mathcal{F}$, $C_1 \cup C_2 \in \mathcal{F}$. By
Theorem 14.5, this means that $\mathrm{cl}(C_1 \cup C_2) = C_1 \cup C_2$. Now $\mathcal{F}$ is the
set of $\mathrm{cl}(X)$ for $X \in \mathcal{P}(E)$, and we can write $C_i = \mathrm{cl}(X_i)$, so in view
of Equation (14.16), the condition is equivalent to:


\[ \forall X_1, X_2 \in \mathcal{P}(E), \qquad \mathrm{cl}(X_1 \cup X_2) = \mathrm{cl}(X_1) \cup \mathrm{cl}(X_2) . \]


Therefore one can characterize a topology, given by the family of closed sets,
through the associated closure operator $\mathrm{cl}$, which must be a closing (increasing,
idempotent and extensive), preserve the empty set, and distribute over binary union
(Everett, 1944).

Let us now consider the dual concepts and results: 


■ In a poset $L$, a dual Moore family is a subset $M$ such that every element
of $L$ has a greatest lower bound in $M$.


■ The dual of a closing is an opening on $L$: an increasing, anti-extensive
and idempotent operator $L \to L$.


■ There is a one-to-one correspondence between dual Moore families in $L$
and openings on $L$, where the corresponding opening $\gamma$ and dual Moore
family $M$ verify: $\gamma(x)$ is equal to the greatest $y \in M$ such that $y \le x$,
and $M$ is the invariance domain of $\gamma$.


■ In a complete lattice $L$, $M$ is a dual Moore family iff $M$ is closed under
the supremum operation; in particular $0 \in M$. Given a dual Moore
family $M$ corresponding to an opening $\gamma$, $(M, \le)$ is a complete lattice
with greatest element $\gamma(1) = \bigvee M$ and least element 0, and where the
supremum and infimum of a family $N \subseteq M$ are given by $\bigvee N$ and
$\gamma(\bigwedge N)$, respectively.


■ A topology on a space $E$ can be characterized by its topological interior
operation $\mathrm{int}$, which is an opening verifying $\mathrm{int}(E) = E$ and $\mathrm{int}(X_1 \cap
X_2) = \mathrm{int}(X_1) \cap \mathrm{int}(X_2)$.


Let us now describe the structure of the families of openings and closings. 
This will lead to some standard methods to construct them. 


PROPOSITION 14.6 Let $L$ be a complete lattice.


1 The supremum of any family of openings on $L$ is an opening, and the
set of openings on $L$ is a dual Moore family in $L^L$. For every increasing
operator $\psi$ on $L$, the greatest opening $\le \psi$ is $\Gamma(\psi)$; it verifies
$\mathrm{Inv}(\Gamma(\psi)) = \mathrm{Inv}(\mathrm{id} \wedge \psi)$.


2 The infimum of any family of closings is a closing, and the set of closings
on $L$ is a Moore family in $L^L$. For every increasing operator $\psi$ on $L$,
the least closing $\ge \psi$ is $\Phi(\psi)$; it verifies $\mathrm{Inv}(\Phi(\psi)) = \mathrm{Inv}(\mathrm{id} \vee \psi)$.


By Proposition 14.4 (and its dual), for any $x \in L$, $\Gamma(\psi)(x)$ is the greatest
$y \in \mathrm{Inv}(\mathrm{id} \wedge \psi)$ such that $y \le x$, and $\Phi(\psi)(x)$ is the least $y \in \mathrm{Inv}(\mathrm{id} \vee \psi)$
such that $y \ge x$.

By Theorem 14.5 (and its dual), the set of openings (resp., closings) is a
complete lattice, where $\mathrm{id}$ is the greatest opening (resp., the least closing), and
the least opening is the constant operator $L \to L : x \mapsto 0$ (resp., the greatest
closing is the constant operator $L \to L : x \mapsto 1$).

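One can construct openings and closings by specifying some of their invariants.
Let $b \in L$ and let $T$ be a group of automorphisms of $L$. The structural
opening and closing $\gamma_{b,T}$ and $\varphi_{b,T}$ are defined by


\[
\forall x \in L, \qquad
\begin{cases}
\gamma_{b,T}(x) = \bigvee \{ \tau(b) \mid \tau \in T,\ \tau(b) \le x \} , \\
\varphi_{b,T}(x) = \bigwedge \{ \tau(b) \mid \tau \in T,\ \tau(b) \ge x \} .
\end{cases}
\tag{14.17}
\]


More generally, given a family $S \subseteq L$, we define then


\[ \gamma_{S,T} = \bigvee_{s \in S} \gamma_{s,T} \qquad\text{and}\qquad \varphi_{S,T} = \bigwedge_{s \in S} \varphi_{s,T} , \tag{14.18} \]

and we have


\[
\forall x \in L, \qquad
\begin{cases}
\gamma_{S,T}(x) = \bigvee \{ \tau(s) \mid s \in S,\ \tau \in T,\ \tau(s) \le x \} , \\
\varphi_{S,T}(x) = \bigwedge \{ \tau(s) \mid s \in S,\ \tau \in T,\ \tau(s) \ge x \} .
\end{cases}
\tag{14.19}
\]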


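These operators are a $T$-invariant opening and closing, respectively, and in fact
every $T$-invariant opening and closing takes this form:


PROPOSITION 14.7 Let $L$ be a complete lattice. For any $S \subseteq L$, let $\langle S \rangle_T^{\sup}$
(resp., $\langle S \rangle_T^{\inf}$) be the least subset of $L$ containing $S$ which is closed under $T$
and under the supremum (resp., infimum) operation. We have


\[ \langle S \rangle_T^{\sup} = \Bigl\{ \bigvee_{(\tau,s) \in X} \tau(s) \Bigm| X \subseteq T \times S \Bigr\} \]


\[ \text{and}\qquad \langle S \rangle_T^{\inf} = \Bigl\{ \bigwedge_{(\tau,s) \in X} \tau(s) \Bigm| X \subseteq T \times S \Bigr\} . \]


Then $\gamma_{S,T}$ and $\varphi_{S,T}$ are a $T$-invariant opening and closing, respectively, with
these sets as their respective invariance domain:


\[ \mathrm{Inv}(\gamma_{S,T}) = \langle S \rangle_T^{\sup} \qquad\text{and}\qquad \mathrm{Inv}(\varphi_{S,T}) = \langle S \rangle_T^{\inf} . \]


Conversely, every $T$-invariant opening $\gamma$ and closing $\varphi$ take this form: $\gamma =
\gamma_{\mathrm{Inv}(\gamma),T}$ and $\varphi = \varphi_{\mathrm{Inv}(\varphi),T}$.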


A well-known example is when $L = \mathcal{P}(E)$, for $E = \mathbb{R}^n$ or $\mathbb{Z}^n$, and $T$ is
the group of translations of $E$. Then the structural opening and closing give
the opening and closing by a structuring element: for every $X, B \in \mathcal{P}(E)$
we have $\gamma_{B,T}(X) = X \circ B$ and $\varphi_{B,T}(X) = (X^c \circ B^c)^c = X \bullet \check{B}^c$ (with
$b \in \check{B}^c \iff -b \notin B$). For a family $S$ of structuring elements, we get the
openings and closings of the form given in Equation (14.6). We obtain thus
the well-known fact that every translation-invariant opening (resp., closing) is
a union of openings (resp., intersection of closings) by structuring elements.

When $T$ reduces to the identity $\mathrm{id}$, we simply write $\gamma_b$, $\varphi_b$, $\gamma_S$ and $\varphi_S$. Then
the above result characterizes arbitrary openings and closings as being $\gamma_S$ and
$\varphi_S$ for some $S \subseteq L$.
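For finite subsets of $\mathbb{Z}$ and the group of translations, the structural opening of Equation (14.17) can be computed literally as a union of translates, and compared with the erosion followed by the dilation. The Python sketch below is an illustration under these assumptions; the function names are hypothetical.

```python
def translate(B, p):
    """B_p: the translate of B by p."""
    return frozenset(b + p for b in B)

def structural_opening(X, B):
    """Union of all translates of B that are included in X."""
    X = frozenset(X)
    b0 = min(B)
    out = set()
    for x in X:                        # a translate inside X must send b0 onto some x in X
        Bp = translate(B, x - b0)
        if Bp <= X:
            out |= Bp
    return frozenset(out)

def erode(X, B):
    """X ⊖ B = {p | B_p ⊆ X}."""
    X = frozenset(X)
    b0 = min(B)
    return frozenset(x - b0 for x in X if translate(B, x - b0) <= X)

def dilate(X, B):
    """X ⊕ B = {x + b | x in X, b in B}."""
    return frozenset(x + b for x in X for b in B)

X = {0, 1, 2, 5, 7, 8, 9, 10}
B = {0, 1, 2}
print(sorted(structural_opening(X, B)))
print(structural_opening(X, B) == dilate(erode(X, B), B))
```

The isolated point 5 is removed because no translate of $B$ containing it fits inside $X$, while the two runs of length at least 3 are left unchanged.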

In the next subsection, we will see how openings and closings arise from 
dilations and erosions. 


2.3 Galois connections and adjunctions 


At the beginning of the 19th century, Évariste Galois built a connection
between fields of numbers generated by roots of equations, and groups of permutations
of these roots. This type of correspondence is the first example of a
general technique used in algebra to build an association between two types of
structures. It has thus been named after him.


DEFINITION 14.8 Let $A$ and $B$ be two posets, with two operators $\alpha : B \to A$
and $\beta : A \to B$. We say that $\alpha$ and $\beta$ form a Galois connection if


\[ \forall a \in A,\ \forall b \in B, \qquad a \le \alpha(b) \iff b \le \beta(a) . \tag{14.20} \]


Note that $\alpha$ and $\beta$ play symmetrical roles. Galois connections are often used
in mathematical morphology to establish a dual isomorphism between two types
of structures, thanks to the following result:


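PROPOSITION 14.9 Let A and B be two posets, and let $\alpha : B \to A$ and
$\beta : A \to B$ form a Galois connection. Then:


1 $\alpha$ and $\beta$ are decreasing, $\alpha = \alpha\beta\alpha$ and $\beta = \beta\alpha\beta$.


2 $\alpha\beta$ is a closing on A, $\beta\alpha$ is a closing on B, $\mathrm{Inv}(\alpha\beta) = \alpha(B)$ and
$\mathrm{Inv}(\beta\alpha) = \beta(A)$ (so that $\alpha(B)$ and $\beta(A)$ are Moore families).


3 The restriction of $\beta$ to $\alpha(B)$ is a dual isomorphism $\alpha(B) \to \beta(A)$ whose
inverse $\beta(A) \to \alpha(B)$ is the restriction of $\alpha$ to $\beta(A)$.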


One can generally characterize the types of maps $\alpha$ and $\beta$ which may appear
in a Galois connection, but in the case of complete lattices, this characterization
is straightforward:


DEFINITION 14.10 Let A and B be complete lattices. An operator $\alpha : B \to A$
is a Galois map if it exchanges supremum and infimum:


$\forall (x_i, i \in I) \subseteq B,\quad \alpha\Bigl(\bigvee_{i \in I} x_i\Bigr) = \bigwedge_{i \in I} \alpha(x_i).$


In particular (for $I = \emptyset$), $\alpha$ maps the least element $0_B$ of B onto the greatest
element $1_A$ of A.


PROPOSITION 14.11 Let A and B be complete lattices. Then:


1 Given $\alpha : B \to A$ and $\beta : A \to B$ forming a Galois connection, $\alpha$ and
$\beta$ are Galois maps.


2 Conversely, given a Galois map $\alpha : B \to A$, there is a unique Galois
map $\beta : A \to B$ such that $\alpha$ and $\beta$ form a Galois connection (and vice
versa).


3 Given $\alpha_1, \alpha_2 : B \to A$ and $\beta_1, \beta_2 : A \to B$ such that $\alpha_i$ and $\beta_i$ form a
Galois connection for i = 1, 2, we have $\alpha_1 \le \alpha_2 \iff \beta_1 \le \beta_2$.


4 Given $\alpha_i : B \to A$ and $\beta_i : A \to B$ forming a Galois connection for
$i \in I$, $\bigwedge_{i \in I} \alpha_i$ and $\bigwedge_{i \in I} \beta_i$ form a Galois connection.


In other words, Galois maps form a Moore family in the complete lattice of
operators $A \to B$ (or $B \to A$), and Galois connection establishes an isomorphism
between the two complete lattices of Galois maps $A \to B$ and $B \to A$.


Of particular interest are Galois connections between subsets of two sets,
which were characterized by Ore, 1944 in terms of a relation between the
points of the two sets:


THEOREM 14.12 Let V and W be two sets.

1 Given a relation $\rho$ between elements of V and of W, define

$\alpha_\rho : \mathcal{P}(W) \to \mathcal{P}(V) : Y \mapsto \{ v \in V \mid \forall w \in Y,\ v \mathrel{\rho} w \},$
$\beta_\rho : \mathcal{P}(V) \to \mathcal{P}(W) : X \mapsto \{ w \in W \mid \forall v \in X,\ v \mathrel{\rho} w \}.$

Then $\alpha_\rho$ and $\beta_\rho$ form a Galois connection.


2 Conversely, given $\alpha : \mathcal{P}(W) \to \mathcal{P}(V)$ and $\beta : \mathcal{P}(V) \to \mathcal{P}(W)$ forming
a Galois connection, there is a unique relation $\rho$ between elements of V
and of W, such that $\alpha = \alpha_\rho$ and $\beta = \beta_\rho$; the relation $\rho$ is given by

$\forall v \in V,\ \forall w \in W,\quad v \mathrel{\rho} w \iff v \in \alpha(\{w\}) \iff w \in \beta(\{v\}).$


Following Birkhoff, 1995, the Galois maps $\alpha_\rho$ and $\beta_\rho$ are called polarities.
Galois connections between sets, expressed in such a form, arise in many aspects
of mathematics and computer science. See for example Sec. 3.1.
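
Theorem 14.12 can be checked by brute force on small finite sets. The following Python sketch is only an illustration: the sets V and W, the relation rho and all function names are hypothetical choices made here, not taken from the text. It builds the two polarities, verifies the equivalence (14.20) over all pairs of subsets, and recovers the relation from the images of singletons as in item 2.

```python
from itertools import chain, combinations

def powerset(s):
    """All subsets of s, as frozensets."""
    items = list(s)
    subsets = chain.from_iterable(
        combinations(items, k) for k in range(len(items) + 1))
    return [frozenset(c) for c in subsets]

def alpha(rho, V, Y):
    """alpha_rho(Y): the elements v of V with v rho w for every w in Y."""
    return frozenset(v for v in V if all((v, w) in rho for w in Y))

def beta(rho, W, X):
    """beta_rho(X): the elements w of W with v rho w for every v in X."""
    return frozenset(w for w in W if all((v, w) in rho for v in X))

# Hypothetical toy data: a relation between V and W given as a set of pairs.
V = {1, 2, 3}
W = {"a", "b"}
rho = {(1, "a"), (2, "a"), (2, "b")}

# Equation (14.20) on the power sets ordered by inclusion.
is_galois = all(
    (X <= alpha(rho, V, Y)) == (Y <= beta(rho, W, X))
    for X in powerset(V) for Y in powerset(W)
)

# Item 2: the relation is recovered from the images of singletons.
rho_recovered = {(v, w) for v in V for w in W if v in alpha(rho, V, {w})}
```

Note that alpha applied to the empty set returns the whole of V, in agreement with the remark following Definition 14.10.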

We turn now to the notion of adjunction, which is “semi-dual” to the one
of Galois connection, in the sense that we reverse the ordering on one of the
posets, but not on the other.


DEFINITION 14.13 Let A and B be two posets, with two operators $\delta : A \to B$
and $\varepsilon : B \to A$. We say that $(\varepsilon, \delta)$ is an adjunction if


(14.21)  $\forall a \in A,\ \forall b \in B,\quad \delta(a) \le b \iff a \le \varepsilon(b).$

We say that $\delta$ is lower adjoint of $\varepsilon$, and $\varepsilon$ is upper adjoint of $\delta$.


Compared with Galois connections (see Definition 14.8), we have reversed
the ordering on B, since we have $\delta(a) \le b$ instead of $b \le \delta(a)$. Hence $\varepsilon$ and $\delta$
do not play symmetrical roles, which is why we write the ordered pair $(\varepsilon, \delta)$. We
obtain then the analogue of Proposition 14.9:


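PROPOSITION 14.14 Let A and B be two posets, and let $\delta : A \to B$ and
$\varepsilon : B \to A$ be such that $(\varepsilon, \delta)$ is an adjunction. Then:


1 $\varepsilon$ and $\delta$ are increasing, $\varepsilon = \varepsilon\delta\varepsilon$ and $\delta = \delta\varepsilon\delta$.


2 $\varepsilon\delta$ is a closing on A, $\delta\varepsilon$ is an opening on B, $\mathrm{Inv}(\varepsilon\delta) = \varepsilon(B)$ and
$\mathrm{Inv}(\delta\varepsilon) = \delta(A)$ (so that $\varepsilon(B)$ is a Moore family and $\delta(A)$ is a dual
Moore family).


3 The restriction of $\delta$ to $\varepsilon(B)$ is an isomorphism $\varepsilon(B) \to \delta(A)$ whose
inverse $\delta(A) \to \varepsilon(B)$ is the restriction of $\varepsilon$ to $\delta(A)$.


PROPOSITION 14.15 Let L be a poset, T a group of automorphisms of L,
and $\varepsilon, \delta : L \to L$ such that $(\varepsilon, \delta)$ is an adjunction. Then $\varepsilon$ is T-invariant iff $\delta$
is T-invariant.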


Let us now characterize adjunctions in the case of complete lattices. 


DEFINITION 14.16 Let A and B be complete lattices.


1 An operator $\varepsilon : B \to A$ is an erosion if it commutes with the infimum
operation:


$\forall (x_i, i \in I) \subseteq B,\quad \varepsilon\Bigl(\bigwedge_{i \in I} x_i\Bigr) = \bigwedge_{i \in I} \varepsilon(x_i).$


In particular (for $I = \emptyset$), $\varepsilon$ maps the greatest element $1_B$ of B onto the
greatest element $1_A$ of A.


2 An operator $\delta : B \to A$ is a dilation if it commutes with the supremum
operation:


$\forall (x_i, i \in I) \subseteq B,\quad \delta\Bigl(\bigvee_{i \in I} x_i\Bigr) = \bigvee_{i \in I} \delta(x_i).$


In particular (for $I = \emptyset$), $\delta$ maps the least element $0_B$ of B onto the least
element $0_A$ of A.


Note that dilations and erosions are increasing. Also the set of $\delta(x)$ ($x \in B$)
is closed under the supremum operation, while the set of $\varepsilon(x)$ ($x \in B$) is closed
under the infimum operation. We obtain now the analogue of Proposition 14.11:


THEOREM 14.17 Let A and B be complete lattices. Then:


1 Given $\delta : A \to B$ and $\varepsilon : B \to A$ such that $(\varepsilon, \delta)$ is an adjunction, $\delta$ is
a dilation and $\varepsilon$ is an erosion.


2 Conversely, (a) given a dilation $\delta : A \to B$, there is a unique erosion
$\varepsilon : B \to A$ such that $(\varepsilon, \delta)$ is an adjunction, and
(b) given an erosion $\varepsilon : B \to A$, there is a unique dilation $\delta : A \to B$
such that $(\varepsilon, \delta)$ is an adjunction.


3 Given $\delta_1, \delta_2 : A \to B$ and $\varepsilon_1, \varepsilon_2 : B \to A$ such that $(\varepsilon_i, \delta_i)$ is an
adjunction for i = 1, 2, we have $\delta_1 \le \delta_2 \iff \varepsilon_1 \ge \varepsilon_2$.


4 Given $\delta_i : A \to B$ and $\varepsilon_i : B \to A$ such that $(\varepsilon_i, \delta_i)$ is an adjunction for
$i \in I$, $\bigl(\bigwedge_{i \in I} \varepsilon_i, \bigvee_{i \in I} \delta_i\bigr)$ is an adjunction.


In other words, in the complete lattice of operators $A \to B$ (or $B \to A$),
erosions form a Moore family, while dilations form a dual Moore family, and
adjunctions establish a dual isomorphism between the two complete lattices of
dilations $A \to B$ and erosions $B \to A$.


The classical example of adjunction is given by the erosion and dilation by
a structuring element or function, Eqs. (14.1, 14.10, 14.12), arising from the
Minkowski addition and subtraction. They are both translation-invariant (cf.
Proposition 14.15). Here $A = B = \mathcal{P}(E)$ or $T^E$. In fact, every
translation-invariant dilation/erosion on sets arises from Minkowski operations,
Equation (14.1), while for functions, every flat dilation/erosion invariant under
spatial translations takes the form of Equation (14.10), and every dilation/erosion
invariant under both spatial and grey-level translations arises from Minkowski
operations, Equation (14.12).

In Heijmans and Ronse, 1990, there is a general study of complete lattices
where it is possible to define such Minkowski operations, and to obtain for them
properties similar to those verified for sets. Particular cases include of course
$\mathcal{P}(E)$ and $T^E$ ($E = \mathbb{R}^n$ or $\mathbb{Z}^n$, T = R or Z), for which we obtain the form
given in Eqs. (14.1, 14.12), but also: the lattice of convex subsets of $\mathbb{R}^n$ (here
the supremum is the convex hull of the union, but Minkowski operations are the
same as in $\mathcal{P}(\mathbb{R}^n)$), the lattice $\mathcal{F}(\mathbb{R}^n)$ of closed sets of $\mathbb{R}^n$ (here the supremum
is the closure of the union, and the Minkowski addition is the closure of the one
obtained in $\mathcal{P}(\mathbb{R}^n)$, but the Minkowski subtraction is the one of $\mathcal{P}(\mathbb{R}^n)$), upper
semi-continuous functions $\mathbb{R}^n \to \mathbb{R}$, etc.

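In the case where A = B, the operators $\varepsilon, \delta$ are $A \to A$, and can be composed
arbitrarily in any order. It is then easily checked that in a poset A we have


(14.22)  $\delta \ge \mathrm{id} \iff \delta \ge \varepsilon\delta \iff \delta\varepsilon \ge \varepsilon \iff \mathrm{id} \ge \varepsilon$


and

(14.23)  $\delta^2\varepsilon \le \mathrm{id} \iff \delta \le \varepsilon\delta \iff \delta^2 \le \delta \iff \delta\varepsilon \le \varepsilon \iff \varepsilon \le \varepsilon^2 \iff \mathrm{id} \le \varepsilon^2\delta.$


This gives then the following result, which will be used later on, in the case of
sets: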
PROPOSITION 14.18 Let A be a poset, and let $(\varepsilon, \delta)$ be an adjunction (for
$\delta, \varepsilon : A \to A$). Then the following five statements are equivalent: (a) $\delta$ is a
closing, (b) $\varepsilon$ is an opening, (c) $\delta\varepsilon = \varepsilon$, (d) $\varepsilon\delta = \delta$, (e) $\delta$ and $\varepsilon$ verify one
statement of Equation (14.22) and one statement of Equation (14.23). Then we
have

$\mathrm{Inv}(\varepsilon\delta) = \mathrm{Inv}(\delta\varepsilon) = \mathrm{Inv}(\delta) = \mathrm{Inv}(\varepsilon) = \varepsilon\delta(A) = \delta\varepsilon(A) = \delta(A) = \varepsilon(A).$

This set is both a Moore family and a dual Moore family in A; when A is a
complete lattice, it is a complete sublattice of A.


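Let us now consider dilations, erosions and adjunctions on sets. Let V and
W be two sets, and let $\rho$ be a relation between elements of V and of W. We define
$\delta_\rho : \mathcal{P}(V) \to \mathcal{P}(W)$, the dilation by $\rho$, and $\varepsilon_\rho : \mathcal{P}(W) \to \mathcal{P}(V)$, the erosion
by $\rho$, as follows:


(14.24)
$\forall X \in \mathcal{P}(V),\quad \delta_\rho(X) = \{ w \in W \mid \exists v \in X,\ v \mathrel{\rho} w \},$
$\forall Y \in \mathcal{P}(W),\quad \varepsilon_\rho(Y) = \{ v \in V \mid \forall w \in W,\ v \mathrel{\rho} w \Rightarrow w \in Y \}.$


Alternately, we can define dilation and erosion in terms of a map $N : V \to \mathcal{P}(W)$
and the dual map $\check{N} : W \to \mathcal{P}(V)$, corresponding to the relation $\rho$ by


(14.25)  $\forall v \in V,\ \forall w \in W,\quad w \in N(v) \iff v \in \check{N}(w) \iff v \mathrel{\rho} w,$
that is, $N(v) = \{ w \in W \mid v \mathrel{\rho} w \}$ and $\check{N}(w) = \{ v \in V \mid v \mathrel{\rho} w \}$.


When V = W, the set N(v) can be considered as the window or neighbourhood
of point v, and N is called a neighbourhood function or a windowing function.
Now Equation (14.24) can be written

(14.26)
$\forall X \in \mathcal{P}(V),\quad \delta_N(X) = \bigcup_{v \in X} N(v) = \{ w \in W \mid \check{N}(w) \cap X \ne \emptyset \},$
$\forall Y \in \mathcal{P}(W),\quad \varepsilon_N(Y) = \{ v \in V \mid N(v) \subseteq Y \}.$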


We have then the analogue for adjunctions of Ore’s characterization of Galois
connections on sets (Theorem 14.12):


THEOREM 14.19 Let V and W be two sets.

1 Given a map $N : V \to \mathcal{P}(W)$, $(\varepsilon_N, \delta_N)$ is an adjunction.


2 Conversely, given $\delta : \mathcal{P}(V) \to \mathcal{P}(W)$ and $\varepsilon : \mathcal{P}(W) \to \mathcal{P}(V)$ such that
$(\varepsilon, \delta)$ is an adjunction, there is a unique map $N : V \to \mathcal{P}(W)$ such that
$\delta = \delta_N$ and $\varepsilon = \varepsilon_N$; for every $v \in V$, $N(v) = \delta(\{v\})$.
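
As an illustration of Equation (14.26) and Theorem 14.19, here is a minimal Python sketch on finite sets. The neighbourhood function N is stored as a dictionary; the toy sets and all names are assumptions made for this example only. The code checks the adjunction relation (14.21) exhaustively and recovers N from the dilation of singletons.

```python
from itertools import chain, combinations

def powerset(s):
    """All subsets of s, as frozensets."""
    items = list(s)
    subsets = chain.from_iterable(
        combinations(items, k) for k in range(len(items) + 1))
    return [frozenset(c) for c in subsets]

def dilation(N, X):
    """delta_N(X): union of the neighbourhoods N(v) for v in X."""
    return frozenset().union(*(N[v] for v in X))

def erosion(N, Y):
    """eps_N(Y): the points v whose neighbourhood N(v) is included in Y."""
    return frozenset(v for v in N if N[v] <= Y)

# Hypothetical neighbourhood function N : V -> P(W); N(2) is empty on purpose.
V = {0, 1, 2}
W = {"a", "b", "c"}
N = {0: frozenset({"a"}), 1: frozenset({"a", "b"}), 2: frozenset()}

# Adjunction (14.21): delta_N(X) <= Y  iff  X <= eps_N(Y).
is_adjunction = all(
    (dilation(N, X) <= Y) == (X <= erosion(N, Y))
    for X in powerset(V) for Y in powerset(W)
)

# Theorem 14.19, item 2: N(v) is the dilation of the singleton {v}.
N_recovered = {v: dilation(N, {v}) for v in V}
```

Because N(2) is empty in this toy example, the erosion of the empty set is {2} rather than the empty set.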
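Note that $\delta_{\check{N}}$ is a dilation $\mathcal{P}(W) \to \mathcal{P}(V)$, $\varepsilon_{\check{N}}$ is an erosion $\mathcal{P}(V) \to \mathcal{P}(W)$,
$(\varepsilon_{\check{N}}, \delta_{\check{N}})$ is an adjunction, and that $\delta_{\check{N}}$ and $\varepsilon_{\check{N}}$ are dual by complementation of
$\varepsilon_N$ and $\delta_N$ respectively, as

$\forall Y \in \mathcal{P}(W),\quad \delta_{\check{N}}(Y) = V \setminus \varepsilon_N(W \setminus Y)$
and $\forall X \in \mathcal{P}(V),\quad \varepsilon_{\check{N}}(X) = W \setminus \delta_N(V \setminus X).$

In fact $\delta_{\check{N}} = \delta_{\rho^{-1}}$ and $\varepsilon_{\check{N}} = \varepsilon_{\rho^{-1}}$, where $\rho^{-1}$ is the inverse relation of $\rho$
($w \mathrel{\rho^{-1}} v \iff v \mathrel{\rho} w$).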


A classical example is given for V = W = E, with E being the Euclidean
space $\mathbb{R}^n$ or the digital space $\mathbb{Z}^n$, and the neighbourhoods being built from a
structuring element $B \subseteq E$: for every $p \in E$, $N(p) = B_p$. Then $\check{N}(p) = (\check{B})_p$
for all $p \in E$, $\delta_N = \delta_B$, $\varepsilon_N = \varepsilon_B$, $\delta_{\check{N}} = \delta_{\check{B}}$ and $\varepsilon_{\check{N}} = \varepsilon_{\check{B}}$. These operators
are translation-invariant. In fact, from Proposition 14.15, for an adjunction
$(\varepsilon_N, \delta_N)$, $\varepsilon_N$ is translation-invariant iff $\delta_N$ is translation-invariant, and in such
a case it is easily seen that they are the erosion and dilation by the structuring
element B = N(o).


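PROPOSITION 14.20 The following are equivalent:

$(\forall v \in V,\ N(v) \ne \emptyset) \iff \varepsilon_N(\emptyset) = \emptyset \iff \delta_{\check{N}}(W) = V \iff \varepsilon_N \le \delta_{\check{N}}.$

Dually, the following are equivalent:

$(\forall w \in W,\ \check{N}(w) \ne \emptyset) \iff \varepsilon_{\check{N}}(\emptyset) = \emptyset \iff \delta_N(V) = W \iff \varepsilon_{\check{N}} \le \delta_N.$


This result will intervene later on, in particular in Sec. 3.2 and Sec. 4. Note
that in the case where V = W = E ($E = \mathbb{R}^n$ or $\mathbb{Z}^n$) and $N(p) = B_p$ for all
$p \in E$, the two equivalences both reduce to $B \ne \emptyset$.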

Consider now the case where V = W = E. Here $\rho$ is a relation on E,
and both N and $\check{N}$ are $E \to \mathcal{P}(E)$. The following two results will be used in
Sec. 3.2:


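PROPOSITION 14.21 Consider a relation $\rho$ on a set E, and the corresponding
maps $N, \check{N} : E \to \mathcal{P}(E)$. Then:


1 The following five statements are equivalent: (a) $\rho$ is reflexive, (b) $\delta_N$
is extensive, (c) $\varepsilon_N$ is anti-extensive, (d) $\delta_{\check{N}}$ is extensive, (e) $\varepsilon_{\check{N}}$ is
anti-extensive.


2 The following five statements are equivalent: (a) $\rho$ is symmetrical, (b)
$\varepsilon_{\check{N}}\delta_N$ is extensive, (c) $\delta_N\varepsilon_{\check{N}}$ is anti-extensive, (d) $\varepsilon_N\delta_{\check{N}}$ is extensive, (e)
$\delta_{\check{N}}\varepsilon_N$ is anti-extensive.


3 The following five statements are equivalent: (a) $\rho$ is transitive, (b)
$\delta_N^2 \le \delta_N$, (c) $\varepsilon_N^2 \ge \varepsilon_N$, (d) $\delta_{\check{N}}^2 \le \delta_{\check{N}}$, (e) $\varepsilon_{\check{N}}^2 \ge \varepsilon_{\check{N}}$.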
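Combining items 1 and 3 with Proposition 14.18, we deduce:


PROPOSITION 14.22 Consider a relation $\rho$ on E, and the corresponding
maps $N, \check{N} : E \to \mathcal{P}(E)$. Then the following nine statements are equivalent:
(a) $\rho$ is reflexive and transitive, (b) $\delta_N$ is a closing, (c) $\varepsilon_N$ is an opening,
(d) $\delta_N\varepsilon_N = \varepsilon_N$, (e) $\varepsilon_N\delta_N = \delta_N$, (f) $\delta_{\check{N}}$ is a closing, (g) $\varepsilon_{\check{N}}$ is an opening,
(h) $\delta_{\check{N}}\varepsilon_{\check{N}} = \varepsilon_{\check{N}}$, (i) $\varepsilon_{\check{N}}\delta_{\check{N}} = \delta_{\check{N}}$. We have then

$\mathrm{Inv}(\varepsilon_N\delta_N) = \mathrm{Inv}(\delta_N\varepsilon_N) = \mathrm{Inv}(\delta_N) = \mathrm{Inv}(\varepsilon_N)$
$= \{ \varepsilon_N\delta_N(Z) \mid Z \in \mathcal{P}(E) \} = \{ \delta_N\varepsilon_N(Z) \mid Z \in \mathcal{P}(E) \}$
$= \{ \delta_N(Z) \mid Z \in \mathcal{P}(E) \} = \{ \varepsilon_N(Z) \mid Z \in \mathcal{P}(E) \},$

and the same with $\check{N}$ in place of N. The two families $\mathrm{Inv}(\varepsilon_N\delta_N) = \mathrm{Inv}(\delta_N\varepsilon_N)$
and $\mathrm{Inv}(\varepsilon_{\check{N}}\delta_{\check{N}}) = \mathrm{Inv}(\delta_{\check{N}}\varepsilon_{\check{N}})$ are closed under arbitrary union and
intersection, and contain E and $\emptyset$ (in other words they are complete sublattices
of $(\mathcal{P}(E), \subseteq)$).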


An Alexandroff topology (Alexandroff, 1937; Alexandroff and Hopf, 1935)
is a topological space $(E, \mathcal{G})$ where the family $\mathcal{G}$ of open sets is closed under
arbitrary intersection; in other words $\mathcal{G}$ is a complete sublattice of $(\mathcal{P}(E), \subseteq)$.
It is equivalent to require that every point of E has a least open neighbourhood.
By the Alexandroff specialization theorem (Alexandroff, 1956), there is a
one-to-one correspondence between Alexandroff topologies on E and reflexive and
transitive relations on E; in fact, for $x, y \in E$, $x \mathrel{\rho} y$ iff x is in the closure
of {y}, i.e., iff y belongs to the least neighbourhood of x. It follows then that
for $x \in E$, N(x) is the least neighbourhood of x and $\check{N}(x)$ is the topological
closure of {x}, while for $X \in \mathcal{P}(E)$, $\delta_N(X)$ is the least open set containing
X (called the star of X), $\varepsilon_N(X)$ is the topological interior of X, $\delta_{\check{N}}(X)$ is the
topological closure of X, and $\varepsilon_{\check{N}}(X)$ is the greatest closed subset of X. Note
that $\mathrm{Inv}(\varepsilon_N\delta_N) = \mathrm{Inv}(\delta_N\varepsilon_N)$ is the family of open sets and $\mathrm{Inv}(\varepsilon_{\check{N}}\delta_{\check{N}}) =
\mathrm{Inv}(\delta_{\check{N}}\varepsilon_{\check{N}})$ is the family of closed sets.
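
To make the correspondence concrete, the following Python sketch computes the star, interior and closure operators for one small preorder. The choice of E = {1, 2, 3, 6} ordered by divisibility is a hypothetical example introduced here, as are all function names; the open sets are obtained as the invariants of the dilation by N.

```python
from itertools import chain, combinations

# Hypothetical example: E = {1, 2, 3, 6} with x rho y iff x divides y
# (a reflexive and transitive relation).
E = frozenset({1, 2, 3, 6})
N = {x: frozenset(y for y in E if y % x == 0) for x in E}       # N(x)
N_dual = {y: frozenset(x for x in E if y % x == 0) for y in E}  # dual map

def dilation(nbh, X):
    """Union of the neighbourhoods nbh(v) for v in X."""
    return frozenset().union(*(nbh[v] for v in X))

def erosion(nbh, Y):
    """Points whose neighbourhood nbh(v) is included in Y."""
    return frozenset(v for v in nbh if nbh[v] <= Y)

def star(X):      # least open set containing X
    return dilation(N, X)

def interior(X):  # topological interior of X
    return erosion(N, X)

def closure(X):   # topological closure of X
    return dilation(N_dual, X)

subsets = [frozenset(c) for c in chain.from_iterable(
    combinations(sorted(E), k) for k in range(len(E) + 1))]

# Open sets are the invariants of the dilation by N; closed sets are their
# complements, i.e. the invariants of the dilation by the dual map.
open_sets = {X for X in subsets if star(X) == X}
closed_sets = {X for X in subsets if closure(X) == X}
```

Here the open sets are the subsets closed under taking multiples within E, and the least neighbourhood of 2 is {2, 6}.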

We saw in Sec. 2.2 that a closing $\varphi$ on $\mathcal{P}(E)$ is the closure operator in a
topology on E iff it satisfies the following two additional constraints: $\varphi(\emptyset) = \emptyset$
and $\varphi(X_1 \cup X_2) = \varphi(X_1) \cup \varphi(X_2)$ for all $X_1, X_2 \in \mathcal{P}(E)$; we have then
$\varphi(X_1 \cup \cdots \cup X_n) = \varphi(X_1) \cup \cdots \cup \varphi(X_n)$ for all $X_1, \ldots, X_n \in \mathcal{P}(E)$. In other
words the commutation with the union operation, $\varphi(\bigcup_{i \in I} X_i) = \bigcup_{i \in I} \varphi(X_i)$,
is verified for I being empty or finite. This is weaker than $\varphi$ being a dilation,
where this identity is verified also for an infinite family I; but then the set of
closed sets $\varphi(X)$ is closed under infinite unions, which means indeed that we
have an Alexandroff topology.


2.4 Morphological filters 


The word “filter” is used in several scientific and technological contexts, with 
various meanings. In image processing, one knows the linear filters, namely 
convolution operators, in particular the bandpass filter from signal processing,
which preserves all frequencies within a band, and eliminates all others. In 
non-linear image processing, the well-known median filter has been used to 
remove impulsive noise, without the blurring effect of linear smoothing filters. 
The morphological approach to filtering is similar to that of signal processing, 
namely preserving some parts of an image and eliminating some others, ex- 
cept that the separation of these parts is not based on frequencies. The model 
proposed is that of an ideal filter, i.e., one that keeps the wanted components un- 
altered, and eliminates completely the unwanted ones. In order to characterize 
an ideal filter, rather than describing the features to be preserved or removed, 
one takes an algebraic point of view: if the filter does not alter the wanted parts 
and eliminates completely the unwanted ones, then applying the filter a second 
time will not change anything. Hence the main characteristic of an ideal filter 
is its idempotence. This is important from a theoretical point of view, but also 
for practical applications: if after applying the filter on an image the result is 
not satisfying, then we know that another filter must be applied. This contrasts 
with the behaviour of the median filter: after one application, some noise re- 
mains, that could be eliminated by a second or third application; then one can 
repeat the application of the filter, without guarantee that this will lead to a 
stable final result, as the median filter can produce oscillations (Serra, 1988). 
This is related to the fact that one cannot characterize precisely what are the 
features preserved or eliminated by this filter. 
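
The oscillating behaviour of the median filter can be seen on a very small example. The sketch below is an illustration chosen here, not taken from Serra, 1988: it applies a 3-point median with cyclic boundary conditions (an assumption made to keep the example short) to an alternating binary signal. Each application swaps the zeros and ones, so the filter is not idempotent and the iteration never stabilizes.

```python
def cyclic_median3(signal):
    """3-point median filter on a list, with cyclic (wrap-around) borders."""
    n = len(signal)
    return [
        sorted((signal[i - 1], signal[i], signal[(i + 1) % n]))[1]
        for i in range(n)
    ]

x = [0, 1] * 4               # alternating signal 0, 1, 0, 1, ...
once = cyclic_median3(x)     # every sample takes the value of its two neighbours
twice = cyclic_median3(once)
```

After one pass the signal is inverted; after two passes it is back to the original.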

Besides idempotence, mathematical morphology demands that the behaviour 
of a filter should be related to the order and complete lattice structure of the 
family of images. Therefore one calls a morphological filter (or simply, a filter) 
an increasing and idempotent operator on a poset (or complete lattice). Write 
Filt(L) for the set of filters on L. We have already encountered some filters: 
openings and closings. There are many other ones, and we will describe here 
some techniques for constructing them. This requires some terminology: 


DEFINITION 14.23 Let L be a poset and $\psi$ an operator on L. We say that:
1 $\psi$ is underpotent if $\psi^2 \le \psi$.
2 $\psi$ is overpotent if $\psi^2 \ge \psi$.
3 $\psi$ is an underfilter if $\psi$ is increasing and underpotent.
4 $\psi$ is an overfilter if $\psi$ is increasing and overpotent.


We saw in Proposition 14.6 that in a complete lattice, the set of openings is a 
dual Moore family and the set of closings is a Moore family. They constitute 
thus two complete lattices. We have a similar result for filters (Serra, 1988): 


PROPOSITION 14.24 Let L be a complete lattice.
1 The set of overfilters on L is a dual Moore family in $L^L$, i.e. it is closed
under the supremum operation.


2 The set of underfilters on L is a Moore family in $L^L$, i.e. it is closed under
the infimum operation.


3 The set Filt(L) of filters on L is a complete lattice. For any family $\psi_i$
($i \in I$) of filters, their supremum in Filt(L) is the least underfilter $\psi$ such
that $\psi \ge \bigvee_{i \in I} \psi_i$, and their infimum in Filt(L) is the greatest overfilter
$\psi$ such that $\psi \le \bigwedge_{i \in I} \psi_i$.


This gives a first method for constructing a filter from a family of filters. The 
second one arises from composition (Serra, 1988): 


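PROPOSITION 14.25 Let L be a complete lattice and let $\xi$ and $\psi$ be two filters
on L such that $\xi \ge \psi$. Then:


1 The only operators that can be obtained by repeated compositions of $\psi$
and $\xi$ are $\psi\xi$, $\xi\psi$, $\psi\xi\psi$ and $\xi\psi\xi$. They are all filters and

$\xi \ge \xi\psi\xi \ge \{ \xi\psi,\ \psi\xi \} \ge \psi\xi\psi \ge \psi.$


2 $\mathrm{Inv}(\xi) \cap \mathrm{Inv}(\psi) \subseteq \mathrm{Inv}(\xi\psi\xi) = \mathrm{Inv}(\xi\psi) \subseteq \mathrm{Inv}(\xi)$ and
$\mathrm{Inv}(\xi) \cap \mathrm{Inv}(\psi) \subseteq \mathrm{Inv}(\psi\xi\psi) = \mathrm{Inv}(\psi\xi) \subseteq \mathrm{Inv}(\psi)$.


3 In Filt(L), the supremum and infimum of $\xi\psi$ and $\psi\xi$ are $\xi\psi\xi$ and $\psi\xi\psi$
respectively.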


Note that items 1 and 2 do not require L to be a complete lattice; they are
valid in any poset. A classical example is when $\xi$ is a closing and $\psi$ is an
opening: the opening filters out positive noise, the closing filters out negative
noise, so the composition of the two should filter out both types of noise (cf.
Sec. 1.1).

The above result is at the basis of a well-known filter introduced in the
1980s, the alternate sequential filter. Suppose that we have an image where
features of foreground and background are imbricated. To extract an object of
a given size, it is necessary to filter its holes at a smaller size, and this requires
filtering objects at an even smaller size, etc. Thus we will apply openings
and closings at increasing scales in order to simplify the image. Consider n
openings $\gamma_1, \ldots, \gamma_n$ such that $\gamma_n \le \cdots \le \gamma_1$ and n closings $\varphi_1, \ldots, \varphi_n$
such that $\varphi_1 \le \cdots \le \varphi_n$. From the previous proposition, the compositions
$\mu_i = \gamma_i\varphi_i$, $\nu_i = \varphi_i\gamma_i$, $\rho_i = \varphi_i\gamma_i\varphi_i$ and $\sigma_i = \gamma_i\varphi_i\gamma_i$ are filters. Alternate
sequential filters are then defined as:


(14.27)
$\mu_i\mu_{i-1}\cdots\mu_2\mu_1 = (\gamma_i\varphi_i)(\gamma_{i-1}\varphi_{i-1})\cdots(\gamma_2\varphi_2)(\gamma_1\varphi_1),$
$\nu_i\nu_{i-1}\cdots\nu_2\nu_1 = (\varphi_i\gamma_i)(\varphi_{i-1}\gamma_{i-1})\cdots(\varphi_2\gamma_2)(\varphi_1\gamma_1),$
or as the following variants:


(14.28)
$\rho_i\rho_{i-1}\cdots\rho_2\rho_1 = (\varphi_i\gamma_i\varphi_i)(\varphi_{i-1}\gamma_{i-1}\varphi_{i-1})\cdots(\varphi_2\gamma_2\varphi_2)(\varphi_1\gamma_1\varphi_1) = \varphi_i\mu_i\mu_{i-1}\cdots\mu_2\mu_1,$
$\sigma_i\sigma_{i-1}\cdots\sigma_2\sigma_1 = (\gamma_i\varphi_i\gamma_i)(\gamma_{i-1}\varphi_{i-1}\gamma_{i-1})\cdots(\gamma_2\varphi_2\gamma_2)(\gamma_1\varphi_1\gamma_1) = \gamma_i\nu_i\nu_{i-1}\cdots\nu_2\nu_1,$

for i = 1, ..., n. They are all filters. They are useful for filtering images
where grains (bright zones) are imbricated with pores (dark zones) at all sizes.
Typically, the $\gamma_i$'s and $\varphi_i$'s can be:


■ openings and closings by structuring elements of increasing sizes;


■ openings and closings by reconstruction, based on structuring elements
of increasing sizes;


■ area openings and closings (removing grains and pores on the basis of
their area), with increasing area thresholds;


hence as i increases, the alternating sequential filters will progressively remove
grains and pores of increasing sizes, thus simplifying the image. (We will
discuss further the notion of removing “features of increasing sizes” in Sec. 2.5.)
An example is provided in Fig. 14.9 and 14.10.
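
A one-dimensional toy version of an alternate sequential filter can be sketched in Python as follows. This is an illustration under assumptions made here, not the procedure used for Figs. 14.9 and 14.10: the domain is the finite set {0, ..., n-1}, the structuring elements are segments of r consecutive points clipped to the domain, and each step applies an opening followed by a closing, i.e. a composition of closing-after-opening filters taken at increasing sizes. All names are hypothetical.

```python
def segment_neighbourhood(n, r):
    """N_r(p) = {p, ..., p+r-1}, clipped to the domain {0, ..., n-1}."""
    return {p: frozenset(range(p, min(p + r, n))) for p in range(n)}

def dilation(N, X):
    """Union of the neighbourhoods N(v) for v in X."""
    return frozenset().union(*(N[v] for v in X))

def erosion(N, Y):
    """Points whose neighbourhood N(v) is included in Y."""
    return frozenset(v for v in N if N[v] <= Y)

def opening(N, X):
    """Erosion followed by dilation (anti-extensive, removes small grains)."""
    return dilation(N, erosion(N, X))

def closing(N, X):
    """Dilation followed by erosion (extensive, fills small pores)."""
    return erosion(N, dilation(N, X))

def asf(X, n, sizes):
    """Apply closing-after-opening at each size, smallest size first."""
    for r in sizes:
        N = segment_neighbourhood(n, r)
        X = closing(N, opening(N, X))
    return X

n = 16
# Two grains separated by a one-point pore at 6, plus an isolated point at 13.
X = frozenset({2, 3, 4, 5, 7, 8, 9, 10, 13})
Y = asf(X, n, sizes=[2, 3])
```

On this input the isolated point is removed by the first opening and the pore is filled by the first closing, so Y is the segment {2, ..., 10}; applying the filter again leaves Y unchanged, as expected from a filter.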

Schonfeld and Goutsias, 1991 noticed that besides Eqs. (14.27,14.28), any
composition of openings $\gamma_i$ and closings $\varphi_i$, in any order, is a filter. Their
argument was generalized by Heijmans, 1997 as follows:


PROPOSITION 14.26 Let $\psi_1, \ldots, \psi_n$ be overfilters and $\xi_1, \ldots, \xi_n$ be
underfilters such that

$\psi_n \le \cdots \le \psi_1 \le \xi_1 \le \cdots \le \xi_n.$

Then any composition of these operators, containing at least one $\psi_i$ and one
$\xi_j$, is a filter.





A consequence is the following surprising result: 


PROPOSITION 14.27 Let $(\varepsilon, \delta)$ be an adjunction in a poset L. Then any
repeated composition of $\varepsilon$ and $\delta$ in any order, containing the same number of
instances of $\varepsilon$ and of $\delta$, is a filter.


More precisely, an operator of the form $\psi_1 \cdots \psi_{2n}$, where for each
$i = 1, \ldots, 2n$ we have $\psi_i \in \{\delta, \varepsilon\}$, and
$\mathrm{card}\{ i = 1, \ldots, 2n \mid \psi_i = \delta \} = \mathrm{card}\{ i = 1, \ldots, 2n \mid \psi_i = \varepsilon \}$, is a filter.

For more results on filters, the reader is referred to Heijmans, 1994, Heijmans, 
1997, Ronse and Heijmans, 1991, Serra, 1988 and Soille, 2003. 
Figure 14.9. Original image (a) and three steps (b, c, d) of an alternate sequential filter based
on opening-closing by reconstruction using a hexagon as structuring element.


2.5 Granulometries and size distributions 


As openings remove parts of an object (they are anti-extensive), one can
compare two openings $\gamma_1$ and $\gamma_2$ in such terms; thus we say that $\gamma_2$ is more
active than $\gamma_1$ if $\gamma_2$ removes from any object more than $\gamma_1$ does, in other words
if $\gamma_2 \le \gamma_1$. On the other hand, as closings add parts to an object, given two
closings $\varphi_1$ and $\varphi_2$, we say that $\varphi_2$ is more active than $\varphi_1$ if $\varphi_2$ adds to an object
more than $\varphi_1$ does, in other words if $\varphi_2 \ge \varphi_1$.

In the case of the complete lattice $\mathcal{P}(E)$, given two structuring elements
$A, B \in \mathcal{P}(E)$, we define the relation $\sqsupseteq$ by $B \sqsupseteq A$ iff B is a union of translates
of A. Readily, by Eqs. (14.1,14.4) we have

$B \sqsupseteq A \iff B \circ A = B \iff (\exists C \in \mathcal{P}(E),\ B = A \oplus C).$

For example, given b > a, this is true if A and B are squares of size a and
b respectively, or (for $E = \mathbb{R}^n$) if A and B are closed balls of radii a and
b respectively. Now by Equation (14.4), the openings $\gamma_A$, $\gamma_B$ and closings
$\varphi_A$, $\varphi_B$ verify:

$\gamma_B \le \gamma_A \iff \varphi_B \ge \varphi_A \iff B \sqsupseteq A.$

In other words, the “greater” the structuring element (for $\sqsupseteq$), the more active
the opening and closing.

Figure 14.10. Steps 1 (a) and 3 (b) of the same filter as in Fig. 14.9 but using segments in
different directions as structuring elements.

The above suggests that the activity of openings and closings is governed by 
the size of the structuring elements that they use. We see below that it can be 
characterized in another way: 


PROPOSITION 14.28 Let $\psi_1$ and $\psi_2$ be either two openings, or two closings,
on a poset L. Then the following four statements are equivalent:


1 $\psi_2$ is more active than $\psi_1$.
2 $\mathrm{Inv}(\psi_2) \subseteq \mathrm{Inv}(\psi_1)$.
3 $\psi_2\psi_1 = \psi_2$.
4 $\psi_1\psi_2 = \psi_2$.

The second item indicates that the activity increases as the domain of
invariance decreases. The last two suggest a notion of a filtering absorption order
associated to activity: if $\psi_2$ is more active than $\psi_1$, then as a filter $\psi_2$ is more
severe, so $\psi_1$ does not improve in any way upon the result of $\psi_2$, whether
applied before or after it; thus $\psi_2$ absorbs $\psi_1$. We can now consider an ordered
sequence of openings:


DEFINITION 14.29 A granulometry (on a poset L) is a family of operators $\gamma_r$
($r \in R \subseteq \mathbb{R}^+$) such that:


1 $\forall r \in R$, $\gamma_r$ is anti-extensive;
2 $\forall r \in R$, $\gamma_r$ is increasing;
3 $\forall r, s \in R$, $\gamma_r\gamma_s = \gamma_s\gamma_r = \gamma_{\max(r,s)}$.


Applying item 3 with r = s, $\gamma_r$ is idempotent, so it is an opening. In fact,
$\gamma_r$ ($r \in R$) is a granulometry iff $\gamma_r$ is an opening for every r, and $\gamma_r$ decreases
(becomes more active) as the parameter r increases: $\forall r, s \in R$, r > s implies
$\gamma_r \le \gamma_s$, or equivalently $\mathrm{Inv}(\gamma_r) \subseteq \mathrm{Inv}(\gamma_s)$.

For binary images in a digital framework ($L = \mathcal{P}(\mathbb{Z}^n)$), we take $R =
\{2, \ldots, r_{\max}\}$ and $\gamma_r$ to be the opening by a structuring element $B_r$
corresponding to size r (say, an r × r square). Then for a set $X \subseteq \mathbb{Z}^n$, it is interesting
to measure the area (number of pixels) of $\gamma_r(X)$ for all r; this gives a decreasing
function $R \to \mathbb{N}$, the granulometry curve of X, which displays the area of the
objects according to the size of the opening. Positions where this curve decreases
sharply indicate that there are substantial parts of X having the corresponding
width. This is illustrated in Fig. 14.11.
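
As a minimal sketch of a granulometry curve, the Python code below works in one dimension, with openings by segments of r consecutive points on the finite domain {0, ..., n-1}. The domain, the segment structuring elements, the size range starting at r = 1 and all names are assumptions made for this illustration. The set X consists of three runs of widths 3, 5 and 1.

```python
def segment_neighbourhood(n, r):
    """N_r(p) = {p, ..., p+r-1}, clipped to the domain {0, ..., n-1}."""
    return {p: frozenset(range(p, min(p + r, n))) for p in range(n)}

def opening(N, X):
    """Opening by N: erosion followed by dilation."""
    eroded = frozenset(v for v in N if N[v] <= X)
    return frozenset().union(*(N[v] for v in eroded))

def granulometry_curve(X, n, sizes):
    """Number of points remaining after the opening of each size."""
    return [len(opening(segment_neighbourhood(n, r), X)) for r in sizes]

n = 16
X = frozenset({0, 1, 2, 5, 6, 7, 8, 9, 12})   # runs of width 3, 5 and 1
curve = granulometry_curve(X, n, range(1, 7))
```

The resulting curve is [9, 8, 8, 5, 5, 0]: it drops at r = 2, r = 4 and r = 6, that is, just above the widths 1, 3 and 5 of the three runs.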

One defines similarly an anti-granulometry by replacing, in item 1 of
Definition 14.29, “anti-extensive” by “extensive”. Then $\varphi_r$ ($r \in R$) is an
anti-granulometry iff $\varphi_r$ is a closing for every r, and $\varphi_r$ increases (becomes more
active) as the parameter r increases: $\forall r, s \in R$, r > s implies $\varphi_r \ge \varphi_s$,
or equivalently $\mathrm{Inv}(\varphi_r) \subseteq \mathrm{Inv}(\varphi_s)$. In a digital framework, we define the
anti-granulometry curve, which gives an indication of the width of the holes
of the set.

It is possible to combine a granulometry $\gamma_r$ ($r \in R_1 \subseteq \mathbb{R}^+$) and an
anti-granulometry $\varphi_r$ ($r \in R_2 \subseteq \mathbb{R}^+$) into a two-sided sequence $\psi_r$, $r \in R =
R_1 \cup \{0\} \cup (-R_2)$, by setting $\psi_r = \gamma_r$ for $r \in R_1$, $\psi_0 = \mathrm{id}$, and $\psi_{-r} = \varphi_r$ for
$r \in R_2$. Then the axioms are: (1) $\psi_r$ is anti-extensive for r > 0 but extensive
for r < 0, (2) $\psi_r$ is increasing, (3) $\psi_r\psi_s = \psi_s\psi_r = \psi_{m(r,s)}$ for r, s having the
same sign, where m(r, s) = max(r, s) for $r, s \ge 0$, but m(r, s) = min(r, s)
for $r, s \le 0$ (we have no such identity for r > 0 and s < 0). We generalize
then the granulometry curve into a function $R \to \mathbb{N}$ where the parts r < 0 and
r > 0 deal with the sizes of holes and grains respectively.
Figure 14.11. A binary image of coffee beans, and its granulometry curve, showing a sharp
decrease between 10 and 15; this indicates that most beans have such a width.


3. Related approaches 


As we said in the Introduction, there are other fields of research which are
based on the same lattice-theoretical foundations as mathematical morphology.
We present here three of them, which we think are relevant to the logic of spatial
relations: formal concept analysis, rough sets and fuzzy sets. Roughness can
be represented by using mathematical morphology operators to define upper
and lower approximations in the framework of rough sets (Sec. 3.2). We then
show why mathematical morphology can be considered as a spatial reasoning
tool (Sec. 3.3), with its two components: spatial knowledge representation and
reasoning. As for the first one, we present in Sec. 3.4 an extension of
mathematical morphology to fuzzy sets, which leads to an extended representation
power coping with spatial imprecision. Modeling spatial relationships based
on fuzzy morphology allows reasoning under imprecision and with structural
spatial information, as shown in Sec. 3.5. This reasoning component will be
further explored in Sec. 4.


3.1 Formal Concept Analysis 


Formal concept analysis is a lattice-based theory (Ganter and Wille, 1999) of
relations between objects and features. It can be applied to spatial objects and
spatial relations like visibility, enclosure, etc.

Let $\Omega$ be a set of objects, $\Pi$ a set of properties, and ~ a relation between $\Omega$
and $\Pi$, where o ~ p means that object o has property p. The triple $(\Omega, \Pi, \sim)$ is
called a context. Following Theorem 14.12 we define the two maps


$\omega : \mathcal{P}(\Pi) \to \mathcal{P}(\Omega) : P \mapsto \{ o \in \Omega \mid \forall p \in P,\ o \sim p \},$
$\pi : \mathcal{P}(\Omega) \to \mathcal{P}(\Pi) : O \mapsto \{ p \in \Pi \mid \forall o \in O,\ o \sim p \}.$


Thus $\omega(P)$ is the set of objects sharing all properties in P, while $\pi(O)$ is the
set of properties shared by all objects in O. A concept is a pair (O, P), where
$O \in \mathcal{P}(\Omega)$ and $P \in \mathcal{P}(\Pi)$, such that $O = \omega(P)$ and $P = \pi(O)$; O is the extent
of the concept and P is the intent of the concept.
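
The concepts of a small context can be enumerated directly from these two maps. The following Python sketch is an illustration only: the context (three shapes and three properties) and all names are hypothetical. It uses the fact that for any set O of objects, the pair formed by the objects sharing all properties common to O, together with those common properties, is a concept.

```python
from itertools import chain, combinations

# Hypothetical context: objects, properties and the relation "o has p".
objects = {"square", "rectangle", "circle"}
properties = {"four_sides", "equal_sides", "round"}
has = {
    ("square", "four_sides"), ("square", "equal_sides"),
    ("rectangle", "four_sides"),
    ("circle", "round"),
}

def omega(P):
    """Objects sharing all properties in P."""
    return frozenset(o for o in objects if all((o, p) in has for p in P))

def pi(O):
    """Properties shared by all objects in O."""
    return frozenset(p for p in properties if all((o, p) in has for o in O))

def concepts():
    """All pairs (extent, intent), obtained by closing every set of objects."""
    subsets = chain.from_iterable(
        combinations(sorted(objects), k) for k in range(len(objects) + 1))
    return {(omega(pi(O)), pi(O)) for O in subsets}

all_concepts = concepts()
```

This context has five concepts; for instance the extent {square, rectangle} has intent {four_sides}.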

The set of concepts is ordered by inclusion of object sets, or equivalently by
the inverse inclusion on property sets:

$(O_1, P_1) \le (O_2, P_2) \iff O_1 \subseteq O_2 \iff P_1 \supseteq P_2.$

It forms then a complete lattice with the following supremum and infimum
operations:

$\bigvee_{i \in I} (O_i, P_i) = \Bigl( \omega\pi\Bigl[\bigcup_{i \in I} O_i\Bigr],\ \bigcap_{i \in I} P_i \Bigr),$
$\bigwedge_{i \in I} (O_i, P_i) = \Bigl( \bigcap_{i \in I} O_i,\ \pi\omega\Bigl[\bigcup_{i \in I} P_i\Bigr] \Bigr).$

Here we used the complete lattice structure of the two Moore families of
extents (possible $O_i$’s) and of intents (possible $P_i$’s), cf. Proposition 14.9 and
Theorem 14.5.

A possible example of application of formal concept analysis is to consider
two subsets S and V of the space $\mathbb{R}^n$ or $\mathbb{Z}^n$, where S represents an object to
be visually examined, and V is a set of viewpoints. We define a relation ~
between the boundary $\partial S$ of S and V, namely s ~ v if s is visible from v, i.e.,
the open segment ]s, v[ is disjoint from S. Then a concept is given by a pair
(T, W), where $T \subseteq \partial S$ and $W \subseteq V$, such that T is the set of positions visible
from all viewpoints in W, and W is the set of all viewpoints from which T is
entirely visible.


3.2 Mathematical morphology and rough sets 


Rough set theory has been introduced by Pawlak, 1982, as an extension of 
set theory, mainly in the domain of intelligent systems. The objective was to 
deal with incomplete information, leading to the idea of indistinguishability 
of objects in a set. It is therefore related to the concept of approximation, 
and of granularity of information (in the sense of Zadeh, 1979). This theory 
was applied successfully in several applications, e.g. information analysis, data 
analysis and data mining, knowledge discovery (for instance discovery of which 
features are relevant for data description), i.e., all applications for which a need 
arises for intelligent decision support. Let us mention in particular the works 
of Lin, 1995, Lin and Liu, 1994, Yao, 1998 and Yao and Lin, 1996. There have
also been studies towards a fuzzy approach to rough sets (Dubois and Prade, 
1990), and on their relations with logic (Orlowska, 1993; Pawlak, 1987). 

In this framework, a set X is approximated by two sets, called upper and
lower approximations, and denoted by $\overline{A}(X)$ and $\underline{A}(X)$, such that $\underline{A}(X) \subseteq
X \subseteq \overline{A}(X)$. It is interesting to investigate the algebraic properties of the two set
operators $\overline{A}$ and $\underline{A}$: the first one is extensive, the second one is anti-extensive,
and probably they should be increasing. But then, are they respectively a
dilation and an erosion? In particular, do they constitute an adjunction (arising
from a reflexive relation, cf. Proposition 14.21)? Or are they a closing and
an opening? Or is $\overline{A}$ both a closing and a dilation, and $\underline{A}$ both an opening
and an erosion (cf. Proposition 14.22)? Are they dual by complementation?
Surprisingly, there have not been many studies on the relation between rough
sets and MM; let us mention a few of them. On the one hand Polkowski,
1998 built a hit-or-miss (Fell) topology on rough sets, similar to the one used
in MM for closed sets (Heijmans, 1994; Matheron, 1975; Ronse and Tajine,
2004; Serra, 1982). On the other hand Bloch, 2000b studied the algebraic
properties of the upper and lower approximation operators, and established an
analogy between them and the classical morphological operators on Euclidean
or digital sets, namely dilation, erosion, opening and closing by a structuring
element. Also Düntsch and Gediga, 2003 considered the algebraic aspects of
rough sets, in particular their links with Galois connections.

Here we will investigate rough sets in light of the theory of adjunctions on 
sets; in some sense, this is a generalization of Bloch, 2000b. At the same time 
we will address topological aspects. But let us first recall the basic definitions 
of rough sets, in particular those based on a similarity relation. 

In rough set theory (Pawlak, 1982), the two approximations $\overline{A}(X)$ and $\underline{A}(X)$ 
such that $\underline{A}(X) \subseteq X \subseteq \overline{A}(X)$ are defined from an equivalence relation. Let $U$ 
denote the universe of discourse ($X$ being a subset of $U$). We consider attributes 
which are functions defined on $U$, and write $A$ for the set of attributes. To each 
$x \in U$ we associate an information vector $\mathrm{Inf}(x)$, which is the set of attributes 
associated to $x$. We define an equivalence relation $R_A$ (with respect to the set 
$A$ of attributes on $U$) by the equality of the information vector: 

$$x \, R_A \, y \iff \mathrm{Inf}(x) = \mathrm{Inf}(y) .$$

Assuming that each element of $U$ is known only through its attributes, $x \, R_A \, y$ 
means that $x$ and $y$ are indistinguishable on the basis of available information. 


The pair $(U, R_A)$ is called an approximation space. For $x \in U$, let $[x]_A$ denote 
the equivalence class of $x$ under $R_A$: 

$$[x]_A = \{ y \in U \mid x \, R_A \, y \} .$$

Then upper and lower approximations of a subset $X$ of $U$ are defined as: 

$$\overline{A}(X) = \{ x \in U \mid [x]_A \cap X \neq \emptyset \} , \qquad \underline{A}(X) = \{ x \in U \mid [x]_A \subseteq X \} . \tag{14.29}$$


The lower approximation of $X$ contains all points of $U$ that are distinguishable 
from every element of $X^c$, while its upper approximation contains all points 
of $U$ that are indistinguishable from some element of $X$. We call a rough set 
a pair $(\underline{A}(X), \overline{A}(X))$. 

Let us refer to the terminology used for dilations and erosions on sets: if 
$R_A$ stands for $\rho$, then by Eqs. (14.24,14.25,14.26) we have $[x]_A = N(x)$, 
$\underline{A} = \varepsilon_N = \varepsilon_\rho$, and $\overline{A} = \delta_{\tilde N} = \delta_{\rho^{-1}}$. Clearly $\overline{A}$ and $\underline{A}$ are dual under 
complementation: $\overline{A}(X) = [\underline{A}(X^c)]^c$. The fact that $R_A$ is symmetrical ($R_A = R_A^{-1}$) 
means that $N = \tilde N$, so $(\underline{A}, \overline{A})$ forms an adjunction. Now as $R_A$ is 
reflexive and transitive, by Proposition 14.22, the erosion $\underline{A}$ is also an opening, 
while the dilation $\overline{A}$ is also a closing, with $\overline{A}\,\underline{A} = \underline{A}$ and $\underline{A}\,\overline{A} = \overline{A}$. In particular, 
we have 

$$\forall X \in \mathcal{P}(U), \quad \underline{A}(X) \subseteq X \subseteq \overline{A}(X) .$$

By Proposition 14.22 again, $\varepsilon_N(\mathcal{P}(U)) = \delta_N(\mathcal{P}(U))$, i.e., the families of lower 
and upper approximations coincide: 

$$\{ \underline{A}(X) \mid X \in \mathcal{P}(U) \} = \{ \overline{A}(X) \mid X \in \mathcal{P}(U) \} ;$$

in fact this family consists of all sets which are unions of equivalence classes 
$[x]_A$. With the topological interpretation given after Proposition 14.22, it 
constitutes an Alexandroff topology on $U$, where open sets coincide with closed 
ones, the upper approximation $\overline{A}(X)$ is the closure and the star (least open 
superset) of $X$, while the lower approximation $\underline{A}(X)$ is the interior and the 
greatest closed subset of $X$. 
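
As a small illustration of Equation (14.29), the following sketch computes both approximations on a toy universe. The universe, the attribute table `INFO` and the function names are hypothetical choices made for this example only; they are not taken from the references cited above.

```python
def equivalence_class(x, universe, info):
    """Return [x]_A: all y with the same information vector as x."""
    return frozenset(y for y in universe if info[y] == info[x])


def lower_approx(X, universe, info):
    """Points whose whole equivalence class lies inside X."""
    return {x for x in universe if equivalence_class(x, universe, info) <= X}


def upper_approx(X, universe, info):
    """Points whose equivalence class meets X."""
    return {x for x in universe if equivalence_class(x, universe, info) & X}


# Hypothetical universe: six objects described by (colour, size).
U = {1, 2, 3, 4, 5, 6}
INFO = {1: ("red", "small"), 2: ("red", "small"), 3: ("blue", "small"),
        4: ("blue", "large"), 5: ("blue", "large"), 6: ("red", "large")}
X = {1, 2, 3, 4}

low = lower_approx(X, U, INFO)   # union of the classes included in X
up = upper_approx(X, U, INFO)    # union of the classes meeting X
```

Here the classes are {1, 2}, {3}, {4, 5} and {6}, so the lower approximation drops the point 4 (its class leaks out of X) while the upper approximation adds the point 5. Duality under complementation and the identities between the two operators can be checked directly on this example.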

This definition can be extended to any relation $R$, leading to the notion 
of generalized approximation space (see e.g. Yao, 1998). We simply take an 
arbitrary relation $R$ instead of the equivalence $R_A$, and the set 

$$r(x) = \{ y \in U \mid x \, R \, y \}$$

instead of the equivalence class $[x]_A$; here $r(x)$ corresponds to the set $N(x)$ 
according to Equation (14.25), with $R$ standing for $\rho$. Then Equation (14.29) 
becomes 

$$\overline{R}(X) = \{ x \in U \mid r(x) \cap X \neq \emptyset \} , \qquad \underline{R}(X) = \{ x \in U \mid r(x) \subseteq X \} . \tag{14.30}$$

In our terminology, $\underline{R} = \varepsilon_r = \varepsilon_\rho$ and $\overline{R} = \delta_{\tilde r} = \delta_{\rho^{-1}}$. Now $\overline{R}$ and $\underline{R}$ are still 
dual under complementation. 

Equivalently, we can define $\overline{R}$ and $\underline{R}$ as two set operators which are dual 
under complementation ($\overline{R}(X) = [\underline{R}(X^c)]^c$), and such that $\underline{R}$ is an erosion (or 
equivalently: $\overline{R}$ is a dilation). This is in accordance with the operator-oriented 
view of rough sets (Lin and Liu, 1994; Yao, 1998). 

If $R$ is an equivalence relation, we get Pawlak’s definition, Equation (14.29); 
indeed, we can define $\mathrm{Inf}(x) = [x]_R$, the equivalence class of $x$ under $R$. Let 
us consider weaker conditions on $R$. We require that $\underline{R}(X) \subseteq \overline{R}(X)$ for all 
$X \in \mathcal{P}(U)$, which means that $\varepsilon_r \le \delta_{\tilde r}$; according to Proposition 14.20, this is 
verified iff 

$$\forall x \in U, \quad r(x) \neq \emptyset .$$

Usually, one requires that $\underline{R}$ is anti-extensive and $\overline{R}$ is extensive, that is, 

$$\forall X \in \mathcal{P}(U), \quad \underline{R}(X) \subseteq X \subseteq \overline{R}(X) ;$$

by Proposition 14.21, this is verified iff $R$ is reflexive. 

By Proposition 14.21, $R$ is symmetrical iff $\underline{R}\,\overline{R}$ is extensive, iff $\overline{R}\,\underline{R}$ is 
anti-extensive. Then $r = \tilde r$, and $(\underline{R}, \overline{R})$ is an adjunction. If $R$ is reflexive and 
symmetrical, we call it a tolerance relation. 

By Proposition 14.22, $R$ is reflexive and transitive iff $\underline{R}$ is both an erosion and 
an opening, iff $\overline{R}$ is both a dilation and a closing. Here we have an Alexandroff 
topology on $U$, where for every $X \in \mathcal{P}(U)$, $\underline{R}(X)$ is the topological interior 
of $X$ and $\overline{R}(X)$ is the topological closure of $X$. The family of open sets, 
$\{ \underline{R}(X) \mid X \in \mathcal{P}(U) \}$, and the family of closed sets, $\{ \overline{R}(X) \mid X \in \mathcal{P}(U) \}$, do 
not coincide, unless $R$ is also symmetrical (i.e., an equivalence relation). 
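
The role of the properties of the relation can be explored on a small example. The sketch below implements Equation (14.30) for a relation given as a set of pairs; the relation `R_ORDER` (the usual order on four integers, which is reflexive and transitive but not symmetrical) and all function names are assumptions made for illustration.

```python
from itertools import combinations


def r(x, R):
    """r(x) = {y | x R y} for a relation R given as a set of pairs."""
    return {y for (a, y) in R if a == x}


def lower(X, U, R):
    """Lower approximation: points x with r(x) included in X."""
    return {x for x in U if r(x, R) <= X}


def upper(X, U, R):
    """Upper approximation: points x with r(x) meeting X."""
    return {x for x in U if r(x, R) & X}


def powerset(U):
    items = sorted(U)
    return [set(c) for k in range(len(items) + 1) for c in combinations(items, k)]


U = {0, 1, 2, 3}
# Hypothetical reflexive, transitive, non-symmetrical relation: x R y iff x <= y.
R_ORDER = {(x, y) for x in U for y in U if x <= y}
X = {1, 2}

# Families of "open" and "closed" sets of the associated Alexandroff topology.
open_sets = {frozenset(lower(Z, U, R_ORDER)) for Z in powerset(U)}
closed_sets = {frozenset(upper(Z, U, R_ORDER)) for Z in powerset(U)}
```

With this relation the open sets are the up-sets and the closed sets are the down-sets, so the two families differ. Replacing `R_ORDER` by a non-reflexive relation such as `{(0, 1)}` shows how anti-extensivity of the lower approximation can fail.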

When $R$ is reflexive but not transitive, the upper and lower approximations do 
not generally correspond to a topology. However they correspond to the closure 
and interior in a pre-topology, that is: $\overline{R}$ and $\underline{R}$ are dual under complementation, 
they are both increasing, $\overline{R}$ is extensive while $\underline{R}$ is anti-extensive, $\overline{R}(\emptyset) = \emptyset$ and 
$\underline{R}(U) = U$. Let us mention the use by Emptoz, 1983 of pre-topology for the 
description of spatial objects. This may be of interest for pattern recognition 
purposes, since a non-idempotent closure allows patterns to be aggregated using 
iterated closure operations. 

In Yao, 1998, various properties are given for the operators $\overline{R}$ and $\underline{R}$, which 
may or may not be satisfied, according to the properties of the relation $R$. In 
Bloch, 2000b, a parallel is made between these properties and those of dilations, 
erosions, openings and closings. In fact, all these properties follow from the 
ones given in Sec. 2.3 for dilations and erosions on sets, those of openings and 
closings in Sec. 2.2, and from Equation (14.15). 

Using the operator-oriented point of view (Lin and Liu, 1994; Yao, 1998), 
one could also define the lower and upper approximation as an opening and a 
closing. However, such operators cannot be defined in terms of a relation or a 
neighbourhood function, as was the case with dilations and erosions. Openings 
and closings are characterized by their invariance domain: for an opening, it is 
a dual Moore family, i.e., a family closed under arbitrary unions and containing 
$\emptyset$; for a closing it is a Moore family, i.e., a family closed under arbitrary 
intersections and containing $U$. Let $L$ be an opening and let $S$ be a family of 
nonvoid parts of $U$ such that $\mathrm{Inv}(L)$ is the set of all arbitrary unions of elements 
of $S$ (including the empty union $\emptyset$); in fact $L$ is the opening $\gamma_S$ defined after 
Eqs. (14.18,14.19), as for every $X \in \mathcal{P}(U)$ we have 

$$L(X) = \bigcup \{ A \in S \mid A \subseteq X \} .$$

For example, if $U = E$ and $L$ is the opening by a structuring element $B$, then 
$S$ is the set of translates $B_p$ ($p \in E$) of $B$. Now let $H$ be the closing which is 
the dual of $L$ by complementation; then $\mathrm{Inv}(H) = \{ A^c \mid A \in \mathrm{Inv}(L) \}$, and 
$\mathrm{Inv}(H)$ is the set of arbitrary intersections of $A^c$ for $A \in S$. For $X \in \mathcal{P}(U)$ 
we have 

$$H(X) = \Bigl( \bigcup \{ A \in S \mid A \subseteq X^c \} \Bigr)^c .$$


We can interpret the elements of $S$ as “blocks”, and a point $x$ can be 
included in the lower approximation $L(X)$ only through its membership of a 
“block” included in $X$, while a point $x$ can be excluded from the upper 
approximation $H(X)$ only through its membership of a “block” excluded from 
$X$. However these blocks do not make a partition, as with Pawlak’s definition, 
Equation (14.29). 
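
The block interpretation can be sketched as follows. The family `S` of overlapping windows of length 2 is a hypothetical choice, used only to show that the blocks need not form a partition; the function names are likewise illustrative.

```python
def opening(X, blocks):
    """L(X): union of all blocks included in X."""
    out = set()
    for A in blocks:
        if A <= X:
            out |= A
    return out


def closing(X, blocks, U):
    """H(X): complement of the union of the blocks included in the complement of X."""
    return U - opening(U - X, blocks)


U = set(range(6))
# Hypothetical overlapping blocks (not a partition): windows {i, i + 1}.
S = [frozenset({i, i + 1}) for i in range(5)]
X = {0, 1, 3}

L_X = opening(X, S)      # the isolated point 3 belongs to no block inside X
H_X = closing(X, S, U)   # the point 2 cannot be excluded by any block
```

On this example the lower approximation is {0, 1} and the upper approximation is {0, 1, 2, 3}; both operators are idempotent, as expected from an opening and a closing.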

An interesting particular case is when L and H are the interior and closure 
operators in a topological space. One speaks then of a topological approxima- 
tion space. As explained after Proposition 14.22, this is weaker than requiring 
H to be both a dilation and a closing, and dually L to be both an erosion and 
an opening. 

Other operators could be used for lower and upper approximations. In Serra, 
1988, it is shown that every increasing operator on a complete lattice, which 
fixes the greatest element, is a supremum of erosions. Thus an increasing 
operator $\psi$ on $\mathcal{P}(U)$ such that $\psi(U) = U$, is a union of erosions; in particular $\psi$ 
is anti-extensive iff these erosions are anti-extensive. Consider the following 
definition of lower and upper approximations (Lin, 1995). Suppose that to each 
$x \in U$ we associate a family $\mathcal{N}(x)$ of parts of $U$ which are “neighbourhoods” 
of $x$; $\mathcal{N}(x)$ is called a neighbourhood system of $x$. Now we define the upper 
and lower approximations $\overline{N}$ and $\underline{N}$ as follows: 

$$\overline{N}(X) = \{ x \in U \mid \forall A \in \mathcal{N}(x), \; A \cap X \neq \emptyset \} , \qquad \underline{N}(X) = \{ x \in U \mid \exists A \in \mathcal{N}(x), \; A \subseteq X \} . \tag{14.31}$$

Clearly $\overline{N}$ is the dual by complementation of $\underline{N}$, so let us analyse the latter. We 
have 

$$\underline{N}(U) = \{ x \in U \mid \mathcal{N}(x) \neq \emptyset \} ,$$

so in order to have $\underline{N}(U) = U$, we suppose that $\mathcal{N}(x) \neq \emptyset$ for all $x \in U$. 
Then $\underline{N}$ is a union of erosions, and we can describe them precisely. For every 
$x \in U$ and for every $A \in \mathcal{N}(x)$, let $N[x, A] : U \to \mathcal{P}(U)$ be defined by 
$N[x, A](x) = A$ and $N[x, A](y) = U$ for $y \in U \setminus \{x\}$. Then $\varepsilon_{N[x,A]}$ verifies 
for every $X \in \mathcal{P}(U)$: 

$$\varepsilon_{N[x,A]}(X) = \begin{cases} U & \text{if } X = U, \\ \{x\} & \text{if } A \subseteq X \neq U, \\ \emptyset & \text{otherwise.} \end{cases}$$

We obtain then 

$$\underline{N} = \bigcup_{x \in U} \; \bigcup_{A \in \mathcal{N}(x)} \varepsilon_{N[x,A]} .$$

Dually, we get 

$$\overline{N} = \bigcap_{x \in U} \; \bigcap_{A \in \mathcal{N}(x)} \delta_{\widetilde{N[x,A]}} .$$


This view is particularly interesting for recognition purposes, since in morphological 
recognition, an object often has to be tested or matched with a set of 
patterns, like directional structuring elements. Thus we apply the union of the 
erosions by all those patterns, which is a particular case of the above operator. 
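
The decomposition of the lower approximation of Equation (14.31) into a union of erosions can be checked on a toy neighbourhood system. In the sketch below, `NSYS` (each point of a cyclic four-point universe has a "left" and a "right" neighbourhood) and the function names are hypothetical and serve only as an illustration.

```python
def lower_ns(X, U, nsys):
    """x is kept if SOME neighbourhood of x is included in X."""
    return {x for x in U if any(A <= X for A in nsys[x])}


def upper_ns(X, U, nsys):
    """x is kept if EVERY neighbourhood of x meets X."""
    return {x for x in U if all(A & X for A in nsys[x])}


def erosion_xA(X, U, x, A):
    """Erosion by N[x, A], where N[x, A](x) = A and N[x, A](y) = U for y != x."""
    return {p for p in U if (A if p == x else U) <= X}


def union_of_erosions(X, U, nsys):
    """Union over all x and all A in the system of x of the erosions by N[x, A]."""
    out = set()
    for x in U:
        for A in nsys[x]:
            out |= erosion_xA(X, U, x, A)
    return out


U = {0, 1, 2, 3}
# Hypothetical neighbourhood system on a cycle of four points.
NSYS = {x: [frozenset({x, (x - 1) % 4}), frozenset({x, (x + 1) % 4})] for x in U}
X = {0, 1, 2}
```

For this `X`, the point 3 has no neighbourhood inside `X` but both of its neighbourhoods meet `X`, so it is excluded from the lower approximation and included in the upper one.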

In Bloch, 2000b, some other extensions are presented, using as lower and 
upper approximations morphological thinning and thickening (Serra, 1982). 
There is also an extension to rough functions, using the grey-level morphological 
operations described in Sec. 1.2. 

To conclude, let us remark that the general theory of adjunctions, dilations, 
erosions, openings and closings on sets provides a good formal framework for 
expressing the notion of coarseness underlying rough sets. It allows us to characterize 
precisely their algebraic properties and their relations with topology. In 
Sec. 3.4 and 4, we will examine the relationship of rough sets with fuzzy sets 
and modal logic, especially in the morphological framework. 


3.3 Mathematical morphology and spatial reasoning 


Spatial reasoning has been largely developed in artificial intelligence, in particular 
using qualitative representations based on logical formalisms. In image 
interpretation and computer vision it is much less developed and is mainly 
based on quantitative representations. By contrast, mathematical morphology 
is widely used in these domains. A typical example concerns model-based 
structure recognition in images, where the model represents spatial entities and 
relationships between them. Based on this example, spatial reasoning can be 
defined as the domain of spatial knowledge representation, in particular spatial 
relations between spatial entities, and of reasoning on these entities and 
relations. This definition exhibits two main components: spatial knowledge 
representation and reasoning. In particular spatial relationships constitute an 
important part of the knowledge we have to handle and imprecision is often 
attached to it. The reasoning component includes fusion of heterogeneous spatial 
knowledge, decision making, inference, recognition. Two types of questions 
are raised when dealing with spatial relationships: 


1 given two objects (possibly fuzzy), assess the degree to which a relation 
is satisfied; 


2 given one reference object, define the area of space in which a relation 
to this reference is satisfied (to some degree). 


In order to answer these questions and address both representation and reasoning 
issues, different frameworks and their combination can be used. Fuzzy set 
theory has powerful features to represent imprecision at different levels, to 
combine heterogeneous information and to make decisions (Dubois and Prade, 
1985; Dubois et al., 1999). Formal logics and the attached reasoning and 
inference power are widely used too, usually in a qualitative context. But 
mathematical morphology, which is an algebraic theory that has extensions to 
fuzzy sets and to logical formulas, is a very promising tool, since it can elegantly 
unify the representation of several types of relationships (Bloch, 2003b). The 
association of different frameworks for spatial reasoning allows us to meet 
requirements such as axiomatization, expressiveness and completeness 
with respect to the types of spatial information we want to represent (Aiello, 
2002). Complexity issues are not addressed here, but it should be noted that 
efficient algorithms exist for digital morphology. 

Mathematical morphology provides tools for spatial reasoning at several 
levels. It provides tools for representing objects or object features (see e.g. 
Sec. 1.1 and 2). For instance skeletons provide compact and expressive rep- 
resentations of shapes; morphological tools for shape decomposition lead to 
structured representations, such as graphs for instance; spatial imprecision can 
be represented by a pair of dilation and erosion; tools for selecting objects or 
parts of objects having specific properties can be derived from morphological 
operators such as hit-or-miss transformations for instance, etc. These aspects, 
quite traditional in mathematical morphology, are not detailed here, and we 
will concentrate rather on tools for representing spatial relations. The notion of 
structuring element captures the local spatial context and leads to analysis of a 
scene using operators involving the neighbourhood of each point. At a more 
global level, several spatial relations between spatial entities can be expressed 
as morphological operations, in particular using dilations. Let us provide a few 
examples, of metric and topological relationships. 

Several distances between objects can be expressed in terms of dilation. The 
minimum or nearest point distance between two sets $X$ and $Y$ is defined (in the 
discrete finite case) as: 

$$d_N(X,Y) = \min_{(x,y) \in X \times Y} d_E(x,y) = \min_{x \in X} d_E(x,Y) = \min_{y \in Y} d_E(y,X), \tag{14.32}$$

where $d_E$ denotes the Euclidean distance in $S$ (note that this function is improperly 
named distance since it is not separable and does not satisfy the triangle 
inequality). This has an equivalent morphological expression: 

$$d_N(X,Y) = \inf\{ n \in \mathbb{N}, \; X \cap \delta_n(Y) \neq \emptyset \} = \inf\{ n \in \mathbb{N}, \; Y \cap \delta_n(X) \neq \emptyset \}. \tag{14.33}$$

Another morphological expression is, for $n > 0$: 

$$d_N(X,Y) = n \iff \delta_n(X) \cap Y \neq \emptyset \text{ and } \delta_{n-1}(X) \cap Y = \emptyset \tag{14.34}$$

or equivalently the symmetrical expression. For $n = 0$ we have: 

$$d_N(X,Y) = 0 \iff X \cap Y \neq \emptyset. \tag{14.35}$$

The Hausdorff distance is defined as: 

$$H_d(X,Y) = \max\Bigl[ \sup_{x \in X} d_E(x,Y), \; \sup_{y \in Y} d_E(y,X) \Bigr]. \tag{14.36}$$

As for the nearest point distance, this distance can be expressed in 
morphological terms as: 

$$H_d(X,Y) = \inf\{ n, \; X \subseteq \delta_n(Y) \text{ and } Y \subseteq \delta_n(X) \}. \tag{14.37}$$

Alternatively, we can write: 

$$H_d(X,Y) = 0 \iff X = Y, \tag{14.38}$$

and for $n > 0$: 

$$H_d(X,Y) = n \iff X \subseteq \delta_n(Y) \text{ and } Y \subseteq \delta_n(X) \text{ and } \bigl( X \not\subseteq \delta_{n-1}(Y) \text{ or } Y \not\subseteq \delta_{n-1}(X) \bigr). \tag{14.39}$$

From these representations, several types of knowledge about distance can 
be expressed. For instance, Fig. 14.12 shows a spatial representation of “B is 
at a distance between $n_1$ and $n_2$ from A”, i.e. B should be in the dilation of 
radius $n_2$ of A but not in the dilation of radius $n_1$ of A. 
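
The morphological expressions (14.33) and (14.37) translate directly into iterated dilations. The following sketch is only illustrative: it assumes a bounded 2D grid, non-empty sets, and a unit dilation by the 4-neighbourhood, so the distances it returns are city-block distances rather than Euclidean ones. All names are hypothetical.

```python
def dilate(X, domain):
    """Unit dilation: add the 4-neighbours of every point, clipped to the domain."""
    out = set(X)
    for (i, j) in X:
        for (di, dj) in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            if (i + di, j + dj) in domain:
                out.add((i + di, j + dj))
    return out


def dilate_n(X, n, domain):
    """Dilation of radius n, obtained as n successive unit dilations."""
    D = set(X)
    for _ in range(n):
        D = dilate(D, domain)
    return D


def nearest_point_distance(X, Y, domain):
    """Smallest n such that X meets the dilation of radius n of Y (X, Y non-empty)."""
    n, D = 0, set(Y)
    while not (X & D):
        D, n = dilate(D, domain), n + 1
    return n


def hausdorff_distance(X, Y, domain):
    """Smallest n such that X and Y are each included in the n-dilation of the other."""
    n, DX, DY = 0, set(X), set(Y)
    while not (X <= DY and Y <= DX):
        DX, DY, n = dilate(DX, domain), dilate(DY, domain), n + 1
    return n


def distance_band(A, n1, n2, domain):
    """Region inside the dilation of radius n2 of A but outside that of radius n1."""
    return dilate_n(A, n2, domain) - dilate_n(A, n1, domain)


DOMAIN = {(i, j) for i in range(8) for j in range(8)}
X = {(0, 0), (0, 1)}
Y = {(0, 4), (3, 4)}
```

The function `distance_band` gives the region of space in which an object at a distance between $n_1$ and $n_2$ of the reference object has to lie, which answers the second type of question raised above for this relation.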

Figure 14.12. Illustration of a distance relation expressed by an interval. 

Another example is adjacency. Here, we restrict ourselves to the digital case, 
and use discrete topology as derived from digital connectivity for defining 
adjacency between two regions $X$ and $Y$, subsets of the digital space (see 
Ch. 12 of this book for further details about digital topology). Let us consider 
an $n$-dimensional digital space (typically $\mathbb{Z}^n$), and any discrete connectivity 
defined on this space, denoted $c$-connectivity (for instance, for $n = 3$, we 
may consider 6-, 18- or 26-connectivity on a cubic grid). Let $n_c(x, y)$ be the 
Boolean variable stating that $x$ and $y$ are neighbours in the sense of the discrete 
$c$-connectivity. Let $B_c$ be the set of $c$-neighbours of the origin. For any two 
subsets $X$ and $Y$ in $\mathbb{Z}^n$, $X$ and $Y$ are adjacent according to the $c$-connectivity 
if: $X \cap Y = \emptyset$ and $\exists x \in X, \exists y \in Y : n_c(x, y)$. 

This definition can also be expressed equivalently in terms of morphological 
dilation, as: 

$$X \cap Y = \emptyset \text{ and } \delta_{B_c}(X) \cap Y \neq \emptyset, \tag{14.40}$$

where $\delta_{B_c}(X)$ denotes the dilation of $X$ by the structuring element $B_c$. 
Another topological relation, often used in the context of mereotopology for 
instance, is tangential proper part. Again this can be expressed in morphological 
terms, as illustrated in Fig. 14.13. 
These expressions extend to different frameworks, including fuzzy set the- 
ory and formal logics, thus benefiting from the reasoning power of these 
frameworks. 
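
A minimal sketch of these two topological relations in the 2D digital case is given below. The neighbour lists `N4` and `N8` stand for the structuring element $B_c$ of 4- and 8-connectivity; the function names and the test shapes are assumptions made for this example.

```python
N4 = ((1, 0), (-1, 0), (0, 1), (0, -1))
N8 = N4 + ((1, 1), (1, -1), (-1, 1), (-1, -1))


def dilate_by(X, neighbours):
    """Dilation of X by the set of c-neighbours of the origin."""
    return {(i + di, j + dj) for (i, j) in X for (di, dj) in neighbours}


def adjacent(X, Y, neighbours):
    """Adjacency: X and Y are disjoint and the dilation of X meets Y."""
    return not (X & Y) and bool(dilate_by(X, neighbours) & Y)


def tangential_part(X, Y, neighbours):
    """X is included in Y while its dilation is not."""
    return X <= Y and not (dilate_by(X, neighbours) <= Y)


# Hypothetical 3 x 3 block used as the containing region.
BLOCK = {(i, j) for i in range(3) for j in range(3)}
```

Two diagonal pixels are adjacent for 8-connectivity but not for 4-connectivity, which shows how the relation depends on the chosen structuring element.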


3.4 Fuzzy mathematical morphology 


Figure 14.13. Illustration of tangential part relationship, and its expression in terms of dilation 
and erosion: $X$ is included in $Y$ while its dilation $\delta(X)$ is not (equivalently, $X$ is not included in the 
erosion $\varepsilon(Y)$ of $Y$). 

Dealing with spatial imprecision can be adequately addressed by defining 
objects or regions of space as fuzzy objects. We denote by $S$ the spatial domain, 
typically $\mathbb{Z}^2$ or $\mathbb{Z}^3$ for digital 2D or 3D images, or, in the continuous case, $\mathbb{R}^2$ 
or $\mathbb{R}^3$. A fuzzy object is a fuzzy set defined on $S$, i.e. a spatial fuzzy set. Its 
membership function $\mu$ represents the imprecision in the spatial extent of the 
object. For any point $x$ of $S$, $\mu(x)$ is the degree to which $x$ belongs to the fuzzy 
object. We denote by $\mathcal{F}$ the set of fuzzy sets on $S$. 

Several definitions of fuzzy mathematical morphology have been proposed 
in recent years. Some of them just consider grey levels as membership functions 
(Goetcherian, 1980; Giardina and Sinha, 1989; Laplante and Giardina, 
1991; di Gesu, 1988; di Gesu et al., 1993; Nakatsuyama, 1993), or use binary 
structuring elements (Rosenfeld, 1984). Here we restrict the presentation to 
genuinely fuzzy approaches, where fuzzy sets have to be transformed according to 
fuzzy structuring elements. Initial developments can be found in the definition 
of fuzzy Minkowski addition (Dubois and Prade, 1983; Kaufmann and Gupta, 
1988). Then this problem has been addressed by several authors independently 
(Bloch, 1993; Bloch and Maitre, 1995; Sinha and Dougherty, 1992; de Baets and 
Kerre, 1993; de Baets, 1995; Bandemer and Näther, 1992; Popov, 1995; Sinha 
et al., 1997; Nachtegael and Kerre, 2000; Maragos et al., 2001; Deng and 
Heijmans, 2002). 

Attention will be paid here only to the four basic operations of mathematical 
morphology (erosion, dilation, opening, closing), but it should be clear to 
the reader that for every definition, a complete set of morphological operations 
could be derived. Extensions of mathematical morphology have been proposed 
for instance for defining more complex operations (like filtering) (Bloch and 
Maitre, 1995; Sinha et al., 1997), and geodesic operations (Bloch, 2000a). 

Although a fuzzy set is defined through its membership function, functional 
approaches are not appropriate. For instance, the classical dilation of a function 
taking values in [0, 1] by a functional structuring element taking values in [0, 1] 
generally provides a function with values in [0, 2] which has no direct 
interpretation in terms of fuzzy sets. Therefore a set theoretical approach is preferred, 
where set operations are converted into their fuzzy equivalents, thus preserving 
the compatibility with classical morphology in case the fuzzy sets reduce to 
crisp ones. 


General methods for extending an operation to fuzzy sets. Common 
and generic methods that can be used for defining a fuzzy operator or fuzzy 
relationship from the corresponding binary ones can be categorized in three 
main classes. The first type relies on the “extension principle”, as introduced 
by Zadeh (Zadeh, 1975; Dubois and Prade, 1980). The second class relies on 
computation on α-cuts (e.g. Dubois and Jaulent, 1987; Krishnapuram et al., 
1993; Bloch and Maitre, 1995). These two classes of definitions explicitly 
involve the operations or relations on crisp sets. The third class of methods 
consists in providing directly fuzzy definitions of the operations or of the 
relationships, by substituting all crisp expressions by their fuzzy equivalents. This 
type of translation is used in the following. 

This translation is generally done term by term. For instance, intersection 
is replaced by a t-norm, union by a t-conorm, sets by fuzzy set membership 
functions, etc. A triangular norm (or t-norm) is a function from [0, 1] x [0, 1] 
into [0,1] which is commutative, associative, increasing, and for which 1 is 
unit element and 0 is null element (Menger, 1942; Schweizer and Sklar, 1963). 
Examples of t-norms are min, product, etc. (Dubois and Prade, 1980). A 
t-conorm is its dual with respect to complementation. This type of translation 
is particularly straightforward if the binary relationship can be expressed in set 
theoretical and logical terms. 
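
To make these definitions concrete, the sketch below lists three classical t-norms, derives their dual t-conorms for the complementation $c(a) = 1 - a$, and checks the t-norm axioms on a small grid of values. The grid check is only a sanity test on finitely many points, not a proof; the helper names are hypothetical, and exact rational arithmetic is used to avoid rounding issues.

```python
from fractions import Fraction as F


def t_min(a, b):
    return min(a, b)


def t_prod(a, b):
    return a * b


def t_luk(a, b):
    """Lukasiewicz t-norm."""
    return max(F(0), a + b - 1)


def dual_conorm(t):
    """t-conorm T(a, b) = c(t(c(a), c(b))) for the complementation c(a) = 1 - a."""
    return lambda a, b: 1 - t(1 - a, 1 - b)


GRID = [F(k, 4) for k in range(5)]


def is_t_norm_on_grid(t, grid=GRID):
    """Check commutativity, associativity, increasingness, unit 1 and null 0 on the grid."""
    commutative = all(t(a, b) == t(b, a) for a in grid for b in grid)
    associative = all(t(a, t(b, c)) == t(t(a, b), c)
                      for a in grid for b in grid for c in grid)
    increasing = all(t(a, b) <= t(a2, b)
                     for a in grid for a2 in grid for b in grid if a <= a2)
    unit_and_null = all(t(a, 1) == a and t(a, 0) == 0 for a in grid)
    return commutative and associative and increasing and unit_and_null
```

For instance, `dual_conorm(t_min)` behaves as the maximum and `dual_conorm(t_luk)` as the bounded sum $\min(1, a + b)$, whereas the arithmetic mean fails the test because 1 is not a unit element for it.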

Let us take a simple example to illustrate this method. A fuzzy set $\mu$ is said 
to be included in another fuzzy set $\nu$ if $\forall x \in S, \mu(x) \le \nu(x)$. This is a 
crisp definition of inclusion of fuzzy sets. We may also consider that if two 
sets are imprecisely defined, their inclusion relationship may be imprecise too. 
Therefore inclusion of fuzzy sets becomes a matter of degree. This degree of 
inclusion can be obtained using the translation principle. In the crisp case, the 
set equation expressing inclusion of a set $X$ in a set $Y$ can be written as follows: 

$$X \subseteq Y \iff X^c \cup Y = S \iff \forall x \in S, \; x \in X^c \cup Y. \tag{14.41}$$

In the fuzzy case, $X$ and $Y$ become fuzzy sets having membership functions $\mu$ 
and $\nu$ and we have the following correspondences: 

$$\forall x \in S \;\leftrightarrow\; \inf_{x \in S}, \tag{14.42}$$
$$x \in X^c \;\leftrightarrow\; c[\mu(x)], \tag{14.43}$$
$$x \in Y \;\leftrightarrow\; \nu(x), \tag{14.44}$$
$$X^c \cup Y \;\leftrightarrow\; T[c(\mu), \nu]. \tag{14.45}$$

Finally, the degree of inclusion of $\mu$ in $\nu$ is defined as: 

$$\mathcal{I}(\mu, \nu) = \inf_{x \in S} T[c(\mu(x)), \nu(x)], \tag{14.46}$$

where $T$ is a t-conorm and $c$ a fuzzy complementation. 
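
As an illustration of Equation (14.46), the sketch below evaluates the degree of inclusion on a finite domain, with $c(a) = 1 - a$ and two possible t-conorms. Fuzzy sets are represented as dictionaries from points to membership values; this representation and the names used are assumptions of the example.

```python
def conorm_max(a, b):
    return max(a, b)


def conorm_luk(a, b):
    """Lukasiewicz t-conorm (bounded sum)."""
    return min(1.0, a + b)


def degree_of_inclusion(mu, nu, conorm):
    """inf over x of T[c(mu(x)), nu(x)], with c(a) = 1 - a, on a finite domain."""
    return min(conorm(1.0 - mu[x], nu[x]) for x in mu)


# Hypothetical fuzzy sets on a three-point domain.
MU = {0: 0.5, 1: 1.0, 2: 0.0}
NU = {0: 0.75, 1: 0.75, 2: 0.25}

# A constant fuzzy set of height 0.5, compared with itself.
HALF = {0: 0.5, 1: 0.5}
```

The choice of t-conorm matters: `HALF` is included in itself to degree 1 with the Lukasiewicz t-conorm, but only to degree 0.5 with the maximum. For crisp sets both choices reduce to the usual Boolean inclusion.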


Fuzzy morphology by formal translation based on t-norms and t-conorms. 

The first attempts to build fuzzy mathematical morphology based on this 
translation principle were developed in Bloch, 1993 and Bloch and Maitre, 
1995, and coincide with the definitions independently developed in Bandemer 
and Näther, 1992. An important property emphasized in this approach 
is the duality between erosion and dilation. 

From the following set equivalence (where $\varepsilon_B(X)$ denotes the erosion of the 
set $X$ by $B$): $x \in \varepsilon_B(X) \iff B_x \subseteq X$, a natural way to define the erosion of 
a fuzzy set $\mu$ by a fuzzy structuring element $\nu$ is to use the degree of inclusion 
defined above: 

$$\forall x \in S, \quad \varepsilon_\nu(\mu)(x) = \inf_{y \in S} T[c(\nu(y - x)), \mu(y)]. \tag{14.47}$$

By duality with respect to the complementation $c$, fuzzy dilation is then 
defined by: 

$$\forall x \in S, \quad \delta_\nu(\mu)(x) = \sup_{y \in S} t[\nu(x - y), \mu(y)], \tag{14.48}$$

where $t$ is the t-norm associated to the t-conorm $T$ with respect to the 
complementation $c$. This definition of dilation corresponds to the following set 
equivalence: 

$$x \in \delta_B(X) \iff \check{B}_x \cap X \neq \emptyset \iff \exists y \in S, \; y \in \check{B}_x \cap X. \tag{14.49}$$

Here, intersection $\cap$ has been translated in terms of a t-norm $t$ and the existential 
symbol by a supremum. 

These forms of fuzzy dilation and fuzzy erosion are very general, and several 
definitions found in the literature appear as particular cases (such as 
Bandemer and Näther, 1992; de Baets and Kerre, 1993; Sinha and Dougherty, 
1992; Rosenfeld, 1984; Kaufmann and Gupta, 1988; Goetcherian, 1980). 

Finally, fuzzy opening (respectively fuzzy closing) is simply defined as the 
combination of a fuzzy erosion followed by a fuzzy dilation (respectively a fuzzy 
dilation followed by a fuzzy erosion), by using dual t-norms and t-conorms. 
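
The following sketch applies Eqs. (14.47,14.48) to a 1D example with the Lukasiewicz t-norm and t-conorm and $c(a) = 1 - a$. It makes several simplifying assumptions that are not in the text: the domain is a finite window of integers playing the role of $S$, fuzzy sets are dictionaries with implicit value 0 outside their keys, and the structuring element `NU` is symmetrical so that the duality check does not involve its reflection.

```python
from fractions import Fraction as F


def t_luk(a, b):
    return max(F(0), a + b - 1)


def T_luk(a, b):
    return min(F(1), a + b)


def fuzzy_dilation(mu, nu, points, t=t_luk):
    """delta(x) = sup over y of t[nu(x - y), mu(y)]."""
    return {x: max(t(nu.get(x - y, F(0)), mu.get(y, F(0))) for y in points)
            for x in points}


def fuzzy_erosion(mu, nu, points, T=T_luk):
    """epsilon(x) = inf over y of T[1 - nu(y - x), mu(y)]."""
    return {x: min(T(1 - nu.get(y - x, F(0)), mu.get(y, F(0))) for y in points)
            for x in points}


def complement(mu, points):
    return {x: 1 - mu.get(x, F(0)) for x in points}


POINTS = range(7)
MU = {2: F(1), 3: F(3, 4), 4: F(1, 4)}          # hypothetical fuzzy object
NU = {-1: F(1, 2), 0: F(1), 1: F(1, 2)}         # hypothetical structuring element

dil = fuzzy_dilation(MU, NU, POINTS)
ero = fuzzy_erosion(MU, NU, POINTS)
opening = fuzzy_dilation(ero, NU, POINTS)       # erosion followed by dilation
```

Since `NU` takes the value 1 at the origin, the dilation is extensive and the erosion anti-extensive on this example, and the opening obtained with the Lukasiewicz operators is anti-extensive and idempotent.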

Weak t-norms and t-conorms are weaker forms of t-norms and t-conorms: 
they are not associative and do not admit 1 (respectively 0) as unit element, 
in general. If we replace t-norms and t-conorms by these weaker forms in the 
previous construction, then Eqs. (14.47,14.48) appear as a generalization of the 
definitions proposed in Sinha and Dougherty, 1993. But they lead to weaker 
properties, and are therefore somewhat less interesting from a morphological 
point of view. 

Properties of basic fuzzy morphological operations. Details of the properties 
for the various definitions can be found in Bloch and Maitre, 1995. We 
summarize here the main properties when using t-norms and t-conorms: 


- duality of erosion and dilation (respectively opening and closing) with 
  respect to the complementation $c$; 

- compatibility with classical morphology if the structuring element is 
  binary; 

- translation-invariance (see Sec. 2); 

- local knowledge property; 

- continuity if the t-norm is continuous (which is most often the case); 

- increasingness of all operations with respect to inclusion; 

- extensivity of dilation and anti-extensivity of erosion iff $\nu(0) = 1$ (this 
  corresponds to the condition that the origin should belong to the structuring 
  element in the crisp case); 

- extensivity of closing, anti-extensivity of opening and idempotence of 
  these two operations iff $t[b, T(c(b), a)] \le a$, which is satisfied for the 
  Lukasiewicz t-norm ($t(a, b) = \max(0, a + b - 1)$) and t-conorm ($T(a, b) = \min(1, a + b)$); 

- commutation of dilation with union (and of erosion with intersection); 

- iteration property of dilation ($\delta_\nu \delta_{\nu'} = \delta_{\delta_\nu(\nu')}$) and of erosion. 
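
The condition $t[b, T(c(b), a)] \le a$ in the list above can be probed numerically. The sketch below tests it on a finite grid of rational values for three dual pairs, with $c(b) = 1 - b$; a grid test can refute the condition but does not prove it in general, and the names are hypothetical.

```python
from fractions import Fraction as F

GRID = [F(k, 8) for k in range(9)]


def condition_holds(t, T):
    """Check t[b, T(1 - b, a)] <= a for all a, b on the grid."""
    return all(t(b, T(1 - b, a)) <= a for a in GRID for b in GRID)


# Dual pairs (t-norm, t-conorm) for the complementation c(a) = 1 - a.
LUKASIEWICZ = (lambda a, b: max(F(0), a + b - 1), lambda a, b: min(F(1), a + b))
MIN_MAX = (min, max)
PRODUCT = (lambda a, b: a * b, lambda a, b: a + b - a * b)
```

For the Lukasiewicz pair the left-hand side simplifies to $\min(a, b)$, so the condition holds. For the pair (min, max) it already fails at $a = 0$, $b = 1/2$, where the left-hand side equals $1/2$.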


Fuzzy morphology using adjunction and residual implications. A second 
type of approach is based on the notion of adjunction and fuzzy implications. 
Here the algebraic framework is the main guideline, which contrasts with the 
previous approach where duality was imposed in the first place. 

Fuzzy implication is often defined as (Dubois and Prade, 1991): 

$$\mathrm{Imp}(a, b) = T[c(a), b]. \tag{14.50}$$

Fuzzy inclusion is related to implication by the following equation: 

$$\mathcal{I}(\nu, \mu) = \inf_{x \in S} \mathrm{Imp}[\nu(x), \mu(x)], \tag{14.51}$$

which allows fuzzy erosion to be related directly to fuzzy implication, leading to 
the general definition using a t-conorm, and by duality also fuzzy dilation. 

This suggests another way to define fuzzy erosion (and dilation), by using 
other forms of fuzzy implication. One interesting approach is to use residual 
implications: 

$$\mathrm{Imp}(a, b) = \sup\{ \varepsilon \in [0,1], \; t(a, \varepsilon) \le b \}. \tag{14.52}$$

This provides the following expression for the degree of inclusion: 

$$\mathcal{I}(\nu, \mu) = \inf_{x \in S} \sup\{ \varepsilon \in [0,1], \; t(\nu(x), \varepsilon) \le \mu(x) \}. \tag{14.53}$$

This definition coincides with the previous one for particular forms of $t$, 
typically the Lukasiewicz t-norm. 

The derivation of fuzzy morphological operators from residual implication 
has been proposed in de Baets, 1995, and then developed e.g. in de Baets, 1997 
and Nachtegael and Kerre, 2000. One of its main advantages is that it leads 
to idempotent fuzzy closing and opening. This approach was formalized from 
the algebraic point of view of adjunction, as developed in Deng and Heijmans, 
2002. It has then been used by other authors, such as in Maragos 
et al., 2001 and Maragos, 2005. This leads to general algebraic fuzzy erosion 
and dilation. Let us detail this approach. A fuzzy implication $I$ is a mapping 
from $[0, 1] \times [0, 1]$ into $[0, 1]$ which is decreasing in the first argument, increasing 
in the second one and satisfies $I(0,0) = I(0,1) = I(1,1) = 1$ and $I(1,0) = 0$. 
A fuzzy conjunction is a mapping from $[0,1] \times [0,1]$ into $[0,1]$ which is 
increasing in both arguments and satisfies $C(0,0) = C(1,0) = C(0,1) = 0$ 
and $C(1,1) = 1$. If $C$ is also associative and commutative and admits 1 as unit 
element, it is a t-norm. A pair of operators $(I, C)$ is said to be adjoint if: 

$$C(a, b) \le c \iff b \le I(a, c). \tag{14.54}$$

The adjoint of a conjunction is a residual implication. 
Fuzzy dilation and erosion are then defined as: 

$$\delta_\nu(\mu)(x) = \sup_{y \in S} C(\nu(x - y), \mu(y)), \tag{14.55}$$

$$\varepsilon_\nu(\mu)(x) = \inf_{y \in S} I(\nu(y - x), \mu(y)). \tag{14.56}$$

Note that $(I, C)$ is an adjunction if and only if $(\varepsilon_\nu, \delta_\nu)$ is an adjunction. 
Opening and closing derived from these operations by combination have 
all required properties, whatever the choice of $C$ and $I$. Some properties of 
dilation and erosion, such as iterativity, require $C$ and $I$ to be associative and 
commutative. 
If $C$ is a t-norm, then the dilation is exactly the same as the one obtained 
in the first approach. To understand the relation between both approaches for 
erosion, we define $\hat{I}(a, b) = I(c(a), b)$ where $c$ is a fuzzy complementation. In 
the following, we simply take $c(a) = 1 - a$ which is the most usual complementation. 
Then $\hat{I}$ is increasing in both arguments, and if $\hat{I}$ is further assumed to be 
commutative and associative, $\hat{I}$ is a t-conorm. Equation (14.56) can be rewritten 
as: $\varepsilon_\nu(\mu)(x) = \inf_y \hat{I}(1 - \nu(y - x), \mu(y))$, which corresponds to the fuzzy 
erosion of the first approach. The adjunction property can also be written as 
$C(a, b) \le c \iff b \le \hat{I}(1 - a, c)$. However, pairs of dual t-norms and t-conorms 
are not identical to pairs of adjoint operators. Let us take a few examples. For 
$C = \min$, its adjoint is $I(a, b) = b$ if $b < a$, and 1 otherwise. But the derived $\hat{I}$ 
is the dual of the conjunction defined as $C(a, b) = 0$ if $b \le 1 - a$ and $b$ otherwise. 
Conversely, the adjoint of this conjunction is $I(a, b) = \max(1 - a, b)$, the dual 
of which is the minimum conjunction. Lukasiewicz operators are both adjoint 
and dual, which explains the exact correspondence between both approaches for 
these operators. Moreover, it can be proved that the condition for t-norms and 
t-conorms leading to idempotent opening and closing (i.e. $t(b, T(c(b), a)) \le a$) 
is equivalent to the adjunction property between $C$ and $I$ for $t = C$ and $T = \hat{I}$. 
This new result completes the link between both approaches. 
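
The examples of adjoint pairs given above can be verified on a finite grid. In the sketch below, `residual` computes the residual implication of Equation (14.52) with the supremum restricted to grid values, which is sufficient for the three conjunctions considered here because their residuals take grid values on grid arguments. The grid, the function names and the restriction to finitely many values are assumptions of the example.

```python
from fractions import Fraction as F

GRID = [F(k, 8) for k in range(9)]


def residual(C):
    """I(a, c) = sup{b | C(a, b) <= c}, with the supremum taken over the grid only."""
    return lambda a, c: max(b for b in GRID if C(a, b) <= c)


def is_adjoint(I, C):
    """Check C(a, b) <= c  <=>  b <= I(a, c) for all grid values."""
    return all((C(a, b) <= c) == (b <= I(a, c))
               for a in GRID for b in GRID for c in GRID)


def c_min(a, b):
    return min(a, b)


def c_luk(a, b):
    return max(F(0), a + b - 1)


def c_other(a, b):
    """Conjunction equal to 0 if b <= 1 - a and to b otherwise."""
    return F(0) if b <= 1 - a else b


def i_godel(a, b):
    """Adjoint of min: b if b < a, and 1 otherwise."""
    return b if b < a else F(1)


def i_kleene(a, b):
    return max(1 - a, b)


def i_luk(a, b):
    return min(F(1), 1 - a + b)
```

The pair formed by the minimum and the implication $\max(1 - a, b)$ derived from its dual t-conorm is not adjoint: for $a = b = 1/2$ and $c = 0$, $\min(a, b) \le c$ is false while $b \le \max(1 - a, c)$ is true.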


Fuzzy rough sets. We can now extend the links between rough sets and 
morphological operators derived in Sec. 3.2 to fuzzy rough sets. Using fuzzy 
mathematical morphology operators leads to fuzzy rough sets that have exactly 
the same properties as crisp rough sets, at least for particular t-norms and 
t-conorms (Bloch, 2000b). It turns out that these definitions using fuzzy erosion 
and dilation are generalizations of the ones proposed in Dubois and Prade, 
1990, for $t = \min$ and $T = \max$ in a completely different context, using a 
fuzzy relation $\mu_R$. The equivalence is obtained as in the crisp case by setting: 

$$\mu_R(x, y) = \nu(y - x). \tag{14.57}$$

The interpretation is similar to that of the crisp case: the degree of relation between 
$x$ and $y$ is equal to the degree to which $y - x$ belongs to the structuring element, 
i.e. to the degree to which $y$ belongs to the structuring element translated at $x$. 
This approach has also been used in Nachtegael et al., 2000. 

This extension brings together three different aspects of the information: 
rough sets represent coarseness, fuzzy sets represent vagueness and mathemat- 
ical morphology brings a geometrical, topological and morphological aspect. 
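
A minimal sketch of fuzzy rough sets for $t = \min$ and $T = \max$ is given below, with the fuzzy relation built from a structuring element through Equation (14.57) and $c(a) = 1 - a$. The 1D domain, the structuring element `NU`, the fuzzy set `MU` and the function names are hypothetical choices made for this example.

```python
from fractions import Fraction as F


def relation_from_se(nu, points):
    """Fuzzy relation mu_R(x, y) = nu(y - x)."""
    return {(x, y): nu.get(y - x, F(0)) for x in points for y in points}


def fuzzy_lower(mu, mu_R, points):
    """Lower approximation: inf over y of max(1 - mu_R(x, y), mu(y))."""
    return {x: min(max(1 - mu_R[x, y], mu.get(y, F(0))) for y in points)
            for x in points}


def fuzzy_upper(mu, mu_R, points):
    """Upper approximation: sup over y of min(mu_R(x, y), mu(y))."""
    return {x: max(min(mu_R[x, y], mu.get(y, F(0))) for y in points)
            for x in points}


POINTS = range(5)
NU = {-1: F(1, 2), 0: F(1), 1: F(1, 2)}      # hypothetical structuring element
MU = {1: F(1, 2), 2: F(1), 3: F(1, 4)}       # hypothetical fuzzy set

MU_R = relation_from_se(NU, POINTS)
low = fuzzy_lower(MU, MU_R, POINTS)
up = fuzzy_upper(MU, MU_R, POINTS)
```

Because `NU` equals 1 at the origin, the relation is reflexive and the fuzzy set is bracketed by its two approximations, as in the crisp case.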


3.5 Spatial relationships and spatial reasoning from fuzzy mathematical morphology 


Fuzzy distances derived from fuzzy dilation. The importance of distances 
in spatial reasoning is well established. Their extensions to fuzzy sets 
can be useful for dealing with imprecision and reasoning with semi-qualitative 
(or semi-quantitative) information. Several definitions can be found in the 
literature for distances between fuzzy sets (which is the main problem addressed). 
They can be roughly divided into two classes (see Bloch, 2003a for a review): 
distances that take only membership functions into account and that compare 
them pointwise, and distances that additionally include spatial distances. The 
definitions which combine spatial distance and fuzzy membership comparison 
allow for a more general analysis of structures in space, for applications where 
topological and spatial arrangement of the structures of interest is important 
(such as spatial reasoning). 

Morphological dilations are a convenient tool to define distances in the second
class (Bloch, 1999b). The relations described in Sec. 3.3 express distances
in set theoretical terms, and are therefore easier to translate into fuzzy terms,
while preserving good properties, than the usual analytical expressions. We detail
the examples of the nearest point distance and the Hausdorff distance.


Fuzzy nearest point distance. By translating Equation (14.33), we
define a distance distribution (Rosenfeld, 1985) $\Delta_N(\mu, \mu')(n)$ that expresses
the degree to which the distance between $\mu$ and $\mu'$ is less than n by:


(14.58)  $\Delta_N(\mu, \mu')(n) = f\big[\sup_{x \in S} t[\mu(x), \delta_n(\mu')(x)],\ \sup_{x \in S} t[\mu'(x), \delta_n(\mu)(x)]\big]$,


where $\delta_n$ is a fuzzy dilation of radius n ($\delta_n = (\delta_1)^n$), t is a t-norm, and f
is a symmetrical function. The structuring element used in the dilation $\delta_n$
can simply be a unit ball, or a fuzzy set representing for instance the smallest
sensitive unit in the image, along with the imprecision attached to it. In this
case, it has to have a membership value equal to 1 at the origin, in order to guarantee
extensivity of dilations.

A distance density (Rosenfeld, 1985), i.e. a fuzzy number $D_N(\mu, \mu')(n)$
representing the degree to which the distance between $\mu$ and $\mu'$ is equal to n,
can be obtained implicitly by:


(14.59)  $\Delta_N(\mu, \mu')(n) = \int_0^n D_N(\mu, \mu')(n')\, dn'$.


Clearly, this expression is not very tractable and does not lead to a simple
explicit expression of $D_N(\mu, \mu')(n)$. Therefore, we suggest using an explicit
method, exploiting the other morphological expressions of the nearest point distance
(see Sec. 3.3). The translation of these equivalences provides, for n > 0, the
following distance density:

(14.60)  $D_N(\mu, \mu')(n) = t\big[\sup_{x \in S} t[\mu'(x), \delta_n(\mu)(x)],\ c[\sup_{x \in S} t[\mu'(x), \delta_{n-1}(\mu)(x)]]\big]$,

or a symmetrical expression derived from this one, and:

(14.61)  $D_N(\mu, \mu')(0) = \sup_{x \in S} t[\mu(x), \mu'(x)]$.
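The density (14.60)-(14.61) can be illustrated on a small example. The following is only a minimal sketch under stated assumptions: a 1D grid, t = min, c(a) = 1 - a, a fuzzy dilation taken as sup_y min(nu(x - y), mu(y)), and a crisp three-point unit ball. The function names are hypothetical and do not come from the text.

```python
import numpy as np

def dilate(mu, nu):
    """Fuzzy dilation on a 1D grid: delta(mu)(x) = sup_y min(nu(x - y), mu(y))."""
    out = np.zeros(len(mu))
    for x in range(len(mu)):
        for offset, weight in nu.items():
            y = x - offset
            if 0 <= y < len(mu):
                out[x] = max(out[x], min(weight, mu[y]))
    return out

def dilate_n(mu, nu, n):
    """delta_n = (delta_1)^n: apply the unit dilation n times."""
    for _ in range(n):
        mu = dilate(mu, nu)
    return mu

def overlap(a, b):
    """sup_x t[a(x), b(x)] with t = min."""
    return float(np.max(np.minimum(a, b)))

def nearest_point_density(mu, mu2, nu, n):
    """D_N(mu, mu2)(n), with t = min and c(a) = 1 - a (assumed choices)."""
    if n == 0:
        return overlap(mu, mu2)                              # (14.61)
    reached = overlap(mu2, dilate_n(mu, nu, n))
    reached_before = overlap(mu2, dilate_n(mu, nu, n - 1))
    return min(reached, 1.0 - reached_before)                # (14.60)

unit_ball = {-1: 1.0, 0: 1.0, 1: 1.0}   # value 1 at the origin
mu = np.array([0, 0, 1.0, 0.5, 0, 0, 0, 0])
mu2 = np.array([0, 0, 0, 0, 0, 0.5, 1.0, 0])
density = [nearest_point_density(mu, mu2, unit_ball, n) for n in range(6)]
```

With these two fuzzy sets the density is spread over several values of n, whereas two crisp singletons give a density concentrated on their usual distance.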


Fuzzy Hausdorff distance. From Equation (14.37), a distance distribution
can be defined, by introducing fuzzy dilation:

(14.62)  $\Delta_H(\mu, \mu')(n) = t\big[\inf_{x \in S} T[\delta_n(\mu)(x), c(\mu'(x))],\ \inf_{x \in S} T[\delta_n(\mu')(x), c(\mu(x))]\big]$,

where c is a complementation, t a t-norm and T a t-conorm. A distance density
can be derived implicitly from this distance distribution.

A direct definition of a distance density can be obtained from the expression
of $H_d(X, Y) = n$ (see Sec. 3.3). Translating this expression leads to a definition
of the Hausdorff distance between two fuzzy sets $\mu$ and $\mu'$ as a fuzzy number:


(14.63)  $H_d(\mu, \mu')(0) = t\big[\inf_{x \in S} T[\mu(x), c(\mu'(x))],\ \inf_{x \in S} T[\mu'(x), c(\mu(x))]\big]$,


(14.64)  $H_d(\mu, \mu')(n) = t\big[\inf_{x \in S} T[\delta_n(\mu)(x), c(\mu'(x))],\ \inf_{x \in S} T[\delta_n(\mu')(x), c(\mu(x))],$
         $T\big(\sup_{x \in S} t[\mu(x), c(\delta_{n-1}(\mu')(x))],\ \sup_{x \in S} t[\mu'(x), c(\delta_{n-1}(\mu)(x))]\big)\big]$.
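As an illustration of (14.63)-(14.64), here is a minimal sketch under assumed choices: a 1D grid, t = min, T = max, c(a) = 1 - a, and a crisp three-point unit ball for the dilation. The helper names are hypothetical.

```python
import numpy as np

def dilate(mu):
    """Dilation by the crisp unit ball {-1, 0, 1} on a 1D grid (zero outside the grid)."""
    padded = np.pad(mu, 1)
    return np.maximum.reduce([padded[:-2], padded[1:-1], padded[2:]])

def dilate_n(mu, n):
    for _ in range(n):
        mu = dilate(mu)
    return mu

def inclusion(a, b):
    """Degree to which b is included in a: inf_x T[a(x), c(b(x))] with T = max."""
    return float(np.min(np.maximum(a, 1.0 - b)))

def non_inclusion(a, b):
    """Degree to which a is not included in b: sup_x t[a(x), c(b(x))] with t = min."""
    return float(np.max(np.minimum(a, 1.0 - b)))

def hausdorff_density(mu, mu2, n):
    if n == 0:                                               # (14.63)
        return min(inclusion(mu, mu2), inclusion(mu2, mu))
    within_n = min(inclusion(dilate_n(mu, n), mu2),
                   inclusion(dilate_n(mu2, n), mu))
    beyond_n_minus_1 = max(non_inclusion(mu, dilate_n(mu2, n - 1)),
                           non_inclusion(mu2, dilate_n(mu, n - 1)))
    return min(within_n, beyond_n_minus_1)                   # (14.64)

a = np.zeros(8); a[2] = 1.0
b = np.zeros(8); b[5] = 1.0
crisp_density = [hausdorff_density(a, b, n) for n in range(6)]
```

For crisp singletons the fuzzy number reduces to the usual Hausdorff distance (here 3). With T = max, a non-crisp set compared to itself gives a degree at 0 that is strictly less than 1.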


Properties. These definitions of fuzzy nearest point and Hausdorff
distances (defined as fuzzy numbers) between two fuzzy sets do not necessarily
share the same properties as their crisp equivalents. All distances are positive, in
the sense that the defined fuzzy numbers always have a support included in $\mathbb{R}^+$.
By construction, all defined distances are symmetrical with respect to $\mu$ and $\mu'$.
The separability property (i.e. $d(\mu, \nu) = 0 \Leftrightarrow \mu = \nu$) is not always satisfied.
However, if $\mu$ is normalized (i.e. $\exists x,\ \mu(x) = 1$), we have for the nearest point
distance $D_N(\mu, \mu)(0) = 1$ and $D_N(\mu, \mu)(n) = 0$ for n > 1. For the Hausdorff
distance, $H_d(\mu, \mu')(0) = 1$ implies $\mu = \mu'$ for the Lukasiewicz t-conorm, while it
implies that $\mu$ and $\mu'$ are crisp and equal for T = max. The triangular inequality
is also not satisfied in general.





Fuzzy adjacency from fuzzy dilation and set operations. Adjacency
is of great interest in image processing, pattern recognition and spatial reasoning
(Rosenfeld and Kak, 1976). A crisp definition of adjacency between crisp
objects often leads to low robustness, since whether two objects are
adjacent or not may depend on one point only.

In order to account for possible errors or imprecisions, the framework of
fuzzy sets is very useful. Two completely different ways of representing
imprecision can be considered. In the first one, the satisfaction of the adjacency
property between two objects is considered to be a matter of degree; this can
be more appropriate than a binary index (Rosenfeld, 1979; Rosenfeld, 1984).
The second one consists in introducing imprecision in the objects themselves,
and in dealing with fuzzy objects. Adjacency is then obviously also a matter of
degree. Only the second way is addressed here. More details can be found in
Bloch et al., 1997.

Adjacency is defined using fuzzy dilation, by translating Equation (14.40)
into fuzzy terms. The degree of adjacency between $\mu$ and $\nu$ involving fuzzy
dilation is then:


(14.65)  $\mu_{adj}(\mu, \nu) = t\big[\mu_{\neg int}(\mu, \nu),\ \mu_{int}[\delta_{B_c}(\mu), \nu],\ \mu_{int}[\delta_{B_c}(\nu), \mu]\big]$.


This definition represents a conjunctive combination of a degree of
non-intersection $\mu_{\neg int}$ between $\mu$ and $\nu$ and a degree of intersection $\mu_{int}$ between
one fuzzy set and the dilation of the other. The degree of intersection can be
defined using a supremum of a t-norm (as for fuzzy dilation):


(14.66)  $\mu_{int}(\mu, \nu) = \sup_{x \in S} t[\mu(x), \nu(x)]$,


or using the normalized fuzzy surface (or volume) of $t(\mu, \nu)$. The degree of
non-intersection is simply defined by $\mu_{\neg int} = 1 - \mu_{int}$. $B_c$ can be taken as
the elementary structuring element related to the considered connectivity, or as
a fuzzy structuring element, representing for instance spatial imprecision (i.e.
the possibility distribution of the location of each point).

This degree of adjacency (with any structuring element) is symmetrical, 
consistent with the binary case and decreases when the distance between both 
fuzzy sets increases. 
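The degree of adjacency (14.65) with the degree of intersection (14.66) can be sketched as follows. This is only an illustration under assumed choices: a 1D grid, t = min, and the crisp three-point structuring element as the elementary connectivity. The function names are hypothetical.

```python
import numpy as np

def dilate(mu):
    """Dilation by the crisp elementary structuring element {-1, 0, 1} in 1D."""
    padded = np.pad(mu, 1)
    return np.maximum.reduce([padded[:-2], padded[1:-1], padded[2:]])

def degree_of_intersection(a, b):
    """(14.66) with t = min: sup_x min(a(x), b(x))."""
    return float(np.max(np.minimum(a, b)))

def degree_of_adjacency(mu, nu):
    """(14.65) with t = min: no intersection, but each set meets the dilation of the other."""
    not_intersecting = 1.0 - degree_of_intersection(mu, nu)
    return min(not_intersecting,
               degree_of_intersection(dilate(mu), nu),
               degree_of_intersection(dilate(nu), mu))

left = np.array([1.0, 1.0, 1.0, 0, 0, 0, 0, 0])
touching = np.array([0, 0, 0, 1.0, 1.0, 0, 0, 0])
distant = np.array([0, 0, 0, 0, 1.0, 1.0, 0, 0])
fuzzy_left = np.array([1.0, 1.0, 0.6, 0, 0, 0, 0, 0])
fuzzy_right = np.array([0, 0, 0.3, 1.0, 1.0, 0, 0, 0])

adjacent = degree_of_adjacency(left, touching)
separated = degree_of_adjacency(left, distant)
graded = degree_of_adjacency(fuzzy_left, fuzzy_right)
```

The crisp cases reproduce the binary definition (1 for touching sets, 0 for separated ones), while the fuzzy pair receives an intermediate degree.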


Fuzzy directional relative position from conditional fuzzy dilation. Re- 
lationships between objects can be partly described in terms of relative position, 
like “to the left of”. Since such concepts are rather ambiguous, although human 
beings have an intuitive and common way of understanding and interpreting 
them, they may find a better modeling in the framework of fuzzy sets as fuzzy 
relationships. This framework makes it possible to propose flexible definitions 
which fit the intuition and may include subjective aspects, depending on the 
application and on the requirements of the user. Almost all existing methods for 
defining fuzzy relative directional spatial position (see Bloch and Ralescu, 2003 
for a review) rely on angle measurements between points of the two objects of 
interest (Krishnapuram et al., 1993; Miyajima and Ralescu, 1994; Keller and 
Wang, 1995; Matsakis and Wendling, 1999), and concern 2D objects (some- 
times with possible extension to 3D). 

Another approach was proposed in Bloch, 1999a. It is based on fuzzy dilation 
and consists of two steps: 
1 We first define a fuzzy “landscape” around the reference object R as a
fuzzy set such that the membership value of each point corresponds to
the degree of satisfaction of the spatial relation under examination. This
fuzzy region is defined by a fuzzy dilation of the reference object by a
fuzzy structuring element expressing the direction of interest $\vec{u}$ along with
its imprecision. For instance, the structuring element $\nu$ can be defined as:

(14.67)  $\forall P \in S,\ \nu(P) = f\left(\arccos \frac{\vec{OP} \cdot \vec{u}}{\|\vec{OP}\|}\right)$ and $\nu(O) = 1$,


where O is the center of the structuring element and f is a decreasing
function of $[0, \pi]$ into [0, 1]. An example is shown in Fig. 14.14. A fast
algorithm for computing this fuzzy dilation is described in Bloch, 1999a.




Figure 14.14. Fuzzy structuring element representing the relation “to the right of”, a fuzzy 
reference object, and its dilation representing the region to the right of it (high grey values 
represent high membership values). 





2 We then compare an object A to the fuzzy landscape attached to R, in
order to evaluate how well this object matches the areas having
high membership values (i.e. areas that are in the desired direction). This
is done using a fuzzy pattern matching approach (Dubois et al., 1988),
which provides an evaluation as an interval instead of one number only.
This is a major difference with respect to all the previous approaches
and, in our opinion, it provides richer information about the considered
relationship.


This definition is invariant with respect to translation, rotation and scaling,
for 2D and 3D objects (crisp and fuzzy). When the distance between the objects
increases, the objects are seen as points. The value of their relative position
can then be predicted only from the direction of interest and the direction in which
one object moves away from the reference object. Therefore the shape of the
objects no longer plays any role in the assessment of their relative position.
Finally, the behavior of the definition in cases where the reference object has
strong concavities corresponds to what can be intuitively expected.
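The two steps above can be sketched on a small 2D grid. This is a brute-force illustration and not the fast algorithm of Bloch, 1999a. The following choices are assumptions made here: t = min, the decreasing function f(theta) = max(0, 1 - 2 theta / pi), and a necessity/possibility pair as one reading of the fuzzy pattern matching evaluation. All names are hypothetical.

```python
import numpy as np

def directional_landscape(ref, u):
    """Fuzzy dilation of ref by nu(P) = f(angle(OP, u)), computed by brute force.

    ref: 2D array of membership values; u: unit vector (x, y) of the direction of interest.
    """
    height, width = ref.shape
    ys, xs = np.nonzero(ref > 0)
    out = np.zeros((height, width))
    for py in range(height):
        for px in range(width):
            best = 0.0
            for qy, qx in zip(ys, xs):
                v = np.array([px - qx, py - qy], dtype=float)
                norm = np.linalg.norm(v)
                if norm == 0:
                    nu = 1.0                                  # nu(O) = 1
                else:
                    cos = np.clip(v @ u / norm, -1.0, 1.0)
                    nu = max(0.0, 1.0 - 2.0 * np.arccos(cos) / np.pi)
                best = max(best, min(ref[qy, qx], nu))
            out[py, px] = best
    return out

def evaluate(landscape, obj):
    """Interval [necessity, possibility] of obj matching the landscape (assumed form)."""
    necessity = float(np.min(np.maximum(landscape, 1.0 - obj)))
    possibility = float(np.max(np.minimum(landscape, obj)))
    return necessity, possibility

ref = np.zeros((5, 5)); ref[2, 2] = 1.0
right_of_ref = directional_landscape(ref, np.array([1.0, 0.0]))

obj_right = np.zeros((5, 5)); obj_right[2, 4] = 1.0
obj_left = np.zeros((5, 5)); obj_left[2, 0] = 1.0
interval_right = evaluate(right_of_ref, obj_right)
interval_left = evaluate(right_of_ref, obj_left)
```

An object lying exactly in the direction of interest obtains a high interval, an object in the opposite direction a low one; objects spread over several directions would give a wider interval.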
Reasoning on spatial relationships. Now, we address the second important
issue in spatial reasoning. This includes fusion, since heterogeneous
information often has to be combined in spatial reasoning, decision making and
recognition (with a special focus on model-based recognition). Inference and
logical reasoning are addressed in Sec. 4.


Fusion. Spatial reasoning aiming for instance at recognizing structures 
in an image has to deal with the combination of knowledge and information 
represented and modeled as described above. Usually, to achieve recogni- 
tion, several spatial relationships to one or several spatial entities have to be 
combined, as well as information extracted from the image itself. For this 
combination step, the advantages of fuzzy sets lie in the variety of combina- 
tion operators, which may deal with heterogeneous information expressed in 
a semi-quantitative framework (Dubois and Prade, 1985; Yager, 1991; Dubois 
et al., 1999). A classification of these operators with respect to their behavior 
(in terms of conjunctive, disjunctive, and compromise), the possible control 
of this behavior, their properties and their decisiveness proved to be useful for 
choosing an operator (Bloch, 1996). 

Let us give a few examples. If we have different constraints about an object 
(for instance concerning the relations it should have with respect to another 
object) which have all to be satisfied, these constraints can be combined using 
a t-norm (a conjunction). This is typically the case when an object is described 
using relations to several objects or several relations of different types to the 
same object. If one object has to satisfy one relation or another one then a 
disjunction represented by a t-conorm has to be used. This occurs for instance 
when two symmetrical structures with respect to the reference object can be 
found. Mean operators can be used to combine several estimations and try to 
find a compromise between them. 
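These three behaviours can be sketched with standard operators. The particular operators (min and Lukasiewicz t-norms, max t-conorm, arithmetic mean) and the numerical degrees are illustrative assumptions, not choices prescribed by the text.

```python
from functools import reduce

def t_norm_min(a, b):
    return min(a, b)

def t_norm_lukasiewicz(a, b):
    return max(0.0, a + b - 1.0)

def t_conorm_max(a, b):
    return max(a, b)

def fuse(degrees, operator):
    """Combine degrees of satisfaction with an associative binary operator."""
    return reduce(operator, degrees)

def mean(degrees):
    return sum(degrees) / len(degrees)

# Degrees to which a candidate region satisfies three spatial constraints.
constraints = [0.75, 0.5, 1.0]

all_required = fuse(constraints, t_norm_min)           # conjunction
strict = fuse(constraints, t_norm_lukasiewicz)         # more severe conjunction
either_side = fuse(constraints[:2], t_conorm_max)      # disjunction of two alternatives
compromise = mean(constraints)                         # compromise behaviour
```

Any t-norm lies below min and any t-conorm above max, so the choice of operator controls how severe or how indulgent the combination is.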


Decision making and recognition. Let us now consider the introduc- 
tion of fusion in model-based recognition procedures. We summarize here two 
distinct approaches. A first recognition approach, called global, uses the first
type of question (1) raised at the beginning of Sec. 3.3 (define the degree to
which a relation is satisfied between two given objects). The idea is to represent
all available knowledge about the objects to be recognized. A typical example 
consists of graph-based representations. The model is then represented as a 
graph where nodes are objects and edges represent links between these objects. 
Both nodes and edges are attributed. Node attributes are characteristics of the 
objects, while edge attributes quantify spatial relationships between the objects. 
A data graph is then constructed from each image where the recognition has 
to be performed. Each region of the image (obtained after some processing) 
constitutes a node of this data graph, and edges represent spatial relationships 
between regions, as for the model graph. The comparison between representations
is performed through the computation of similarities between model
graph attributes and data graph attributes. The fusion mainly takes place at
this level, in order to combine the similarity values for different relationships.
The fusion results constitute an objective function to be optimized by a match- 
ing procedure. This approach can benefit from the huge literature on fuzzy 
comparison tools (see e.g. Bouchon-Meunier et al., 1996) and from recent de- 
velopments on fuzzy morphisms (Perchant and Bloch, 2002). It has been used 
in facial feature recognition based on a rough model of a face (Cesar et al., 
2002) and brain structure recognition based on an anatomical atlas (Perchant 
et al., 1999; Bengoetxea et al., 2002). 

A second type of approach relies on the second type of question (2) raised 
at the beginning of Sec. 3.3 (define the area of space in which a relation to a 
given reference object is satisfied), and is called here progressive. In such a 
progressive approach, objects are recognized sequentially and their recognition 
makes use of knowledge about their relations with respect to other objects. 
Relations with respect to previously obtained objects can be combined at two 
different levels of the procedure. First, fusion can occur in the spatial domain, 
using spatial fuzzy sets (Bloch et al., 2003). The result of this fusion makes it
possible to build a fuzzy region of interest in which the search for a new object will take
place, in a process similar to focusing attention. In a sequential procedure,
the amount of available spatial relations increases with the number of processed 
objects. Therefore, the recognition of the most difficult structures, usually 
treated in the last steps, will be focused in a more restricted area. This approach 
has been used in medical imaging (Bloch et al., 2003; Colliot et al., 2004), 
as well as in mobile robotics to reason about the spatial position of the robot 
and the structure of its environment (Bloch and Saffiotti, 2002). Another fusion 
level occurs during the final decision step, i.e. segmentation and recognition of a 
structure. For this purpose, it was suggested in Colliot et al., 2004 to introduce 
relations in the evolution scheme of a deformable model, in which they are 
fused with other types of numerical information, usually edge and regularity 
constraints. 


4. Logics 


In this section, we explain how mathematical morphology relates to logics 
of space. As seen in Sec. 3.3, mathematical morphology can be considered as 
a spatial reasoning tool, with its two components: spatial knowledge represen- 
tation and reasoning. Now we go one step further about the reasoning aspect, 
and we show how morphological operators can be applied on logical formulas 
(Sec. 4.1), and can be used to define a modal logic (Sec. 4.2). This leads to
qualitative representation of spatial relationships (Sec. 4.3), thus enhancing the
power of logical reasoning with morphological aspects.


4.1 Morphology and propositional logics 


We first consider the framework of propositional logics (note that this sub- 
section is partly reproduced from Bloch and Lang, 2000). 

In the knowledge representation community, propositional formulas are used
to encode either pieces of knowledge (which may be generic, for instance
integrity constraints, or factual) or “preference items” (such as opinions, desires
or goals), and are then used for complex reasoning or decision making tasks.
These tasks often make use of operations on propositional formulas which are
very similar to those considered in mathematical morphology. We give a
(non-exhaustive) list of examples:


■ belief revision (as shown by Katsuno and Mendelzon, 1991) consists of
the following operation: let $\varphi$ and $\psi$ be two propositional formulas. The
models of the revision $\varphi \circ \psi$ of $\varphi$ by $\psi$ are the models of $\psi$ which are
closest (with respect to a given distance) to a model of $\varphi$. Intuitively,
using the language of mathematical morphology, it means that $\varphi$ has to
be dilated enough to intersect with some models of $\psi$. Belief update
(Katsuno and Mendelzon, 1991) proceeds with the same kind of dilation
but on each individual model of $\varphi$ and then takes the union of all obtained
sets of models.


■ belief merging (Konieczny and Pino-Pérez, 1998) consists in finding the
best compromise between a finite set of formulas $\varphi_1, \ldots, \varphi_n$, which
amounts to selecting the models which minimize the aggregation (using
some given operator) of the distances to each of the $\varphi_i$’s. This amounts
intuitively to dilating simultaneously all the $\varphi_i$’s until they intersect. Similar
operations are at work for the aggregation of preferences in group decision
making as proposed in Lafage and Lang, 2000.


■ one of the tasks involved in similarity-based reasoning (Esteva et al.,
1997; Dubois et al., 1997) consists in determining if a formula $\varphi$
approximately entails a formula $\psi$ by looking to what extent $\psi$ has to be
extended so as to contain all models of $\varphi$, which again corresponds to a
dilation (and to the directed Hausdorff distance).


■ reasoning with supermodels (Ginsberg et al., 1998) uses models of a
formula $\varphi$ which are robust enough to resist some perturbations. In some
cases, obtaining supermodels consists in eroding the formula so as to
be far enough from the countermodels of $\varphi$. Again this corresponds to
a classical operation of mathematical morphology (erosion). Another
close notion, evoked by Lafage and Lang, 2000, is the search for the
most representative worlds of a formula.


■ in abductive reasoning (Pino-Pérez and Uzcátegui, 1999), preferred
explanations of a formula are defined based on a set of axioms, several
of which are close to properties of morphological operators. Erosion
appears as a useful tool in this context (Bloch et al., 2001; Bloch et al.,
2004).
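The first item of this list, revision as a dilation that grows until it meets the new information, can be sketched as follows. This is a minimal sketch assuming that formulas are identified with their sets of models, that worlds are tuples of truth values, and that the distance is the Hamming distance; the names are hypothetical.

```python
from itertools import product

def hamming(w1, w2):
    """Number of propositional symbols instantiated differently in two worlds."""
    return sum(a != b for a, b in zip(w1, w2))

def dilate(models, n, worlds):
    """Worlds lying at Hamming distance at most n from some model."""
    return {w for w in worlds if any(hamming(w, m) <= n for m in models)}

def revise(phi, psi, worlds):
    """Models of psi closest to phi: dilate phi until it intersects psi."""
    if not phi or not psi:
        return set(psi)          # nothing to compare with, keep the new information
    n = 0
    while not (dilate(phi, n, worlds) & psi):
        n += 1
    return dilate(phi, n, worlds) & psi

worlds = list(product((0, 1), repeat=2))      # worlds over symbols (a, b)
a_and_b = {(1, 1)}
not_a = {(0, 0), (0, 1)}
revised = revise(a_and_b, not_a, worlds)
```

Here the belief "a and b" revised by "not a" keeps b true, because the world (0, 1) is the model of "not a" nearest to (1, 1).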


In this section, we investigate how and why mathematical morphology can 
be applied on logical formulas. First we note that the fact that a propositional 
formula can be equivalently defined by the set of its models enables us to ap- 
ply easily all (set-theoretic) definitions of mathematical morphology to logical 
objects (worlds, formulas). This will lead us not only to rewriting well-known 
logical operations used for reasoning or decision making, but also to design- 
ing new kinds of logical objects or notions by transposing basic morphologi- 
cal operations to propositional logic. One may view morphological operators 
as transformations applied on formulas, leading to reasoning or decision 
making tools. 


Basic logical concepts. Let PS be a finite set of propositional symbols. The
set of formulas (generated by PS and the usual connectives) is denoted by $\Phi$.
Well-formed formulas will be denoted by Greek letters $\varphi, \psi, \ldots$ Interpretations
will be denoted by $\omega, \omega'$ and the set of all interpretations for $\Phi$ by $\Omega$.
$\mathrm{Mod}(\varphi) = \{\omega \in \Omega \mid \omega \models \varphi\}$ is the set of all models of $\varphi$ (i.e. all interpretations for which
$\varphi$ is true).

The underlying idea for constructing morphological operations on logical
formulas is to consider formulas and interpretations from a set theoretical
perspective. Since $\Phi$ is isomorphic to $2^\Omega$ up to logical equivalence, i.e., knowing a formula is
equivalent to knowing the set of its models (and conversely, any set of models
corresponds to a formula), we can identify $\varphi$ with the set of its models
$\mathrm{Mod}(\varphi)$, and then apply set-theoretic morphological operations. We recall
that $\mathrm{Mod}(\varphi \vee \psi) = \mathrm{Mod}(\varphi) \cup \mathrm{Mod}(\psi)$, $\mathrm{Mod}(\varphi \wedge \psi) = \mathrm{Mod}(\varphi) \cap \mathrm{Mod}(\psi)$,
$\mathrm{Mod}(\varphi) \subseteq \mathrm{Mod}(\psi)$ iff $\varphi \models \psi$, and $\varphi$ is consistent iff $\mathrm{Mod}(\varphi) \neq \emptyset$.


Dilation and erosion of a formula. Using the previous equivalences, we
propose to define dilation and erosion of a formula as follows:


(14.68)  $\mathrm{Mod}(\delta_B(\varphi)) = \{\omega \in \Omega \mid \check{B}_\omega \wedge \varphi \text{ consistent}\}$,


(14.69)  $\mathrm{Mod}(\varepsilon_B(\varphi)) = \{\omega \in \Omega \mid B_\omega \models \varphi\}$.


In these equations, the structuring element B represents a relation between
worlds, i.e. $\omega' \in B_\omega$ iff $\omega'$ satisfies some relationship with $\omega$, and $\check{B}_\omega$ is defined
by $\omega' \in \check{B}_\omega \Leftrightarrow \omega \in B_{\omega'}$. The condition in Equation (14.68) expresses that
the set of worlds in relation to $\omega$ should be consistent with $\varphi$. The condition in
Equation (14.69) expresses that all worlds in relation to $\omega$ should be models of $\varphi$.


Structuring element. There are several possible ways to define structuring
elements or more generally binary relations between worlds in a context
of formulas. We suggest a few here. The relation between worlds defines
a “neighbourhood” of worlds (equivalent to the neighbourhood function
in Sec. 2.3). If it is symmetrical, it leads to symmetrical structuring elements.
If it is reflexive, it leads to structuring elements such that $\omega \in B_\omega$, which leads
to interesting properties, as will be seen later. For instance, this relationship can
be an accessibility relation as in normal modal logics (Hughes and Cresswell,
1968). An interesting way to choose the relationship is to base it on distances
between worlds. This makes it possible to define sequences of increasing structuring
elements defined as the balls of a distance. From any distance d between worlds,
a distance from a world to a formula is derived as a distance from a point to a
set: $d(\omega, \varphi) = \min_{\omega' \models \varphi} d(\omega, \omega')$.

The most commonly used distance between worlds in knowledge
representation, especially in belief revision (Dalal, 1988), belief update
(Katsuno and Mendelzon, 1991), merging (Konieczny and Pino-Pérez, 1998) or
preference representation (Lafage and Lang, 2000), is the Hamming distance
$d_H$, where $d_H(\omega, \omega')$ is the number of propositional symbols that are instantiated
differently in both worlds. By default, we take d to be $d_H$.

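Then dilation and erosion of size n are defined from Eqs. (14.68, 14.69) by
using the distance balls of radius n as structuring elements:


(14.70)  $\mathrm{Mod}(\delta_n(\varphi)) = \{\omega \mid \exists \omega',\ \omega' \models \varphi \text{ and } d_H(\omega, \omega') \leq n\} = \{\omega \mid d_H(\omega, \varphi) \leq n\}$,

(14.71)  $\mathrm{Mod}(\varepsilon_n(\varphi)) = \{\omega \mid \forall \omega',\ d_H(\omega, \omega') \leq n \Rightarrow \omega' \models \varphi\} = \{\omega \mid d_H(\omega, \neg\varphi) > n\}$.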

From operations with the unit ball we define the external (respectively internal)
boundary of $\varphi$ as $\delta_1(\varphi) \wedge \neg\varphi$ (respectively $\varphi \wedge \neg\varepsilon_1(\varphi)$), corresponding to
the worlds that are exactly at distance 1 from $\varphi$ (resp. from $\neg\varphi$).
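Equations (14.70) and (14.71) and the two boundaries can be illustrated on three propositional symbols. The sketch below assumes that a formula is represented by its set of models and that worlds are tuples of 0/1 values; the helper names are hypothetical.

```python
from itertools import product

SYMBOLS = ("a", "b", "c")
WORLDS = set(product((0, 1), repeat=len(SYMBOLS)))

def models(predicate):
    """Set of worlds satisfying a predicate given as a Python function of a valuation."""
    return {w for w in WORLDS if predicate(dict(zip(SYMBOLS, w)))}

def hamming(w1, w2):
    return sum(x != y for x, y in zip(w1, w2))

def distance(world, phi):
    """Distance from a world to a (consistent) formula: min over its models."""
    return min(hamming(world, m) for m in phi)

def dilation(phi, n):
    """(14.70): worlds at distance at most n from phi."""
    return {w for w in WORLDS if phi and distance(w, phi) <= n}

def erosion(phi, n):
    """(14.71): worlds at distance greater than n from the countermodels of phi."""
    countermodels = WORLDS - phi
    return {w for w in WORLDS if not countermodels or distance(w, countermodels) > n}

a_or_b = models(lambda v: v["a"] or v["b"])
a_and_b = models(lambda v: v["a"] and v["b"])

external_boundary = dilation(a_and_b, 1) - a_and_b
internal_boundary = a_or_b - erosion(a_or_b, 1)
```

In this example eroding "a or b" by the unit ball gives "a and b", dilating "a and b" gives back "a or b", and both boundaries consist of the worlds where exactly one of a, b is true.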


Properties. The main properties of dilation and erosion, which are satis- 
fied in mathematical morphology on sets, hold also in the logical 
setting proposed here, since the algebraic frameworks are the same up to an 
isomorphism. 
Monotonicity: Both operators are increasing with respect to $\varphi$, i.e. if $\varphi \models \psi$,
then $\delta_B(\varphi) \models \delta_B(\psi)$ and $\varepsilon_B(\varphi) \models \varepsilon_B(\psi)$, for any relation B. Dilation
is increasing with respect to the relation, while erosion is decreasing, i.e.
if $\forall \omega \in \Omega,\ B_\omega \subseteq B'_\omega$, then $\delta_B(\varphi) \models \delta_{B'}(\varphi)$ and $\varepsilon_{B'}(\varphi) \models \varepsilon_B(\varphi)$.


Extensivity: Dilation is extensive ($\varphi \models \delta_B(\varphi)$) if B is derived from a reflexive
relation (as is the case for distance based dilation, since if $\omega \models \varphi$, then
$d_H(\omega, \varphi) = 0$), and erosion is anti-extensive ($\varepsilon_B(\varphi) \models \varphi$) under the
same conditions.


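Iteration: Dilation and erosion satisfy an iteration property. For instance for
distance based operations, we have:


$\delta_{n+n'}(\varphi) = \delta_{n'}[\delta_n(\varphi)] = \delta_n[\delta_{n'}(\varphi)]$,
$\varepsilon_{n+n'}(\varphi) = \varepsilon_{n'}[\varepsilon_n(\varphi)] = \varepsilon_n[\varepsilon_{n'}(\varphi)]$.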


Commutativity with union or intersection: Dilation commutes with union or
disjunction: for any family $\varphi_1, \ldots, \varphi_m$ of formulas, we have:
$\delta_B(\bigvee_{i=1}^{m} \varphi_i) = \bigvee_{i=1}^{m} \delta_B(\varphi_i)$. Erosion on the other hand commutes with intersection
or conjunction.

In general dilation (resp. erosion) does not commute with intersection
(resp. union), and only an inclusion relation holds: $\delta_B(\varphi \wedge \psi) \models \delta_B(\varphi) \wedge \delta_B(\psi)$.


Adjunction relation: $(\varepsilon_B, \delta_B)$ is an adjunction (moreover, if two operators
form an adjunction, they are an erosion and a dilation respectively), i.e.


$\delta_B(\psi) \models \varphi$ iff $\psi \models \varepsilon_B(\varphi)$.


Duality: Dilation and erosion (respectively opening and closing) are dual
operators with respect to the negation: $\varepsilon_B(\varphi) = \neg\delta_B(\neg\varphi)$, which makes it
possible to deduce properties of an operator from those of its dual operator.


Relations to distances: Equation (14.70) is an example of how to derive a
dilation from a distance. Conversely, we have: $d_H(\omega, \varphi) = \min\{n \in \mathbb{N} \mid \omega \models \delta_n(\varphi)\}$.
Distances between formulas can also be derived
from dilation, as minimum distance and Hausdorff distance. For instance
the minimum distance (i.e. nearest world distance) is expressed
as: $d_N(\varphi, \psi) = \min_{\omega \models \varphi,\ \omega' \models \psi} d_H(\omega, \omega') = \min\{n \in \mathbb{N} \mid \delta_n(\varphi) \wedge \psi \neq \emptyset \text{ and } \delta_n(\psi) \wedge \varphi \neq \emptyset\}$.
This means that the minimum distance is attained
for the minimum size of dilation of each formula such that it becomes
consistent with the other.
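Several of these properties can be checked exhaustively on a small language. The sketch below enumerates all formulas over three symbols (as sets of models) and tests extensivity, iteration, adjunction and duality for the Hamming unit ball, which is reflexive and symmetrical. It is a brute-force check under these assumptions, not a proof, and the names are hypothetical.

```python
from itertools import combinations, product

worlds = list(product((0, 1), repeat=3))
W = frozenset(worlds)

def hamming(w1, w2):
    return sum(a != b for a, b in zip(w1, w2))

def dil(phi, n):
    return frozenset(w for w in worlds if any(hamming(w, m) <= n for m in phi))

def ero(phi, n):
    return frozenset(w for w in worlds
                     if all(m in phi for m in worlds if hamming(w, m) <= n))

# Every formula, up to logical equivalence, is one subset of the 8 worlds.
formulas = [frozenset(c) for r in range(len(worlds) + 1)
            for c in combinations(worlds, r)]
d1 = {f: dil(f, 1) for f in formulas}
e1 = {f: ero(f, 1) for f in formulas}

extensivity = all(e1[f] <= f <= d1[f] for f in formulas)
iteration = all(dil(f, 2) == d1[d1[f]] and ero(f, 2) == e1[e1[f]] for f in formulas)
adjunction = all((d1[psi] <= phi) == (psi <= e1[phi])
                 for phi in formulas for psi in formulas)
duality = all(e1[f] == W - d1[W - f] for f in formulas)
```

Entailment between formulas is tested here as inclusion between their sets of models, following the identification recalled in the basic logical concepts.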





Opening and closing are defined classically by composition and have the
same properties as the corresponding operators on sets. Filters, as described
in Sec. 2.4, can be applied on formulas as well, for instance for approximation
and simplification purposes.

All these definitions and properties allow us to formalize problems of fusion, 
revision, abduction mentioned at the beginning of this subsection in morpho- 
logical terms. 


4.2 Morphological modal logic 


When looking at the algebraic properties of mathematical morphology operators
on the one hand, and of modal logic operators on the other hand, several
similarities can be shown, and suggest that links between both theories are worth
investigating. A pair of modal operators $(\Box, \Diamond)$ is defined in Bloch, 2002,
as morphological erosion and dilation. This section summarizes this approach.

Until now mathematical morphology has been used mainly for quantitative 
and semi-quantitative (or semi-qualitative) representations of spatial relations. 
For qualitative spatial reasoning, several symbolic approaches have been devel- 
oped, but mathematical morphology has not been widely used in this context. 
In Bloch, 2002, it was shown how modal operators based on morphological 
operators can be used for symbolic representations of spatial relations. 

In a similar way as in Jeansoulin and Mathieu, 1994, the modal operators 
are used here for representing spatial relationships, and classical predicates 
represent the semantic part of the information. While inclusion and adjacency 
are considered in Jeansoulin and Mathieu, 1994, we consider here more spatial 
relationships, including metric ones, and model all of them using mathematical 
morphology. 














Lattice structure. We use the same notations as in Sec. 4.1. We use
standard Kripke semantics and denote by M a model composed of a set of
worlds $\Omega$, a binary relation R between worlds and a truth valuation. Considering
the inclusion relation on $2^\Omega$, $(2^\Omega, \subseteq)$ is a Boolean complete lattice. Similarly
a lattice (which is isomorphic to $2^\Omega$) is defined on $\Phi_\equiv$, where $\Phi_\equiv$ denotes the
quotient space of $\Phi$ by the equivalence relation between formulas (with the
equivalence defined as $\varphi \equiv \psi$ iff $\mathrm{Mod}(\varphi) = \mathrm{Mod}(\psi)$). In the following, this
is implicitly assumed, and we simply use the notation $\Phi$. Any subset $\{\varphi_i\}$ of $\Phi$
has a supremum $\bigvee_i \varphi_i$, and an infimum $\bigwedge_i \varphi_i$ (corresponding respectively to
union and intersection in $2^\Omega$). The greatest element is $\top$ and the smallest one is
$\bot$ (corresponding respectively to $\Omega$ and $\emptyset$). This lattice structure is important
from the algebraic point of view of mathematical morphology. Indeed, it is the
fundamental structure on which adjunctions and morphological operators can
be defined.

A canonical formula $\varphi_\omega$ associated with a world $\omega$ is defined by:


(14.72)  $\mathrm{Mod}(\varphi_\omega) = \{\omega\}$.




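Let C be the subset of $\Phi$ containing all canonical formulas. The canonical
formulas are sup-generating, i.e.:


(14.73)  $\forall \varphi \in \Phi,\ \exists \{\varphi_i\} \subseteq C,\ \varphi = \bigvee_i \varphi_i$.


The formulas $\varphi_i$ are associated with the worlds $\omega_i$ which satisfy $\varphi$: for all $\omega_i$
such that $\omega_i \models \varphi$, $\varphi_i = \varphi_{\omega_i}$. This decomposition is useful for some proofs.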





Neighbourhood function (or structuring element) as accessibility relation.
The structuring element B representing a relationship between worlds defines a 
“neighbourhood” of worlds. This corresponds to the notion of neighbourhood 
function of Sec. 2.3. We propose to define this relationship as an accessibility 
relation as in normal modal logics (Hughes and Cresswell, 1968; Chellas, 1980). 

An interesting way to choose the relationship is to base it on distances be- 
tween worlds, as mentioned in Sec. 4.1. Another way to choose the relation- 
ship is to rely on an indistinguishability relation between worlds (Orlowska, 
1993; Balbiani and Orlowska, 1999), for instance based on spatial attributes of 
spatial entities represented by these worlds. Interestingly enough, as shown in 
Orlowska, 1993, modal logics based on such relationships show some links with 
Pawlak’s work on rough sets and rough logic (Pawlak, 1982; Pawlak, 1987), 
while rough sets can be constructed from morphological operators as shown in 
Bloch, 2000b. Also, the modal logic based on rough sets described e.g. in Yao 
and Lin, 1996, has links with the morphological modal logic described below. 
An accessibility relation can be defined from any neighbourhood function B as
follows:


(14.74)  $R(\omega, \omega')$ iff $\omega' \in B(\omega)$.


Conversely, a neighbourhood function can be defined from an accessibility 
relation using this equivalence. This is similar to the notions of Sec. 2.3. 

The accessibility relation R is reflexive iff $\forall \omega \in \Omega,\ \omega \in B(\omega)$. It is symmetrical
iff $\forall (\omega, \omega') \in \Omega^2$, $\omega \in B(\omega')$ iff $\omega' \in B(\omega)$. In general, accessibility
relations derived from a neighbourhood function are not transitive. Indeed in
general if $\omega' \in B(\omega)$ and $\omega'' \in B(\omega')$, we do not necessarily have $\omega'' \in B(\omega)$.


Modal logic from morphological dilations and erosions. Modal operators
$\Box$ (necessity) and $\Diamond$ (possibility) are usually defined from an accessibility
relation (Chellas, 1980) as:


(14.75)  $M, \omega \models \Box\varphi$ iff $\forall \omega' \in \Omega$ such that $R(\omega, \omega')$, $M, \omega' \models \varphi$,


(14.76)  $M, \omega \models \Diamond\varphi$ iff $\exists \omega' \in \Omega$, $R(\omega, \omega')$ and $M, \omega' \models \varphi$,

where M is a standard model related to R, which we will omit in the following in
order to simplify notations (it will always be implicitly related to the considered
accessibility relation).

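Equation (14.75) can be rewritten as:


$\omega \models \Box\varphi \Leftrightarrow \{\omega' \in \Omega \mid R(\omega, \omega')\} \models \varphi$
$\Leftrightarrow \{\omega' \in \Omega \mid \omega' \in B(\omega)\} \models \varphi$
$\Leftrightarrow B(\omega) \models \varphi$,


which corresponds exactly to the definition of the erosion of a formula as defined
in Equation (14.69).

Similarly, Equation (14.76) can be rewritten as:

$\omega \models \Diamond\varphi \Leftrightarrow \{\omega' \in \Omega \mid R(\omega, \omega')\} \cap \mathrm{Mod}(\varphi) \neq \emptyset$
$\Leftrightarrow \{\omega' \in \Omega \mid \omega' \in B(\omega)\} \cap \mathrm{Mod}(\varphi) \neq \emptyset$
$\Leftrightarrow B(\omega) \cap \mathrm{Mod}(\varphi) \neq \emptyset$,


which exactly corresponds to a dilation according to Equation (14.68).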
This shows that we can define modal operators based on an accessibility
relation as erosion and dilation with a neighbourhood function:


(14.77)  $\Box\varphi \equiv \varepsilon_B(\varphi)$,


(14.78)  $\Diamond\varphi \equiv \delta_B(\varphi)$.
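The correspondence (14.77)-(14.78) can be made concrete on a toy frame. In the sketch below the worlds are the integers 0 to 3 and the neighbourhood B(w) contains w and its immediate neighbours; this frame and the function names are illustrative assumptions only.

```python
OMEGA = {0, 1, 2, 3}

def B(w):
    """Neighbourhood function: worlds accessible from w (reflexive, symmetrical)."""
    return {v for v in OMEGA if abs(v - w) <= 1}

def box(phi):
    """Necessity as erosion: worlds whose whole neighbourhood satisfies phi."""
    return {w for w in OMEGA if B(w) <= phi}

def diamond(phi):
    """Possibility as dilation: worlds whose neighbourhood meets the models of phi."""
    return {w for w in OMEGA if B(w) & phi}

phi = {0, 1, 2}                 # a formula given by its set of models
necessarily_phi = box(phi)
possibly_phi = diamond(phi)
```

Since B is reflexive here, the result is nested as expected: the models of the necessity are included in those of the formula, which are included in those of the possibility.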


Let us now list the main properties of these operators. All results below can 
be found in Bloch, 2002, along with the corresponding proofs. Note that some 
results are direct consequences of the results of Sec. 2.3. For instance, T is 
deduced from Proposition 14.21, from which 5c and 4c are derived. 


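LEMMA 14.30 The modal logic built from morphological erosions and dilations
has the following theorems and rules of inference (we use notations similar
to those of Chellas, 1980):

■ T: $\Box\varphi \to \varphi$ and $\varphi \to \Diamond\varphi$ iff $\forall \omega \in \Omega,\ \omega \in B(\omega)$ (reflexive accessibility
relation).

■ Df: $\Diamond\varphi \leftrightarrow \neg\Box\neg\varphi$ and $\Box\varphi \leftrightarrow \neg\Diamond\neg\varphi$.

■ D: $\Box\varphi \to \Diamond\varphi$ iff R is serial (or in other words, $\forall \omega \in \Omega,\ B(\omega) \neq \emptyset$).

■ B: $\Diamond\Box\varphi \to \varphi$ and $\varphi \to \Box\Diamond\varphi$ for symmetrical B.

■ 5c: $\Box\Diamond\varphi \to \Diamond\varphi$ and $\Box\varphi \to \Diamond\Box\varphi$ if $\forall \omega \in \Omega,\ \omega \in B(\omega)$.

■ 4c: $\Box\Box\varphi \to \Box\varphi$ and $\Diamond\varphi \to \Diamond\Diamond\varphi$ if $\forall \omega \in \Omega,\ \omega \in B(\omega)$.

■ N: $\Box\top$ and $\neg\Diamond\bot$.

■ M: $\Box(\varphi \wedge \psi) \to (\Box\varphi \wedge \Box\psi)$ and $(\Diamond\varphi \vee \Diamond\psi) \to \Diamond(\varphi \vee \psi)$.

■ M’: $\Diamond(\varphi \wedge \psi) \to (\Diamond\varphi \wedge \Diamond\psi)$ and $(\Box\varphi \vee \Box\psi) \to \Box(\varphi \vee \psi)$.

■ C: $(\Box\varphi \wedge \Box\psi) \to \Box(\varphi \wedge \psi)$ and $\Diamond(\varphi \vee \psi) \to (\Diamond\varphi \vee \Diamond\psi)$.

■ R: $(\Box\varphi \wedge \Box\psi) \leftrightarrow \Box(\varphi \wedge \psi)$ and $\Diamond(\varphi \vee \psi) \leftrightarrow (\Diamond\varphi \vee \Diamond\psi)$.

■ RN: $\dfrac{\varphi}{\Box\varphi}$.

■ RM: $\dfrac{\varphi \to \psi}{\Box\varphi \to \Box\psi}$ and $\dfrac{\varphi \to \psi}{\Diamond\varphi \to \Diamond\psi}$.

■ RR: $\dfrac{(\varphi \wedge \varphi') \to \psi}{(\Box\varphi \wedge \Box\varphi') \to \Box\psi}$ and $\dfrac{(\varphi \vee \varphi') \to \psi}{(\Diamond\varphi \vee \Diamond\varphi') \to \Diamond\psi}$.

■ RE: $\dfrac{\varphi \leftrightarrow \psi}{\Box\varphi \leftrightarrow \Box\psi}$ and $\dfrac{\varphi \leftrightarrow \psi}{\Diamond\varphi \leftrightarrow \Diamond\psi}$.

■ K: $\Box(\varphi \to \psi) \to (\Box\varphi \to \Box\psi)$ and by duality $(\neg\Diamond\varphi \wedge \Diamond\psi) \to \Diamond(\neg\varphi \wedge \psi)$.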


Since the proposed system contains Df, N, C and is closed by RM, it is a 
normal modal logic (Chellas, 1980). 
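
Several of these schemata can be verified exhaustively on a small finite model. The following is a minimal sketch, assuming a hypothetical four-world line with a reflexive and symmetrical neighbourhood; formulas are identified with their model sets, and the helper names are illustrative.

```python
from itertools import combinations

worlds = frozenset(range(4))
# Hypothetical reflexive, symmetrical neighbourhood on a line.
B = {w: frozenset(v for v in (w - 1, w, w + 1) if v in worlds) for w in worlds}

def box(X):
    return frozenset(w for w in worlds if B[w] <= X)

def diamond(X):
    return frozenset(w for w in worlds if B[w] & X)

def neg(X):
    return worlds - X

def implies(X, Y):
    """Models of the formula X -> Y."""
    return neg(X) | Y

subsets = [frozenset(c) for r in range(len(worlds) + 1)
           for c in combinations(sorted(worlds), r)]

checks = {
    "T": all(box(X) <= X <= diamond(X) for X in subsets),
    "Df": all(diamond(X) == neg(box(neg(X))) for X in subsets),
    "D": all(box(X) <= diamond(X) for X in subsets),
    "B": all(diamond(box(X)) <= X <= box(diamond(X)) for X in subsets),
    "K": all(implies(box(implies(X, Y)), implies(box(X), box(Y))) == worlds
             for X in subsets for Y in subsets),
    "4": all(box(X) <= box(box(X)) for X in subsets),
}
print(checks)
```

On this model T, Df, D, B and K hold for every subset, whereas schema 4 fails (for instance on the set {0, 1, 2}), in line with the reflexive, symmetrical but non-transitive accessibility relation.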


LEMMA 14.31 On the contrary, the following expressions are not satisfied in
general:

■ 5: $\Diamond\varphi \to \Box\Diamond\varphi$ (since for a symmetrical $B$ the dilation followed by an
erosion is a closing, which does not necessarily contain the dilation).

■ 4: $\Box\varphi \to \Box\Box\varphi$ (since eroding a region twice produces a smaller region
if $\omega \in B(\omega)$).

Let us now denote by $\Box^n$ the iteration of $n$ times $\Box$ (i.e. $n$ erosions by the
same structuring element). Since the succession of $n$ erosions by a structuring
element is equivalent to one erosion by a larger structuring element, of size $n$
(iterativity property of erosion), $\Box^n$ is a new modal operator, constructed as in
Equation (14.77). In a similar way, we denote by $\Diamond^n$ the iteration of $n$ times
$\Diamond$, which is again a new modal operator, due to iterativity property of dilation,
constructed as in Equation (14.78) with a structuring element or neighbourhood
of size $n$. We set $\Box^1 = \Box$ and $\Diamond^1 = \Diamond$.
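
The iterativity property can be illustrated on a finite model: applying the operator $n$ times gives the same result as applying it once with a composed neighbourhood. The following is a minimal sketch, assuming a hypothetical seven-world line; `compose` builds the neighbourhood "of size $n$" as the set of worlds reachable in $n$ steps, which is one possible reading of that notion in a discrete setting.

```python
worlds = set(range(7))
# Hypothetical unit neighbourhood on a line, clipped to the domain.
B = {w: {v for v in (w - 1, w, w + 1) if v in worlds} for w in worlds}

def box(X, nbh):
    return {w for w in worlds if nbh[w] <= X}

def diamond(X, nbh):
    return {w for w in worlds if nbh[w] & X}

def iterate(op, X, nbh, n):
    """Apply the operator n times with the same neighbourhood."""
    for _ in range(n):
        X = op(X, nbh)
    return X

def compose(nbh, n):
    """Neighbourhood of size n: worlds reachable from w in n steps of nbh."""
    result = {w: {w} for w in nbh}  # size 0 is the identity
    for _ in range(n):
        result = {w: set().union(*(nbh[v] for v in result[w])) for w in nbh}
    return result

phi = {1, 2, 3, 4, 5}
B2 = compose(B, 2)
print(sorted(iterate(box, phi, B, 2)), sorted(box(phi, B2)))          # [3] [3]
print(iterate(diamond, {3}, B, 2) == diamond({3}, B2))                # True

# Schema 5 fails here: the dilation of {3} is not contained in its closing.
print(diamond({3}, B) <= box(diamond({3}, B), B))                      # False
```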
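We also have the following theorems for symmetrical $B$ and $\omega \in B(\omega)$:

■ $\Diamond^n \Diamond^{n'} \varphi \leftrightarrow \Diamond^{n+n'} \varphi$ and $\Box^n \Box^{n'} \varphi \leftrightarrow \Box^{n+n'} \varphi$ (iterativity properties of
dilation and erosion). This property holds also in a more general case,
without assumption on the symmetry of $B$.

■ $\Diamond\Box\Diamond\Box\varphi \leftrightarrow \Diamond\Box\varphi$, and $\Box\Diamond\Box\Diamond\varphi \leftrightarrow \Box\Diamond\varphi$ (idempotence of opening and
closing). This is actually a theorem from any KB logic: $\Diamond\Box\Diamond\Box\varphi \to \Diamond\Box\varphi$
is B applied to $\Diamond\Box\varphi$, and $\Diamond\Box\varphi \to \Diamond\Box\Diamond\Box\varphi$ comes from B applied
to $\Box\varphi$ and from RM.

■ $\Diamond\Box\Diamond\varphi \leftrightarrow \Diamond\varphi$, and $\Box\Diamond\Box\varphi \leftrightarrow \Box\varphi$.

■ More generally, we derive from properties of opening and closing the
following theorems:

$\Diamond^n \Box^n \Diamond^{n'} \Box^{n'} \varphi \leftrightarrow \Diamond^{\max(n,n')} \Box^{\max(n,n')} \varphi$,

and

$\Box^n \Diamond^n \Box^{n'} \Diamond^{n'} \varphi \leftrightarrow \Box^{\max(n,n')} \Diamond^{\max(n,n')} \varphi$.

See also the paragraph on granulometries in Sec. 2.4.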


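■ For $n < n'$, the following expressions are theorems (if $R$ is reflexive):
$\Box^{n'}\varphi \to \Box^n\varphi$, $\Diamond^n\varphi \to \Diamond^{n'}\varphi$, $\Diamond^{n'}\Box^{n'}\varphi \to \Diamond^n\Box^n\varphi$, $\Box^n\Diamond^n\varphi \to \Box^{n'}\Diamond^{n'}\varphi$.
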
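The properties of iterated operators, opening and closing listed above can be checked exhaustively on a small model. The following is a minimal sketch, assuming a hypothetical six-world line with a reflexive and symmetrical neighbourhood; the opening is taken as erosion followed by dilation and the closing as dilation followed by erosion, and all names are illustrative.

```python
from itertools import combinations

worlds = frozenset(range(6))
B = {w: frozenset(v for v in (w - 1, w, w + 1) if v in worlds) for w in worlds}

def box(X, n=1):
    for _ in range(n):
        X = frozenset(w for w in worlds if B[w] <= X)
    return X

def diamond(X, n=1):
    for _ in range(n):
        X = frozenset(w for w in worlds if B[w] & X)
    return X

def opening(X, n=1):
    return diamond(box(X, n), n)

def closing(X, n=1):
    return box(diamond(X, n), n)

subsets = [frozenset(c) for r in range(len(worlds) + 1)
           for c in combinations(sorted(worlds), r)]

results = {
    "opening idempotent": all(opening(opening(X)) == opening(X) for X in subsets),
    "closing idempotent": all(closing(closing(X)) == closing(X) for X in subsets),
    "dilate-erode-dilate": all(diamond(box(diamond(X))) == diamond(X) for X in subsets),
    "erode-dilate-erode": all(box(diamond(box(X))) == box(X) for X in subsets),
    "openings of sizes 1 and 2": all(
        opening(opening(X, 1), 2) == opening(X, 2) == opening(opening(X, 2), 1)
        for X in subsets),
    "monotone in n": all(box(X, 2) <= box(X, 1) and diamond(X, 1) <= diamond(X, 2)
                         for X in subsets),
}
print(results)
```

All six checks evaluate to True on this model; the last one relies on reflexivity, the others on the symmetry of the neighbourhood.
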
Modal operators from adjunction. Now, we consider the more general
framework of algebraic erosions and dilations and the fundamental properties
of adjunction (Heijmans and Ronse, 1990; Heijmans, 1994, Sec. 2.3).

Generalizing the definitions of Bloch and Lang, 2000, an algebraic dilation
$\delta$ on $\Phi$ is defined as an operation which commutes with disjunction, and an
algebraic erosion $\varepsilon$ as an operation which commutes with conjunction, i.e. we
have the two following expressions for any family $\{\varphi_i\}$:

(14.79) $\delta(\bigvee_i \varphi_i) \equiv \bigvee_i \delta(\varphi_i)$,

(14.80) $\varepsilon(\bigwedge_i \varphi_i) \equiv \bigwedge_i \varepsilon(\varphi_i)$.


One of the fundamental concepts in the algebraic framework is that of
adjunction (see Sec. 2.3). Using similar concepts, modal operators can be defined
on $\Phi$. A pair of modal operators $(\Box, \Diamond')$ is an adjunction on $\Phi$ iff:

(14.81) $\forall (\varphi, \psi) \in \Phi^2$, $\models \Diamond'\varphi \to \psi$ iff $\models \varphi \to \Box\psi$,

or in other words:

$\dfrac{\Diamond'\varphi \to \psi}{\varphi \to \Box\psi}$ and $\dfrac{\varphi \to \Box\psi}{\Diamond'\varphi \to \psi}$.

In terms of worlds, this can also be expressed as:

(14.82) $\forall (\varphi, \psi) \in \Phi^2$, $\mathrm{Mod}(\Diamond'\varphi) \subseteq \mathrm{Mod}(\psi)$ iff $\mathrm{Mod}(\varphi) \subseteq \mathrm{Mod}(\Box\psi)$.

At this point, we use the notation $(\Box, \Diamond')$ instead of the classical notation $(\Box, \Diamond)$
because, as will be seen later, the two operators are not necessarily dual. In
general they are two different modal operators.
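
The difference between the adjoint operator and the dual one becomes visible as soon as the neighbourhood is not symmetrical. The following is a minimal sketch, assuming a hypothetical four-world model with $B(\omega) = \{\omega, \omega + 1\}$; `diamond_adj` is built as the union of the neighbourhoods of the models, which is one standard way to obtain the dilation adjoint to the erosion by $B$, and the names are illustrative.

```python
from itertools import combinations

worlds = frozenset(range(4))
# Hypothetical non-symmetrical neighbourhood: B(w) = {w, w+1} clipped to the domain.
B = {w: frozenset(v for v in (w, w + 1) if v in worlds) for w in worlds}

def box(X):
    """Erosion by B."""
    return frozenset(w for w in worlds if B[w] <= X)

def diamond_adj(X):
    """Dilation adjoint to box: union of B(v) over the models v of the formula."""
    return frozenset(u for v in X for u in B[v])

def diamond_dual(X):
    """Classical dual of box: not box not."""
    return worlds - box(worlds - X)

subsets = [frozenset(c) for r in range(len(worlds) + 1)
           for c in combinations(sorted(worlds), r)]

# Condition (14.82), checked on every pair of model sets.
is_adjunction = all((diamond_adj(X) <= Y) == (X <= box(Y))
                    for X in subsets for Y in subsets)
dual_is_adjoint = all(diamond_dual(X) == diamond_adj(X) for X in subsets)
print(is_adjunction, dual_is_adjoint)  # True False
```

For the singleton {0}, the adjoint dilation gives {0, 1} while the dual operator gives {0}, so the two operators differ on this model.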























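LEMMA 14.32 If $(\Box, \Diamond')$ is an adjunction on $\Phi$, then $\Box$ is an algebraic erosion,
and $\Diamond'$ is an algebraic dilation, i.e. for any family $\{\varphi_i\}$, we have:

(14.83) $\Box \bigwedge_i \varphi_i \equiv \bigwedge_i \Box\varphi_i$,

(14.84) $\Diamond' \bigvee_i \varphi_i \equiv \bigvee_i \Diamond'\varphi_i$.

These equivalences are also true for empty families, since we have $\Box\top \equiv \top$
and $\Diamond'\bot \equiv \bot$.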














LEMMA 14.33 Let $(\Box, \Diamond')$ be an adjunction on the lattice of logical formulas.
The modal logic based on these operators has the following theorems and rules
of inference (we use similar notations as in Theorem 14.30 but $\Diamond$ has to be
replaced by $\Diamond'$): B, N, M, M', C, R, RN, RM, RR, RE, K.

The proof is derived mainly from Theorem 14.32, from Eqs. (14.73, 14.81-14.84)
and from the following result:





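LEMMA 14.34 We can write $\Box$ and $\Diamond'$ as:

(14.85) $\Box\varphi \equiv \bigvee \{\psi \in \Phi,\ \Diamond'\psi \to \varphi\}$,

(14.86) $\Diamond'\varphi \equiv \bigwedge \{\psi \in \Phi,\ \varphi \to \Box\psi\}$.

Again formulas are considered up to the equivalence relation, and therefore
$\bigvee$ and $\bigwedge$ are taken over a finite family.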


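LEMMA 14.35 T, 5c and 4c are not always satisfied, and we have the following
results:

■ T iff $\forall \omega \in \Omega$, $\omega \models \Diamond'\varphi_\omega$,
■ 5c iff $\forall \omega \in \Omega$, $\omega \models \Diamond'\varphi_\omega$,
■ 4c iff $\forall \omega \in \Omega$, $\omega \models \Diamond'\varphi_\omega$.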





Note that the condition on $B$ for T in Theorem 14.30 corresponds to the one
above, and we have $B(\omega) = \mathrm{Mod}(\Diamond'\varphi_\omega)$.


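LEMMA 14.36 We have the two following additional theorems:

■ $\Box\Diamond'\Box\varphi \leftrightarrow \Box\varphi$ and $\Diamond'\Box\Diamond'\varphi \leftrightarrow \Diamond'\varphi$.
■ $\Diamond'\Box\Diamond'\Box\varphi \leftrightarrow \Diamond'\Box\varphi$ and $\Box\Diamond'\Box\Diamond'\varphi \leftrightarrow \Box\Diamond'\varphi$.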










































































LEMMA 14.37 Let $(\Box, \Diamond')$ be an adjunction on $\Phi$. Let $\Diamond_1\varphi = \neg\Box\neg\varphi$ and
$\Box_1\varphi = \neg\Diamond'\neg\varphi$. Then $(\Box_1, \Diamond_1)$ is an adjunction.

















This property expresses a kind of duality between both operators. 




















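Note that we do not always have: Df: $\Diamond'\varphi \leftrightarrow \neg\Box\neg\varphi$ and $\Box\varphi \leftrightarrow \neg\Diamond'\neg\varphi$,
nor D: $\Box\varphi \to \Diamond'\varphi$.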





























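LEMMA 14.38 Df is satisfied by an adjunction $(\Box, \Diamond')$ if and only if $\Diamond'$
satisfies the following property:

(14.87) $\forall (\omega, \omega') \in \Omega^2$, $\omega \models \Diamond'\varphi_{\omega'}$ iff $\omega' \models \Diamond'\varphi_\omega$.

D is satisfied by an adjunction $(\Box, \Diamond')$ if $\Diamond'$ satisfies one of the two following
properties:

(14.88) $\forall \omega \in \Omega$, $\omega \models \Diamond'\varphi_\omega$

or

(14.89) $\forall (\omega, \omega') \in \Omega^2$, $\omega \models \Diamond'\varphi_{\omega'}$ iff $\omega' \models \Diamond'\varphi_\omega$ and $\{\omega'',\ \omega'' \models \Diamond'\varphi_\omega\} \neq \emptyset$.

The last result (see Proposition 14.20) means in particular that we can have
D without having T.

In cases where Df is satisfied, we simply write $\Diamond$ instead of $\Diamond'$.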





LEMMA 14.39 The operators $(\Box, \Diamond)$ defined by Eqs. (14.77, 14.78) form an
adjunction when $B$ is symmetrical.











This shows that modal operators derived from morphological erosions and 
dilations are particular cases of modal operators derived from algebraic erosions 
and dilations. 

All these results show that the use of general algebraic dilations and erosions
defined from the adjunction property leads to the properties of normal modal
logics. This justifies the use of Kripke’s semantics. This also guarantees a
completeness result.

Characterizing modal logics in terms of morphological operators. Conversely,
the following result shows that modal operators satisfying some axioms
can be expressed in morphological terms.


























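LEMMA 14.40 If two modal operators $\Box$ and $\Diamond$ satisfy B and RM, then $(\Box, \Diamond)$
is an adjunction on $\Phi$, $\Box$ is an algebraic erosion and $\Diamond$ is an algebraic dilation.

Moreover, if we define a relation $R$ between worlds by $R(\omega, \omega')$ iff
$\omega \models \Diamond\varphi_{\omega'}$, where $\varphi_\omega$ is a canonical formula associated with $\omega$
($\mathrm{Mod}(\varphi_\omega) = \{\omega\}$), then $\Box$ and $\Diamond$ are exactly given by:

(14.90) $\mathrm{Mod}(\Box\varphi) = \{\omega \in \Omega \mid \forall \omega', R(\omega', \omega) \to \omega' \models \varphi\}$,

(14.91) $\mathrm{Mod}(\Diamond\varphi) = \{\omega \in \Omega \mid \exists \omega', R(\omega, \omega'), \omega' \models \varphi\}$.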


These equations are similar to the ones used for defining modal operators 
from an accessibility relation and a structuring element, except that here we 
consider R(w,w’) for one operator, and R(w’,w) for the other. If R is sym- 
metrical, both are equivalent. In cases where the structuring element (and the 
accessibility relation) is not symmetrical, we consider its symmetrical in one 
of the operations. 
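
The construction of the relation from the dilation of canonical formulas can be sketched on a finite model. The code below is a minimal illustration, assuming a hypothetical adjoint pair built from the non-symmetrical neighbourhood $B(\omega) = \{\omega, \omega + 1\}$; it recovers $R$ from the dilation of singletons and checks that Eqs. (14.90) and (14.91) give back the original operators. All names are illustrative.

```python
from itertools import combinations

worlds = frozenset(range(4))
B = {w: frozenset(v for v in (w, w + 1) if v in worlds) for w in worlds}

def box(X):
    """Erosion by B."""
    return frozenset(w for w in worlds if B[w] <= X)

def diamond(X):
    """Dilation adjoint to box (union of the neighbourhoods of the models)."""
    return frozenset(u for v in X for u in B[v])

# R(w, w') iff w satisfies the dilation of the canonical formula of w'.
R = {(w, v) for v in worlds for w in diamond(frozenset({v}))}

def box_from_R(X):
    """Eq. (14.90): all w' with R(w', w) must satisfy the formula."""
    return frozenset(w for w in worlds
                     if all(u in X for u in worlds if (u, w) in R))

def diamond_from_R(X):
    """Eq. (14.91): some w' with R(w, w') satisfies the formula."""
    return frozenset(w for w in worlds
                     if any((w, u) in R and u in X for u in worlds))

subsets = [frozenset(c) for r in range(len(worlds) + 1)
           for c in combinations(sorted(worlds), r)]
recovered = all(box_from_R(X) == box(X) and diamond_from_R(X) == diamond(X)
                for X in subsets)
print(recovered)  # True
```

Note that the relation is read in one direction for the dilation and in the other for the erosion, which matters here because the neighbourhood is not symmetrical.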


Modal operators from morphological opening and closing. We can
define modal operators from opening and closing on formulas as:

(14.92) $\Box\varphi \equiv O(\varphi)$,

(14.93) $\Diamond\varphi \equiv C(\varphi)$.


Unfortunately, this leads to weaker properties than operators derived from 
erosion and dilation. This comes partly from the fact that no accessibility 
relation can be derived from opening and closing as easily as from erosion and 
dilation. 

However, it would be interesting to link this approach with the topological 
interpretation of modal logic as proposed in Aiello and van Benthem, 1999, 
since opening and closing are related to the notions of topological interior and 
closure. Note that considering erosion and dilation only leads to a pre-topology 
(where closure is not idempotent). 

Another interesting direction could be to consider the neighbourhood
semantics (Aiello and van Benthem, 1999), where here the neighbourhoods of $\omega$
would be all elements of the set $N(\omega) = \{B(\omega') \mid \omega' \in \Omega \text{ and } \omega \in B(\omega')\}$.
With this semantics, we can prove:

(14.94) $\omega \models \Box\varphi \Leftrightarrow \exists \omega' \in \Omega \mid B(\omega') \in N(\omega)$ and $B(\omega') \models \varphi$.

The proof of this expression comes from the following rewriting of opening:

(14.95) $\mathrm{Mod}(\Box\varphi) = \{\omega \in \Omega \mid \exists \omega' \in \Omega,\ \omega \in B(\omega') \text{ and } B(\omega') \models \varphi\}$.

Kripke’s semantics can be seen as a particular case, where the neighbourhood
of $\omega$ is reduced to the singleton $\{B(\omega)\}$.


LEMMA 14.41 The modal logic constructed from opening and closing satisfies 
T, Df, D, 4, 4c, 5c, N, M, M’, RM, RE, but not 5, B, K, C, R, RR. 


The fact that K is not satisfied goes with the interpretation in terms of neigh- 
bourhood semantics, which leads to a weaker logic, where RM (monotonicity) 
is satisfied, but not K in general (Aiello and van Benthem, 1999). 
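
The rewriting of the opening in terms of neighbourhoods, and the loss of schema C, can both be observed on a small model. The following is a minimal sketch, assuming a hypothetical four-world line with a symmetrical neighbourhood; the function names are illustrative.

```python
from itertools import combinations

worlds = frozenset(range(4))
B = {w: frozenset(v for v in (w - 1, w, w + 1) if v in worlds) for w in worlds}

def erode(X):
    return frozenset(w for w in worlds if B[w] <= X)

def dilate(X):
    return frozenset(w for w in worlds if B[w] & X)

def opening(X):
    """Eq. (14.95): w belongs to some neighbourhood B(w') included in the models."""
    return frozenset(w for w in worlds
                     if any(w in B[v] and B[v] <= X for v in worlds))

subsets = [frozenset(c) for r in range(len(worlds) + 1)
           for c in combinations(sorted(worlds), r)]

# For a symmetrical B the rewriting agrees with erosion followed by dilation.
agrees = all(opening(X) == dilate(erode(X)) for X in subsets)

# RM (monotonicity) holds for the opening used as a necessity operator ...
rm_holds = all(opening(X) <= opening(Y)
               for X in subsets for Y in subsets if X <= Y)

# ... but C fails: the two openings overlap although the conjunction opens to nothing.
X, Y = frozenset({0, 1, 2}), frozenset({1, 2, 3})
c_holds = (opening(X) & opening(Y)) <= opening(X & Y)
print(agrees, rm_holds, c_holds)  # True True False
```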


Extension to the fuzzy case. We now consider fuzzy formulas, i.e. formulas
$\varphi$ for which $\mathrm{Mod}(\varphi)$ is a fuzzy subset of $\Omega$, and use the fuzzy morphological
operators of Bloch and Maître, 1995 (see Sec. 3.4). However, what follows
applies as well if other definitions are used.

Modal operators in the fuzzy case can then be constructed from fuzzy erosion 
and dilation in a similar way as in the crisp case using Eqs. (14.77, 14.78). The 
fuzzy structuring element can be interpreted as a fuzzy relation between worlds. 
The properties of this fuzzy modal logic are the same as in the crisp case, since 
fuzzy dilations and erosions have the same properties as the binary ones. 

This extension can also be considered from the algebraic point of view of 
adjunction, based on the results of Deng and Heijmans, 2002, and on a definition 
of fuzzy erosion in terms of residual implication. 

The use of fuzzy structuring elements will appear as particularly useful for 
expressing intrinsically vague spatial relationships such as directional relative 
position. 

It is also interesting to relate this approach to the possibilistic logic proposed 
for belief fusion in Boldrin and Saffiotti, 1995, and to similarity-based reasoning 
(Esteva et al., 1997; Dubois et al., 1997). 


4.3 Qualitative representation of spatial relationships and 
reasoning 


For qualitative spatial reasoning, worlds (or interpretations) can represent
spatial entities, like regions of the space. Formulas then represent combinations
of such entities, and define regions, objects, etc., which may be not connected.
For instance, if a formula $\varphi$ is a symbolic representation of a region $X$ of the
space, it can be interpreted for instance as “the object we are looking at is in
$X$”. In an epistemic interpretation, it could represent the belief of an agent that
the object is in $X$. The interest of such representations could be also to deal in
a qualitative way with any kind of spatial entities, without referring to points.

Using these interpretations, if $\varphi$ represents some knowledge or belief about a
region $X$ of the space, then $\Box\varphi$ represents a restriction of $X$. If we are looking
at an object in $X$, then $\Box\varphi$ is a necessary region for this object. Similarly,
$\Diamond\varphi$ represents an extension of $X$, and a possible region for the object. In an
epistemic interpretation, $\Box\varphi$ can represent the belief of an agent that the object
is necessarily in the erosion of $X$ while $\Diamond\varphi$ is the belief that it is possibly in
the dilation of $X$. Interpretations in terms of rough regions are also possible.

In this subsection, we address the problem of qualitative representation of 
spatial relationships between regions or objects represented by logical formulas. 























Topological relationships. Let us first consider topological relationships.
Let $\varphi$ and $\psi$ be two formulas representing two regions $X$ and $Y$ of the space.
Note that all that follows holds in both crisp and fuzzy cases. Simple topological
relations such as inclusion, exclusion, intersection do not call for more
operators than the standard ones of propositional logic (e.g. Bennett, 1995).
But other relations, such as “$X$ is a tangential part of $Y$”, can benefit from the
morphological modal operators. Such a relationship can be expressed as:

(14.96) $\varphi \to \psi$ and $\Diamond\varphi \wedge \neg\psi$ consistent,

or, equivalently,

(14.97) $\varphi \to \psi$ and $\varphi \wedge \neg\Box\psi$ consistent.

Indeed, if $X$ is a tangential part of $Y$, it is included in $Y$ but its dilation is
not, and equivalently it is not included in the erosion of $Y$, as illustrated in
Fig. 14.13.

In a similar way, a relation such as “$X$ is a non tangential part of $Y$” is
expressed, for a reflexive accessibility relation, as:

(14.98) $\Diamond\varphi \to \psi$,

or, equivalently,

(14.99) $\varphi \to \Box\psi$,

(i.e. in order to verify that $X$ is a non tangential part of $Y$, we have to prove
these relations).

If we also want $X$ to be a proper part, we have to add the following condition:

(14.100) $\neg\varphi \wedge \psi$ consistent.

Let us now consider adjacency (or external connection). Saying that $X$ is
adjacent to $Y$ means that they do not intersect and as soon as one region is
dilated, it has a non empty intersection with the other one. In symbolic terms,
this relation can be expressed as:

(14.101) $\varphi \wedge \psi$ inconsistent and $\Diamond\varphi \wedge \psi$ consistent (or $\varphi \wedge \Diamond\psi$ consistent).


Actually, this expression holds in a discrete domain. If $\varphi$ and $\psi$ represent
spatial entities in a continuous spatial domain, some problems may occur if
these entities are closed sets and have parts of local dimension less than the
dimension of the space (see Bloch et al., 1997 for a complete discussion). Such
problems can be avoided if the entities are reduced to regular ones, i.e. that
are equal to the closure of their interior (and by considering an asymptotic
definition of adjacency). Using the topological interpretation of modal logic,
this amounts to dealing with formulas for which we can prove $\varphi \leftrightarrow \Diamond\Box\varphi$.

It is interesting to link these types of representations with the ones developed
in the community of mereology and mereotopology, where such relations
are defined respectively from parthood and connection predicates (Asher and
Vieu, 1995; Randell et al., 1992; Cohn et al., 1997; Varzi, 1996; Renz and
Nebel, 2001). Interestingly enough, erosion is defined from inclusion (i.e.
a parthood relationship) and dilation from intersection (i.e. a connection
relationship). Some axioms of these domains could be expressed in terms of
dilation. For instance from a parthood postulate $P(X, Y)$ between two spatial
entities $X$ and $Y$ and from dilation $\delta$, tangential proper part could be defined as
$TPP(X, Y) = P(X, Y) \wedge \neg P(Y, X) \wedge \neg P(\delta(X), Y)$. Further links certainly
deserve to be investigated, in particular with the work presented e.g. in Cohn
et al., 1997, Cristani et al., 2000 and Galton, 2000.
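
The topological relations above reduce to a few set tests once regions are identified with sets of worlds. The following is a minimal sketch, assuming a hypothetical discrete domain of ten points on a line with unit neighbourhoods; the predicate names are illustrative and the regions are arbitrary examples.

```python
worlds = frozenset(range(10))
B = {w: frozenset(v for v in (w - 1, w, w + 1) if v in worlds) for w in worlds}

def dilate(X):
    return frozenset(w for w in worlds if B[w] & X)

def erode(X):
    return frozenset(w for w in worlds if B[w] <= X)

def tangential_part(X, Y):
    """(14.96): X included in Y, but the dilation of X is not."""
    return X <= Y and not dilate(X) <= Y

def tangential_part_alt(X, Y):
    """(14.97): X included in Y, but not in the erosion of Y."""
    return X <= Y and not X <= erode(Y)

def non_tangential_part(X, Y):
    """(14.98): the dilation of X is included in Y."""
    return dilate(X) <= Y

def adjacent(X, Y):
    """(14.101): X and Y are disjoint, but the dilation of X meets Y."""
    return not (X & Y) and bool(dilate(X) & Y)

Y = frozenset(range(2, 8))   # region {2, ..., 7}
X1 = frozenset({2, 3})       # touches the border of Y
X2 = frozenset({4, 5})       # strictly inside Y
Z = frozenset({8, 9})        # next to Y

print(tangential_part(X1, Y), tangential_part_alt(X1, Y))  # True True
print(non_tangential_part(X2, Y), non_tangential_part(X1, Y))  # True False
print(adjacent(Y, Z), adjacent(X2, Z))  # True False
```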





Distances. Distances between objects $X$ and $Y$ can be expressed in different
forms, as the distance between $X$ and $Y$ is equal to $n$, the distance between $X$
and $Y$ is less (respectively greater) than $n$, the distance between $X$ and $Y$ is
between $n_1$ and $n_2$. Several distances can be related to morphological dilation,
as minimum distance and Hausdorff distance, as explained in Sec. 3.3.

Based on algebraic expressions of distances using dilation, the translation
into a logical formalism is quite straightforward. Expressing that $d_N(X, Y) = n$
leads to:

(14.102) $\forall m < n$, $\Diamond^m\varphi \wedge \psi$ inconsistent, and $\Diamond^m\psi \wedge \varphi$ inconsistent,
and $\Diamond^n\varphi \wedge \psi$ consistent (or $\Diamond^n\psi \wedge \varphi$ consistent).

Expressions like $d_N(X, Y) \leq n$ translate into:

(14.103) $\Diamond^n\varphi \wedge \psi$ consistent (or $\Diamond^n\psi \wedge \varphi$ consistent).

Expressions like $d_N(X, Y) \geq n$ translate into:

(14.104) $\forall m < n$, $\Diamond^m\varphi \wedge \psi$ inconsistent (or $\Diamond^m\psi \wedge \varphi$ inconsistent).

Expressions like $n_1 \leq d_N(X, Y) \leq n_2$ translate into:

(14.105) $\forall m < n_1$, $\Diamond^m\varphi \wedge \psi$ inconsistent (or $\Diamond^m\psi \wedge \varphi$ inconsistent)
and $\Diamond^{n_2}\varphi \wedge \psi$ consistent (or $\Diamond^{n_2}\psi \wedge \varphi$ consistent).

Similarly for Hausdorff distance, we translate $H_d(X, Y) = n$ by:

(14.106) $\forall m < n$, $\psi \wedge \neg\Diamond^m\varphi$ consistent or $\varphi \wedge \neg\Diamond^m\psi$ consistent,
and $\psi \to \Diamond^n\varphi$ and $\varphi \to \Diamond^n\psi$.

The first condition corresponds to $H_d(X, Y) \geq n$ and the second one to
$H_d(X, Y) \leq n$.
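
Read operationally, these conditions say that both distances can be found by dilating step by step until a consistency or inclusion test succeeds. The following is a minimal sketch, assuming a hypothetical discrete line of ten points and unit neighbourhoods; the function names and the convention of returning `None` when no size works are assumptions of this illustration.

```python
worlds = frozenset(range(10))
B = {w: frozenset(v for v in (w - 1, w, w + 1) if v in worlds) for w in worlds}

def dilate(X):
    return frozenset(w for w in worlds if B[w] & X)

def min_distance(X, Y):
    """Smallest n such that the n-th dilation of X meets Y, as in (14.102)."""
    D = X
    for n in range(len(worlds) + 1):
        if D & Y:
            return n
        D = dilate(D)
    return None  # the dilations of X never reach Y

def hausdorff_distance(X, Y):
    """Smallest n such that each set lies in the n-th dilation of the other, as in (14.106)."""
    DX, DY = X, Y
    for n in range(len(worlds) + 1):
        if Y <= DX and X <= DY:
            return n
        DX, DY = dilate(DX), dilate(DY)
    return None

X = frozenset({0, 1})
Y = frozenset({4, 5, 6})
print(min_distance(X, Y), hausdorff_distance(X, Y))  # 3 5
```

Here the closest points are 1 and 4, and the point 6 of $Y$ lies at distance 5 from $X$, which determines the Hausdorff distance.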

Let us consider an example of possible use of these representations for spatial
reasoning. If we are looking at an object represented by $\psi$ in an area which is at
a distance in an interval $[n_1, n_2]$ of a region represented by $\varphi$, this corresponds
to a minimum distance greater than $n_1$ and to a Hausdorff distance less than
$n_2$. This is illustrated in Fig. 14.12.

Then we have to check the following relation:

(14.107) $\psi \to \neg\Diamond^{n_1}\varphi \wedge \Diamond^{n_2}\varphi$,

or equivalently:

(14.108) $\psi \to \Box^{n_1}\neg\varphi \wedge \Diamond^{n_2}\varphi$.














This expresses in a symbolic way an imprecise knowledge about distances 
represented as an interval. If we consider a fuzzy interval, this extends directly 
by means of fuzzy dilation (see Bloch, 2000c, for detailed expressions of these 
dilations). 
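
In the crisp case, the interval constraint amounts to checking that the object lies in the difference between two dilations of the reference region. The following is a minimal sketch, assuming a hypothetical discrete line of ten points; the function names and the example regions are illustrative.

```python
worlds = frozenset(range(10))
B = {w: frozenset(v for v in (w - 1, w, w + 1) if v in worlds) for w in worlds}

def dilate(X, n=1):
    for _ in range(n):
        X = frozenset(w for w in worlds if B[w] & X)
    return X

def erode(X, n=1):
    for _ in range(n):
        X = frozenset(w for w in worlds if B[w] <= X)
    return X

def within_interval(psi, phi, n1, n2):
    """(14.107): psi lies in the n2-dilation of phi but outside its n1-dilation."""
    return psi <= (dilate(phi, n2) - dilate(phi, n1))

def within_interval_alt(psi, phi, n1, n2):
    """(14.108): same test, with the n1-erosion of the complement of phi."""
    return psi <= (erode(worlds - phi, n1) & dilate(phi, n2))

phi = frozenset({0, 1})
print(within_interval(frozenset({4, 5, 6}), phi, 2, 5))      # True
print(within_interval(frozenset({3, 4}), phi, 2, 5))         # False
print(within_interval_alt(frozenset({4, 5, 6}), phi, 2, 5))  # True
```

The two tests agree because the complement of a dilation is the erosion of the complement.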

These expressions show how we can convert distance information, which is 
usually defined in an analytical way, into algebraic expressions through mathe- 
matical morphology, and then into logical expressions through morphological 
expressions of modal operators. 


Directional relative position. We use for this relation the same approach as
in Sec. 3.4, based on dilation. Let us denote by $D^d$ the dilation corresponding
to a directional information in the direction $d$, and by $\Diamond^d$ the associated modal
operator (this assumes that directions can be defined over the set of spatial
entities represented as logical formulas). Expressing that an object represented
by $\psi$ has to be in direction $d$ with respect to a region represented by $\varphi$ amounts
to checking the following relation:

(14.109) $\psi \to \Diamond^d\varphi$.


In the fuzzy case, this relation can hold to some degree. 

Usually for spatial reasoning several relationships have to be used together. 
This aspect can benefit from the developments in information fusion, both in a 
numerical and in a logical setting. 
Logical reasoning and inference. One of the advantages of logical
representations is their inference and reasoning power. Rule-based systems can
make use of the proposed representations in a quite straightforward way. But 
it is also interesting to note that several spatial logics contain ingredients that 
can be expressed equivalently in morphological terms. We show here some of 
these links but do not pretend to be exhaustive. 

Some links with mereotopology and region connection calculus (RCC) have
already been mentioned above. They allow us to combine the expressive
power of mathematical morphology and the reasoning power of RCC and
mereotopology.

The “egg-yolk” structures, as developed e.g. in Cristani et al., 2000 can also 
lead to interpretations in terms of mathematical morphology. For instance in 
this model, establishing if a yolk can be a mobile part (in translation) of its egg is 
based on the notion of congruence. This characterization can be expressed in a 
very simple way using morphological opening (erosion followed by a dilation): 
the opening of the egg by the yolk considered as the structuring element should 
be connected. 

Let us now consider two examples of logics of distances. The first one
defines a modality $A^{\leq a}$ by Kutz et al., 2002:

(14.110) $\omega \models A^{\leq a}\varphi$ iff $\forall u$, $d(\omega, u) \leq a \to u \models \varphi$,

where $d$ is a distance between worlds. It is straightforward to show that $A^{\leq a}\varphi$ is
equivalent to the erosion of $\varphi$ by a ball of the distance $d$ of radius $a$. The dual of
$A^{\leq a}$ is equivalent to a dilation. Then we have direct correspondences between
the axioms of this distance logics and the axioms of the modal morpho-logics
as presented in Bloch, 2002. Some theorems can be also directly deduced from
properties of dilation or erosion. For instance, the following is proved to be a
theorem:

(14.111) $A^{\leq b}\varphi \to A^{\leq a}\varphi$ for $a \leq b$.

Using the morphological equivalence, this theorem is directly deduced from the
decreasingness of erosion with respect to the size of the structuring element.
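
The correspondence between the distance modality and erosion by a ball can be sketched on a finite metric space. The code below is a minimal illustration, assuming ten hypothetical worlds on a line with the distance $d(u, v) = |u - v|$; the names `A_leq` and `erode_by_ball` are illustrative.

```python
worlds = frozenset(range(10))

def d(u, v):
    """Hypothetical distance between worlds: positions on a line."""
    return abs(u - v)

def A_leq(phi, a):
    """(14.110): worlds all of whose points within distance a satisfy phi."""
    return frozenset(w for w in worlds
                     if all(u in phi for u in worlds if d(w, u) <= a))

def erode_by_ball(phi, a):
    """Erosion of phi by the ball of radius a used as neighbourhood function."""
    ball = {w: frozenset(u for u in worlds if d(w, u) <= a) for w in worlds}
    return frozenset(w for w in worlds if ball[w] <= phi)

phi = frozenset(range(2, 9))  # region {2, ..., 8}
print(sorted(A_leq(phi, 1)))  # [3, 4, 5, 6, 7]
print(sorted(A_leq(phi, 2)))  # [4, 5, 6]
print(all(A_leq(phi, a) == erode_by_ball(phi, a) for a in range(5)))     # True
print(all(A_leq(phi, b) <= A_leq(phi, a)
          for a in range(5) for b in range(a, 5)))                       # True
```

The last line checks the decreasingness used for (14.111): enlarging the radius can only shrink the set of worlds satisfying the modality.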

The second example concerns nearness logics (Aiello and van Benthem,
2002), where the notion of “nearer than” is modeled as:

(14.112) $x \models \langle N \rangle \varphi, \psi$ iff $\exists y, z\ (y \models \varphi \wedge z \models \psi) \wedge N(x, y, z)$,

where $N(x, y, z)$ means that $y$ is closer to $x$ than $z$ is. The meaning of this
expression is that the nearest point distance of $x$ to $\varphi$ is less than the nearest
point distance of $x$ to $\psi$. An equivalent expression is therefore:

(14.113) $x \models \delta_n(\psi) \to x \models \delta_n(\varphi)$,

which expresses that $x$ is reached faster from $\varphi$ than from $\psi$ by dilations of
these formulas.

Other links between linear logics or arrow logics and mathematical morphol- 
ogy exist, as already established in Aiello and van Benthem, 2002. 

Finally, let us consider logics of convexity (Aiello and van Benthem, 2002):

(14.114) $x \models C\varphi$ iff $\exists y, z,\ (y \models \varphi \wedge z \models \varphi) \wedge (x \in y - z)$,

which expresses a linear closure, the iteration of which provides convexity. This
iterative closure is clearly equivalent to morphological closing, where structuring
elements are segments in all directions of infinite length (in practice, larger than
the largest diameter of the considered spatial entities).

All these examples show interesting links between different spatial logics,
most of which have not been exhibited before. They can be exploited
in two ways: the properties of morphological operators can provide additional
theorems to these logics; conversely, spatial logics endow mathematical
morphology with powerful inference and reasoning tools.

Other links between mathematical morphology and non-classical logics are 
explored in Fujio and Bloch, 2004. 


5. Conclusion 


This chapter provides an overview of the algebraic basis of mathematical 
morphology. It shows how this lattice-theoretical formalism can be applied 
in different frameworks for spatial reasoning, thanks to the representation of 
shapes and of spatial relations it provides. In particular, it is highly adapted to 
the modelling of logical relations. 

Further links with other chapters in this book are worth mentioning,
such as mereotopology, modal logics of space and topology.

It should be stressed that we have not given here a complete overview of the 
methods, techniques and tools of mathematical morphology. Nor have we given 
any idea of the way to use them (alone or in conjunction with other approaches) 
in practical image processing problems. This by far exceeds the scope of this 
chapter. For this, we advise reading classical books such as Heijmans, 1994, 
Serra, 1982, Serra, 1988 and Soille, 2003, as well as image processing journals 
and conference proceedings. 

Complexity issues have not been addressed in this chapter, except a few 
words at the end of Sec. 1. The interested reader should consult standard books 
on morphological image processing (for example, Soille, 2003) for more details 
on algorithms and data structures for morphology. 




References 


Aiello, M. (2002). Spatial Reasoning, Theory and Practice. PhD thesis, Uni- 
versity of Amsterdam. 

Aiello, M. and van Benthem, J. (1999). Logical Patterns in Space. Technical 
Report UVA PP-99-24, University of Amsterdam. 

Aiello, M. and van Benthem, J. (2002). A Modal Walk Through Space. Journal 
of Applied Non Classical Logics, 12(3-4):319-364. 

Alexandroff, P. (1937). Diskrete Räume. Mat. Sb., 2:501-518.

Alexandroff, P. and Hopf, H. (1935). Topologie, Erster Band. Springer-Verlag, 
Berlin. 

Alexandroff, P. S. (1956). Combinatorial Topology, Vol. 1. Graylock Press, 
Rochester, NY. 

Asher, N. and Vieu, L. (1995). Toward a Geometry of Common Sense: A Seman- 
tics and a Complete Axiomatization of Mereotopology. In IJCAI’95, pages 
846-852, San Mateo, CA. 

Balbiani, P. and Orlowska, E. (1999). A Hierarchy of Modal Logics with Rel- 
ative Accessibility Relations. Journal of Applied Non-Classical Logics, 9: 
303-328. 

Bandemer, H. and Näther, W. (1992). Fuzzy Data Analysis. Theory and Decision
Library, Serie B: Mathematical and Statistical Methods. Kluwer Academic 
Publisher, Dordrecht. 

Bengoetxea, E., Larrañaga, P., Bloch, I., Perchant, A., and Boeres, C. (2002).
Inexact Graph Matching by Means of Estimation of Distribution Algorithms. 
Pattern Recognition, 35:2867—2880. 

Bennett, B. (1995). Modal Logics for Qualitative Spatial Reasoning. Bulletin 
of the IGPL, 4(1):23-45. 

Birkhoff, G. (1995). Lattice Theory. Vol. 25 of American Mathematical Society 
Colloquium Publications. American Mathematical Society, 3rd edition, 8th 
printing. 

Bloch, I. (1993). Triangular Norms as a Tool for Constructing Fuzzy Mathemat- 
ical Morphologies. In International Workshop on “Mathematical Morphol- 
ogy and its Applications to Signal Processing”, pages 157-161, Barcelona, 
Spain. 

Bloch, I. (1996). Information Combination Operators for Data Fusion: A Com- 
parative Review with Classification. IEEE Transactions on Systems, Man, 
and Cybernetics, 26(1):52-67. 

Bloch, I. (1999a). Fuzzy Relative Position between Objects in Image Process- 
ing: a Morphological Approach. IEEE Transactions on Pattern Analysis and 
Machine Intelligence, 21(7):657-664. 

Bloch, I. (1999b). On Fuzzy Distances and their Use in Image Processing under 
Imprecision. Pattern Recognition, 32(11):1873-1895. 




Bloch, I. (2000a). Geodesic Balls in a Fuzzy Set and Fuzzy Geodesic Mathe- 
matical Morphology. Pattern Recognition, 33(6):897-905. 

Bloch, I. (2000b). On Links between Mathematical Morphology and Rough 
Sets. Pattern Recognition, 33(9):1487-1496. 

Bloch, I. (2000c). Spatial Representation of Spatial Relationships Knowledge. 
In Cohn, A. G., Giunchiglia, F., and Selman, B., editors, 7th International 
Conference on Principles of Knowledge Representation and Reasoning KR 
2000, pages 247-258, Breckenridge, CO. Morgan Kaufmann, San Francisco, 
CA. 

Bloch, I. (2002). Modal Logics based on Mathematical Morphology for Spatial 
Reasoning. Journal of Applied Non Classical Logics, 12(3-4):399-424. 

Bloch, I. (2003a). On Fuzzy Spatial Distances. In Hawkes, P., editor, Advances in 
Imaging and Electron Physics, Volume 128, pages 51-122. 
Elsevier, Amsterdam. 

Bloch, I. (2003b). Unifying Quantitative, Semi-Quantitative and Qualitative 
Spatial Relation Knowledge Representations using Mathematical Morphol- 
ogy. In Asano, T., Klette, R., and Ronse, C., editors, LNCS 2616 Geometry, 
Morphology, and Computational Imaging, 11th International Workshop on 
Theoretical Foundations of Computer Vision, Dagstuhl Castle, Germany, 
April 7-12, 2002, Revised Papers, pages 153—164. Springer. 

Bloch, I., Géraud, T., and Maître, H. (2003). Representation and Fusion of
Heterogeneous Fuzzy Information in the 3D Space for Model-Based Struc- 
tural Recognition - Application to 3D Brain Imaging. Artificial Intelligence, 
148:141-175. 

Bloch, I. and Lang, J. (2000). Towards Mathematical Morpho-Logics. In 8th 
International Conference on Information Processing and Management of 
Uncertainty in Knowledge based Systems IPMU 2000, volume III, pages 
1405-1412, Madrid, Spain. 

Bloch, I. and Maître, H. (1995). Fuzzy Mathematical Morphologies: A Com-
parative Study. Pattern Recognition, 28(9):1341-1387. 

Bloch, I., Maître, H., and Anvari, M. (1997). Fuzzy Adjacency between Image
Objects. International Journal of Uncertainty, Fuzziness and Knowledge- 
Based Systems, 5(6):615-653. 

Bloch, I., Pino-Pérez, R., and Uzcátegui, C. (2001). Explanatory Relations 
based on Mathematical Morphology. In ECSQARU 2001, pages 736-747, 
Toulouse, France. 

Bloch, I., Pino-Pérez, R., and Uzcátegui, C. (2004). A Unified Treatment of
Knowledge Dynamics. In International Conference on the Principles of 
Knowledge Representation and Reasoning, KR2004, pages 329-337, Canada. 

Bloch, I. and Ralescu, A. (2003). Directional Relative Position between Objects 
in Image Processing: A Comparison between Fuzzy Approaches. Pattern 
Recognition, 36:1563-1582. 




Bloch, I. and Saffiotti, A. (2002). On the Representation of Fuzzy Spatial
Relations in Robot Maps. In IPMU 2002, volume III, pages 1587-1594, Annecy,
France.

Boldrin, L. and Saffiotti, A. (1995). A Modal logic for merging Partial Belief of 
Multiple Reasoners. Technical Report TR/IRIDIA/95-19, IRIDIA, Univer- 
sité Libre de Bruxelles. 

Bouchon-Meunier, B., Rifqi, M., and Bothorel, S. (1996). Towards General 
Measures of Comparison of Objects. Fuzzy Sets and Systems, 84(2): 
143-153. 

Cesar, R., Bengoetxea, E., and Bloch, I. (2002). Inexact Graph Matching using 
Stochastic Optimization Techniques for Facial Feature Recognition. In In- 
ternational Conference on Pattern Recognition ICPR 2002, volume 2, pages 
465-468, Québec. 

Chellas, B. (1980). Modal Logic, an Introduction. Cambridge University Press, 
Cambridge. 

Cohn, A., Bennett, B., Gooday, J., and Gotts, N. M. (1997). Representing and 
Reasoning with Qualitative Spatial Relations about Regions. In Stock, O., 
editor, Spatial and Temporal Reasoning, pages 97-134. Kluwer. 

Colliot, O., Camara, O., Dewynter, R., and Bloch, I. (2004). Description of 
Brain Internal Structures by Means of Spatial Relations for MR Image Seg- 
mentation. In SPIE Medical Imaging, volume 5370, pages 444-455, San 
Diego, CA, USA. 

Cristani, M., Cohn, A. G., and Bennett, B. (2000). Spatial Locations via Morpho- 
Mereology. In Cohn, A. G., Giunchiglia, F., and Selman, B., editors, 7th 
International Conference on Principles of Knowledge Representation and 
Reasoning KR 2000, pages 15-25, Breckenridge, CO. Morgan Kaufmann, 
San Francisco, CA. 

Dalal, M. (1988). Investigations into a Theory of Knowledge Base Revision: 
Preliminary Report. In AAAI’88, pages 475-479.

de Baets, B. (1995). Idempotent Closing and Opening Operations in Fuzzy 
Mathematical Morphology. In ISUMA-NAFIPS’95, pages 228-233, College 
Park, MD. 

de Baets, B. (1997). Fuzzy Morphology: a Logical Approach. In Ayyub, B. and 
Gupta, M., editors, Uncertainty in Engineering and Sciences: Fuzzy Logic, 
Statistics and Neural Network Approach, pages 53—67. Kluwer Academic. 

de Baets, B. and Kerre, E. (1993). An Introduction to Fuzzy Mathematical 
Morphology. In NAFIPS’93, pages 129-133, Allentown, Pennsylvania. 

Deng, T.-Q. and Heijmans, H. (2002). Grey-Scale Morphology Based on Fuzzy 
Logic. Journal of Mathematical Imaging and Vision, 16:155-171. 

di Gesu, V. (1988). Mathematical Morphology and Image Analysis: A Fuzzy 
Approach. In Workshop on Knowledge-Based Systems and Models of Logical 
Reasoning, Reasoning. 


940 HANDBOOK OF SPATIAL LOGICS 


di Gesu, V., Maccarone, M. C., and Tripiciano, M. (1993). Mathematical Mor- 
phology based on Fuzzy Operators. In Lowen, R. and Roubens, M., editors, 
Fuzzy Logic, pages 477-486. Kluwer Academic. 

Dubois, D., Esteva, F., Garcia, P., Godo, L., and Prade, H. (1997). A Logical 
Approach to Interpolation based on Similarity Relations. International Jour- 
nal of Approximate Reasoning, 17(1):1—36. 

Dubois, D. and Jaulent, M.-C. (1987). A General Approach to Parameter Eval- 
uation in Fuzzy Digital Pictures. Pattern Recognition Letters, 6:251—259. 
Dubois, D. and Prade, H. (1980). Fuzzy Sets and Systems: Theory and Appli- 

cations. Academic Press, New-York. 

Dubois, D. and Prade, H. (1983). Inverse Operations for Fuzzy Numbers. In 
Sanchez, E. and Gupta, M., editors, Fuzzy Information, Knowledge Represen- 
tation and Decision Analysis, IFAC Symposium, pages 391-396, Marseille, 
France. 

Dubois, D. and Prade, H. (1985). A Review of Fuzzy Set Aggregation Connec- 
tives. Information Sciences, 36:85—121. 

Dubois, D. and Prade, H. (1990). Rough Fuzzy Sets and Fuzzy Rough Sets. 
International Journal of General Systems, 17:191-209. 

Dubois, D. and Prade, H. (1991). Fuzzy Sets in Approximate Reasoning, Part I: 
Inference with Possibility Distributions. Fuzzy Sets and Systems, 40: 
143-202. 

Dubois, D., Prade, H., and Testemale, C. (1988). Weighted Fuzzy Pattern Match- 
ing. Fuzzy Sets and Systems, 28:313-331. 

Dubois, D., Prade, H., and Yager, R. (1999). Merging Fuzzy Information. In 
Bezdek, J.C., Dubois, D., and Prade, H., editors, Handbook of Fuzzy Sets 
Series, Approximate Reasoning and Information Systems, chapter 6. Kluwer. 

Diintsch, I. and Gediga, G. (2003). Approximation operators in qualitative data 
analysis. In de Swart, H., Orlowska, E., Schmidt, G., and Roubens, M., edi- 
tors, Theory and Application of Relational Structures as Knowledge Instru- 
ments, volume 2929 of Lecture Notes in Computer Science. Springer Verlag, 
Heidelberg. 

Emptoz, H. (1983). Modèle prétopologique pour la reconnaissance des formes. 
Applications en neurophysiologie. Thèse de Doctorat d’Etat, Univ. Claude 
Bernard, Lyon I, Lyon, France. 

Esteva, F., Garcia, P., and Godo, L. (1997). A Modal Account of Similari- 
ty-Based Reasoning. International Journal of Approximate Reasoning, 16: 
235-260. 

Everett, C. J. (1944). Closure Operators and Galois Theory in Lattices. Trans. 
Amer. Math. Soc., 55:514—525. 

Fujio, M. and Bloch, I. (2004). Non-Classical Logic via Mathematical Morphol- 
ogy. Technical Report 2004D010, Ecole Nationale Supérieure des 
Télécommunications. 


Mathematical Morphology 941 


Galton, A. (2000). Continous Motion in Discrete Space. In Cohn, 
A. G., Giunchiglia, F., and Selman, B., editors, 7th International Confer- 
ence on Principles of Knowledge Representation and Reasoning KR 2000, 
pages 26-37, Breckenridge, CO. Morgan Kaufmann, San Francisco, CA. 

Ganter, B. and Wille, R. (1999). Formal Concept Analysis. Springer Verlag. 

Giardina, C. R. and Sinha, D. (1989). Image Processing using Pointed Fuzzy 
Sets. In SPIE Intelligent Robots and Computer Vision VIII: Algorithms and 
Techniques, volume 1192, pages 659-668. 

Ginsberg, M. L., Parkes, A. J., and Roy, A. (1998). Supermodels and Robustness. 
In Fifteenth National Conference on Artificial Intelligence AAAI’98, pages 
334-339, Madison, Wisconsin. 

Goetcherian, V. (1980). From Binary to Grey Tone Image Processing using 
Fuzzy Logic Concepts. Pattern Recognition, 12:7-15. 

Hadwiger, H. (1950). Minkowskische Addition und Subtraktion beliebiger Punk- 
tmengen und die Theoreme von Erhard Schmidt. Math. Z., 53:210-218. 
Heijmans, H. J. A. M. (1991). Theoretical Aspects of Gray-Level Morphology. 

IEEE Trans. Pattern Analysis and Machine Intelligence, 13:568—582. 

Heijmans, H. J. A. M. (1994). Morphological Image Operators. Academic 
Press, Boston. 

Heijmans, H. J. A. M. (1997). Composing Morphological Filters. IEEE Trans. 
Image Processing, 6(5):713-723. 

Heijmans, H. J. A. M. and Ronse, C. (1990). The Algebraic Basis of Mathemati- 
cal Morphology — Part I: Dilations and Erosions. Computer Vision, Graphics 
and Image Processing, 50:245-295. 

Hughes, G. E. and Cresswell, M. J. (1968). An Introduction to Modal Logic. 
Methuen, London, UK. 

Jeansoulin, R. and Mathieu, C. (1994). Une logique modale des inférences 
spatiales. Revue Internationale de Géomatique, 4:369-384. 

Katsuno, H. and Mendelzon, A. O. (1991). Propositional Kowledge Base Re- 
vision and Minimal Change. Artificial Intelligence, 52:263—294. 

Kaufmann, A. and Gupta, M. M. (1988). Fuzzy Mathematical Models in Engi- 
neering and Management Science. North-Holland, Amsterdam. 

Keller, J. M. and Wang, X. (1995). Comparison of Spatial Relation Definitions 
in Computer Vision. In ISUMA-NAFIPS’95, pages 679-684, College Park, 
MD. 

Konieczny, S. and Pino-Pérez, R. (1998). On the Logic of Merging. In 6th 
International Conference on Principles of Knowledge Representation and 
Reasoning, pages 488—498, Trento, Italy. 

Krishnapuram, R., Keller, J. M., and Ma, Y. (1993). Quantitative Analysis of 
Properties and Spatial Relations of Fuzzy Image Regions. IEEE Transactions 
on Fuzzy Systems, 1(3):222-233. 


942 HANDBOOK OF SPATIAL LOGICS 


Kutz, O., Sturm, H., Suzuki, N.-Y., Wolter, F., and Zakharyaschev, M. (2002). 
Axiomatizing Distance Logics. Journal of Applied Non Classical Logics, 
12(3-4):425-440. 

Lafage, C. and Lang, J. (2000). Logical Representation of Preferences for Group 
Decision Making. In Cohn, A. G., Giunchiglia, F., and Selman, B., editors, 7th 
International Conference on Principles of Knowledge Representation and 
Reasoning KR 2000, pages 457—468, Breckenridge, CO. Morgan Kaufmann, 
San Francisco, CA. 

Laplante, P. A. and Giardina, C. R. (1991). Fast Dilation and Erosion of Time 
Varying Grey Valued Images with Uncertainty. In SPIE Image Algebra and 
Morphological Image Processing I, volume 1568, pages 295-302. 

Lin, T. Y. (1995). Neighborhood Systems: A Qualitative Theory for Fuzzy and 
Rough Sets. In 2nd Annual Joint Conference on Information Science, pages 
255-258, Wrightsville Beach, NC. 

Lin, T. Y. and Liu, Q. (1994). Rough Approximate Operators: Axiomatic Rough 
Sets Theory. In Ziarko, W. P., editor, Rough Sets, Fuzzy Sets and Knowledge 
Discovery, pages 256-260. Springer Verlag, London. 

Maragos, P. (2005). Lattice Image Processing: A Unification of Morphological 
and Fuzzy Algebraic Systems. Journal of Mathematical Imaging and Vision, 
22:333-353. 

Maragos, P., Tzouvaras, V., and Stamou, G. (2001). Synthesis and Applications 
of Lattice Image Operators Based on Fuzzy Norms. In IEEE Int’l Conference 
on Image Processing (ICIP-2001), pages 521-524, Thessaloniki, Greece. 

Matheron, G. (1975). Random Sets and Integral Geometry. J. Wiley & Sons, 

New York. 

Matsakis, P. and Wendling, L. (1999). A New Way to Represent the Relative 

Position between Areal Objects. IEEE Trans. on Pattern Analysis and Ma- 

chine Intelligence, 21(7):634-642. 

Menger, K. (1942). Statistical Metrics. Proc. National Academy of Sciences 

USA, 28:535-537. 

Minkowski, H. (1903). Volumen und Oberfläche. Math. Ann., 57:447-495. 

Miyajima, K. and Ralescu, A. (1994). Spatial Organization in 2D Segmented Im- 
ages: Representation and Recognition of Primitive Spatial Relations. Fuzzy 
Sets and Systems, 65:225-236. 

Nachtegael, M. and Kerre, E. E. (2000). Classical and Fuzzy Approaches to- 
wards Mathematical Morphology. In Kerre, E. E. and Nachtegael, M., edi- 
tors, Fuzzy Techniques in Image Processing, Studies in Fuzziness and Soft 
Computing, chapter 1, pages 3—57. Physica- Verlag, Springer. 

Nachtegael, M., Kerre, E. E., and Raszikowska, A. M. (2000). On Links between 
Fuzzy Morphology and Fuzzy Rough Sets. In Information Processing and 
Management of Uncertainty IPMU 2000, pages 1381-1388, Madrid. 





Mathematical Morphology 943 


Nakatsuyama, M. (1993). Fuzzy Mathematical Morphology for Image Process- 
ing. In ANZIIS-93, pages 75-79, Perth, Western Australia. 

Ore, O. (1944). Galois Connexions. Trans. Amer. Math. Soc., 55:493-513. 

Orlowska, E. (1993). Rough Set Semantics for Non-Classical Logics. In In- 
ternational Workshop on Rough Sets, Fuzzy Sets and Knowledge Discovery, 
pages 143-148, Banff, Canada. 

Pawlak, Z. (1982). Rough Sets. International Journal of Information and Com- 
puter Science, 11(5):341-356. 

Pawlak, Z. (1987). Rough Logic. Bulletin of the Polish Academy of Sciences, 
35(5-6):253-258. 

Perchant, A. and Bloch, I. (2002). Fuzzy Morphisms between Graphs. Fuzzy 
Sets and Systems, 128(2):149-168. 

Perchant, A., Boeres, C., Bloch, I., Roux, M., and Ribeiro, C. (1999). Model- 
based Scene Recognition Using Graph Fuzzy Homomorphism Solved by 
Genetic Algorithm. In GbR’99 2nd International Workshop on Graph-Based 
Representations in Pattern Recognition, pages 61-70, Castle of Haindorf, 
Austria. 

Pino-Pérez, R. and Uzcátegui, C. (1999). Jumping to Explanations versus jump- 
ing to Conclusions. Artificial Intelligence, 111:131-169. 

Polkowski, L. (1998). Rough Set Approach to Mathematical Morphology: Ap- 
proximate Compression of Data. In Information Processing and Manage- 
ment of Uncertainty IPMU’98, pages 1183-1189, Paris. 

Popov, A. T. (1995). Morphological Operations on Fuzzy Sets. In JEE Image 
Processing and its Applications, pages 837-840, Edinburgh, UK. 

Randell, D., Cui, Z., and Cohn, A. (1992). A Spatial Logic based on Regions 
and Connection. In Nebel, B., Rich, C., and Swartout, W., editors, Principles 
of Knowledge Representation and Reasoning KR’92, pages 165-176, San 
Mateo, CA. Kaufmann. 

Renz, J. and Nebel, B. (2001). Efficient Methods for Qualitative Spatial Rea- 
soning. Journal of Artificial Intelligence Research. 

Ronse, C. (1990). Why Mathematical Morphology Needs Complete Lattices. 
Signal Processing, 21(2):129-154. 

Ronse, C. (2003). Flat Morphological Operators on Arbitrary Power Lattices. 
In Asano, T., Klette, R., and Ronse, C., editors, LNCS 2616 Geometry, Mor- 
phology, and Computational Imaging, 11th International Workshop on The- 
oretical Foundations of Computer Vision, Dagstuhl Castle, Germany, April 
7-12, 2002, Revised Papers, pages 1-21. Springer. 

Ronse, C. and Heijmans, H.J.A.M. (1991). The Algebraic Basis of Mathematical 
Morphology — Part II: Openings and Closings. Computer Vision, Graphics 
and Image Processing: Image Understanding, 54:74-97. 

Ronse, C. and Tajine, M. (2004). Morphological Sampling of Closed Sets. 
Image Analysis & Stereology, 23:89-109. 


944 HANDBOOK OF SPATIAL LOGICS 


Rosenfeld, A. (1979). Fuzzy Digital Topology. Information and Control, 40: 
76-87. 

Rosenfeld, A. (1984). The Fuzzy Geometry of Image Subsets. Pattern Recog- 
nition Letters, 2:311—317. 

Rosenfeld, A. (1985). Distances between Fuzzy Sets. Pattern Recognition Let- 
ters, 3:229-—233. 

Rosenfeld, A. and Kak, A. C. (1976). Digital Picture Processing. Academic 
Press, New-York. 

Schonfeld, D. and Goutsias, J. (1991). Optimal Morphological Pattern Restora- 
tion from Noisy Binary Images. IEEE Trans. Pattern Analysis and Machine 
Intelligence, 13:14-29. 

Schweizer, B. and Sklar, A. (1963). Associative Functions and Abstract Semi- 
groups. Publ. Math. Debrecen, 10:69-81. 

Serra, J. (1982). Image Analysis and Mathematical Morphology. Academic 
Press, London. 

Serra, J., editor (1988). Image Analysis and Mathematical Morphology, Vol. 2: 
Theoretical Advances. Academic Press, London. 

Sinha, D. and Dougherty, E. (1992). Fuzzy Mathematical Morphology. Journal 
of Visual Communication and Image Representation, 3(3):286—302. 

Sinha, D. and Dougherty, E. R. (1993). Fuzzification of Set Inclusion: Theory 
and Applications. Fuzzy Sets and Systems, 55:15-42. 

Sinha, D., Sinha, P., Dougherty, E. R., and Batman, S. (1997). Design and 
Analysis of Fuzzy Morphological Algorithms for Image Processing. IEEE 
Trans. on Fuzzy Systems, 5(4):570-584. 

Soille, P. (2003). Morphological Image Analysis: Principles and Applications 
(2nd edition). Springer Verlag. 

Varzi, A. (1996). Parts, Wholes, and Part-Whole Relations: The Prospects of 
Mereotopology. Data and Knowledge Engineering, 20(3):259-286. 

Yager, R. R. (1991). Connectives and Quantifiers in Fuzzy Sets. Fuzzy Sets and 
Systems, 40:39-75. 

Yao, T. Y. (1998). Contructive and Algebraic Methods of the Theory of Rough 
Sets. Journal of Information Science, 109:21-47. 

Yao, Y. Y. and Lin, T. Y. (1996). Generalization of Rough Sets using Modal 
Logics. Intelligent Automation and Soft Computing, 2(2):103-120. 

Zadeh, L. A. (1975). The Concept of a Linguistic Variable and its Application 
to Approximate Reasoning. Information Sciences, 8:199-249. 

Zadeh, L. A. (1979). Fuzzy Sets and Information Granularity. In Gupta, M., 
Ragade, R., and Yager, R., editors, Advances in Fuzzy Set Theory and Appli- 
cations, pages 3-18. North-Holland, Amsterdam. 


Chapter 15 


SPATIAL REASONING AND ONTOLOGY: 
PARTS, WHOLES, AND LOCATIONS 


Achille C. Varzi 


Columbia University 


Second Reader 


Antony Galton 
University of Exeter 


Spatial reasoning is no abstract business. It is, to a great extent, reasoning 
about entities located in space, and such entities have spatial structure. If the 
table is in the kitchen, then it follows that the table top is in the kitchen, too, 
and it follows because the top is part of the table. If the concert took place at 
the stadium, then it didn’t take place in the theater, for concerts are spatially 
continuous. Even when we reason about empty places, we typically do so with 
an eye to the anatomy of their potential tenants: space as such is perceptually 
remote and we can hardly understand its structure without imagining what could 
fill the void. 

This general feature of our spatial competence might suggest a deep meta- 
physical truth, to the effect that concrete entities such as objects and events are 
fundamentally prior to, and independent of, their spatial receptacles. It might 
even suggest that space itself is just a fiction, a picture of some kind: really 
there are only objects and events spatially related to one another in various 
ways. Such was, for instance, the gist of Leibniz’s stern relationism against 
Newton’s substantivalism, in spite of the major role the idea of space plays 
in the sciences. At the same time, one might argue that our understanding of 
the spatial structure of objects and events, including their spatial relationships,
depends significantly on our understanding of the structure of space per se: that 
the spatial features we attribute to objects and events are somehow inherited 
from those of the spatial regions they occupy. Thus, for example, we are in- 
clined to say that ordinary objects have parts insofar as their spatial regions have 
parts. We may be inclined to say that the top is part of the table because of its
salience and functional role, but we may just as easily talk about the top half of 
a sphere, or its inner parts, in spite of their lacking any cognitive or functional 
salience: we identify (and reify) such parts in terms of the parts of the region 
the sphere occupies. 

This tension (not to say this ambiguity) between concrete object-oriented 
thinking and abstract space-oriented thinking is responsible for many of the 
philosophical issues that lie behind any formal theory of spatial reasoning. On 
the one hand, it is natural (if not necessary) to supplement the theory with an 
explicit account of what kinds of thing may enter into its scope, an account of 
the sorts of entity that can be located in space—in short, an account of what 
may be collected under the rubric of ‘spatial entities’. On the other hand, 
we also want the theory to be independent of any specific ontological biases 
we might have. Whatever spatial entities we are inclined to build into the 
basic furniture of the world—subatomic particles, middle-size objects of the 
garden variety, large scattered entities such as crowds, forests, archipelagos, 
galaxies—our reasoning about their spatial properties and relations appears to 
be governed by the same general principles, and it is natural to think that such 
principles must reflect our understanding of the structural features of the spatial 
environment in which such entities are located. In short, although a theory of 
spatial reasoning may be viewed as an example of applied logic, the logic of 
certain spatial concepts and relations, it may also be regarded as an example 
of a formal theory whose principles do not necessarily depend on the intended 
domain of application except to the extent that the domain must include entities 
that properly qualify as spatial entities of some sort. 

The purpose of this chapter is to take a closer look at these delicate matters. 
Rather than doing this in general, however, we shall look at how the subtle 
interplay between purely spatial intuitions and intuitions about concrete spatial 
entities shows up in the construction of a formal theory. More specifically, 
we shall consider three sorts of theory, each of which occupies a prominent 
position in recent literature: (1) mereology, or the theory of parthood relations; 
(2) topology, broadly understood as a theory of qualitative spatial relations 
such as continuity and contiguity; and (3) the theory of location proper, which 
deals explicitly with the relationship between an entity and the spatial region it 
occupies. Arguably, such theories may be viewed as jointly contributing to an 
overall appraisal of our spatial competence, and over the last few years there 
has been considerable progress in each direction. At this point there is some 
need for a philosophical pause, and our purpose in this chapter is to go some 
way in the direction of a systematic assessment. 


1. Philosophical issues in mereology


Let us begin with mereology. This is often defined as the theory of the part- 
whole relation, but such a definition is misleading. It suggests that mereology 
has something to say about both parts and wholes, which is not true. As we 
shall see in Sec. 2, the notion of a whole goes beyond the conceptual resources 
of mereology and calls for topological concepts and principles of various sorts. 
By itself, mereology is best understood as the theory of the parthood relation, 
regardless of whether the second term of the relation may be said to qualify as 
a whole entity. Thus, for instance, it is a mereological fact that my hand is part 
of my arm just as it is a mereological fact that it is part of my body, although 
it may plausibly be argued that only my body qualifies as a whole (maximally 
connected) object; my arm doesn’t. 

It is also worth pointing out that mereology has a long pedigree, which 
makes it a central chapter, not only of formal theories concerned with spatial 
reasoning, but of any theory in the realm of formal logic and ontology broadly 
understood. Its roots can be traced back to the early days of philosophy, be- 
ginning with the Pre-Socratic atomists and continuing throughout the writings 
of Plato (especially the Parmenides and the Theaetetus), Aristotle (the Metaphysics,
but also the Physics, the Topics, and De partibus animalium), and
Boethius (In Ciceronis Topica). Mereology occupies a prominent role also in 
the writings of medieval ontologists and scholastic philosophers such as Peter 
Abelard, Thomas Aquinas, Raymond Lull, and Albert of Saxony, as well as in 
Jungius’s Logica Hamburgensis (1638), Leibniz’s Dissertatio de arte combi- 
natoria (1666) and Monadology (1714), and Kant’s early writings (especially 
the Monadologia physica of 1756). As a formal theory of the parthood relation, 
however, mereology made its way into our times mainly through the work of 
Franz Brentano and of his pupils, especially Husserl’s third Logical Investiga- 
tion (1901). The latter may rightly be considered the first attempt at a rigorous 
formulation of the theory, though in a format that makes it difficult to disen- 
tangle the analysis of mereological concepts from that of other formal notions 
(such as the relation of ontological dependence). It is not until Lesniewski’s 
Foundations of a General Theory of Manifolds (1916) that a pure theory of 
parthood was given an exact formulation. And because Lesniewski’s work was 
largely inaccessible to non-speakers of Polish, it is only with the publication 
of Leonard and Goodman’s The Calculus of Individuals (1940) that mereology 
has become a chapter of central interest for modern ontologists and logicians. 
Indeed, although Lesniewski’s and Leonard and Goodman’s theories came in 
different logical guises, they are sufficiently similar to be recognized as a common
basis for most subsequent developments. The question that interests us
here is how such developments, and the variety of motivations that lie behind
them, reflect and affect our understanding of mereology as a formal theory of
spatial reasoning.¹


1.1 ‘Part’ and parthood 


To this end, the first thing to observe is that the word ‘part’ has many different 
meanings in ordinary language, not all of which correspond to the same relation. 
In a way, it can be used to indicate any portion of a given entity, regardless of 
whether the portion itself is attached to the remainder, as in (1), or detached,
as in (2); cognitively salient, as in (1)-(2), or arbitrarily demarcated, as in (3); 
self-connected, as in (1)-(3), or disconnected, as in (4); homogeneous, as in 
(1)-(4), or gerrymandered, as in (5); material, as in (1)-(5), or immaterial, as 
in (6); extended, as in (1)-(6), or unextended, as in (7); spatial, as in (1)-(7), or 
temporal, as in (8); and so on. 


(1) The handle is part of the mug. 

(2) This cap is part of your pen. 

(3) The left half is your part of the cake. 

(4) The cutlery is part of the tableware. 

(5) This stuff is only part of what I bought. 

(6) That area is part of the living room. 

(7) The outermost points are part of the perimeter.

(8) The first act was the best part of the play.


All of these cases illustrate the notion of parthood that forms the focus of 
mereology. Often, however, the English word ‘part’ is used in a restricted 
sense. For instance, it may be used to designate only the cognitively salient 
relation illustrated in (1) and (2). In this sense, the parts of an object x are 
just its ‘components’, i.e., those parts that are available as individual units 
regardless of their interaction with the other parts of x. (A component is a part 
of an object, rather than just part of it; see e.g. Tversky 1989). Clearly, the 
properties of such restricted relations may not coincide with those of parthood 
broadly understood, so the principles of mereology should not be expected to 
carry over automatically. 

Also, the word ‘part’ is sometimes used in a broader sense, for instance 
to designate the relation of material constitution, as in (9), or the relation of 
mixture composition, as in (10), or even a relation of conceptual inclusion, as 
in (11): 


(9) The clay is part of the statue.

(10) Gin is part of martini.

(11) Writing detailed comments is part of being a good referee.


¹For a historical survey of mereology, see Henry (1991), Burkhardt and Dufour (1991), and Simons (1991c).
For systematic comparisons, see Simons (1987) and Ridder (2002).


The mereological status of these relations, however, is controversial. The con- 
stitution relation exemplified in (9) was included by Aristotle in his threefold 
taxonomy (Metaphysics, A, 1023b), but many contemporary authors would 
rather construe it as a sui generis, non-mereological relation (see Rea 1997 and 
references therein).² Similarly, the ingredient-mixture relationship exemplified
in (10) is subject to controversy, as the ingredients may involve significant
structural connections besides spatial proximity and may therefore fail to retain 
important characteristics they have in isolation (see Sharvy 1983). As for state- 
ments such as (11), it may simply be contended that the term ‘part’ appears only 
in the surface grammar and disappears at the level of logical form, for instance 
if (11) is paraphrased as ‘A good referee is one who writes detailed comments’. 
For more examples and tentative taxonomies, see Winston et al. (1987), Gerstl 
and Pribbenow (1995), and Iris et al. (1988). 

Finally, it is worth stating explicitly that mereology is typically construed as 
a piece of formal ontology, i.e., a theory of certain formal properties and rela- 
tions that are exemplified across a wide range of domains, whatever the nature 
of the entities in question. Thus, although both Lesniewski’s and Leonard and 
Goodman’s original theories betray a nominalistic stand, reflecting a conception 
of mereology as a parsimonious alternative to set theory,³ most contemporary
formulations assume no ontological restriction on the field of ‘part’. The re- 
lata can be individual entities as in (1)-(8), but also abstract entities such as 
propositions, sets, types, or properties, as in: 


(12) That premise is part of my argument. 

(13) The domain of quantification is part of the model.

(14) The colon is part of the title.

(15) Humanity is part of personhood. 


(The example in (11) may perhaps be read as expressing a mereological relation 
between properties, too.) This ‘ontological innocence’ of mereology plays 
of course an important role in the appraisal of what principles should hold 
unrestrictedly: greater generality means fewer axioms, and here the tension 
between the tasks of an applied logic and those of a purely formal theory shows 
up most vividly. In the following we focus primarily on the spatially salient
uses of ‘part’, but it is important to keep this tension in mind when it comes to
assessing the philosophical underpinnings of the most controversial tenets of
mereology.

²Actually, if the statue is identified with the lump of clay, as some would argue (e.g. Noonan 1993 vs. Johnston
1993), and if identity is treated as a limit case of (improper) parthood, as we shall indeed suppose, then the
relation of material constitution is a mereological relation. This, however, is the subject of controversy and
we shall come back to it in due time.

³To be sure, the original calculus of individuals had variables for classes; the class-free version is due to
Goodman (1951). On the link between mereology and nominalism, see Eberle (1970).


1.2 Basic principles 


With these provisos, let us proceed to unpack the theory. We may ideally 
distinguish two sorts of mereological principles. On the one hand, there are 
principles that may be thought of as purely ‘lexical’ axioms fixing the intended 
meaning of the relational predicate ‘part’. On the other, there are principles that 
go beyond the obvious and aim at greater sophistication and descriptive power. 
Exactly where the boundary should be drawn, however, is by itself a matter of 
controversy. 


1.2.1 Parthood as a partial ordering. The obvious is this: regardless 
of how one feels about matters of ontology, if ‘part’ stands for the general rela- 
tion exemplified by all of (1)-(8) above, then it stands for a partial ordering—a 
reflexive, transitive, antisymmetric relation: 


(16) Everything is part of itself. 
(17) Any part of any part of a thing is itself part of that thing. 
(18) Two distinct things cannot be part of each other. 


As it turns out, virtually every theory put forward in the literature accepts (16)-(18),
though it is worth mentioning some misgivings that have occasionally
been raised. 

Concerning reflexivity, one might observe that many legitimate senses of 
‘part’ do not countenance saying that a whole is a part of itself. For instance, 
Rescher (1955, p. 10) cited the biologists’ use of ‘part’ for the functional subunits 
of an organism as a case in point. This is of little import, though. Taking 
reflexivity as constitutive of the meaning of ‘part’ amounts to regarding identity 
as a limit (improper) case of parthood. A stronger relation, whereby nothing 
counts as part of itself, can obviously be defined in terms of the weaker one, 
hence there is no loss of generality (see Sec. 1.2.2). Vice versa, one could frame 
a mereological theory by taking proper parthood as a primitive instead. This is 
merely a question of choosing a suitable primitive. 

The transitivity principle, (17), is more controversial. Several authors have 
observed that many legitimate senses of ‘part’ are non-transitive. Examples 
would include: (i) a biological subunit of a cell is not a part of the organ of 
which that cell is a part; (ii) a handle can be part of a door and the door of a 
house, though a handle is never part of a house; (iii) my finger is part of me and 
I am part of the team, yet my finger is not part of the team. (See again Rescher 
1955, Cruse 1979, and Winston et al. 1987, respectively; for other examples 
see Iris et al. 1988, Moltmann 1997, and Johansson 2004 inter alia). Arguably,
however, such misgivings stem again from the ambiguity of ‘part’. What counts
as a biological subunit of a cell may not count as a subunit, i.e., a distinguished 
part of the organ, but that is not to say that it is not part of the organ at all. 
Similarly, if there is a sense of ‘part’ in which a handle is not part of the house 
to which it belongs, or my finger not part of my team, it is a restricted sense: 
the handle is not a functional part of the house (a ‘component’), though it is a 
functional part of the door and the door a functional part of the house; my finger 
is not directly part of the team, though it is directly part of me and I am directly 
part of the team. It is obvious that if the interpretation of ‘part’ is narrowed by 
additional conditions (e.g., by requiring that parts make a functional or direct 
contribution to the whole), then transitivity may fail. In general, if x is a φ-part
of y and y is a φ-part of z, x need not be a φ-part of z: the predicate modifier
‘φ’ may not distribute over parthood. But that shows the non-transitivity of
‘φ-part’, not of ‘part’. And within a sufficiently general framework this can
easily be expressed with the help of explicit predicate modifiers (Varzi 2005). In 
any event, it seems clear that spatial parthood is transitive: whether we construe 
this as a restricted notion or identify it with the general notion of parthood, 
(17) holds. 
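
The contrast between a restricted reading and the general reading of ‘part’ can be illustrated on a toy domain. The following Python sketch is only an illustration under stated assumptions: the names `direct_part`, `closure`, and `is_transitive` are hypothetical, and modeling general parthood as the reflexive-transitive closure of a ‘direct part’ relation is a simplification adopted here, not a claim made by any of the theories discussed.

```python
from itertools import product

def is_transitive(rel):
    """True if (x, y) and (y, z) in rel always imply (x, z) in rel."""
    return all((x, z) in rel
               for (x, y) in rel
               for (w, z) in rel
               if y == w)

def closure(rel, domain):
    """Reflexive-transitive closure of rel over domain (naive fixpoint)."""
    result = set(rel) | {(x, x) for x in domain}
    changed = True
    while changed:
        changed = False
        # Iterate over a snapshot so that result can grow inside the loop.
        for (x, y), (w, z) in product(list(result), repeat=2):
            if y == w and (x, z) not in result:
                result.add((x, z))
                changed = True
    return result

# Hypothetical toy domain taken from example (iii) in the text.
domain = {"finger", "me", "team"}
direct_part = {("finger", "me"), ("me", "team")}  # restricted sense (assumed)
part = closure(direct_part, domain)               # general sense (assumed model)

print(is_transitive(direct_part))   # the restricted relation fails transitivity
print(is_transitive(part))          # the general relation satisfies it
print(("finger", "team") in part)   # the finger is part of the team, though not directly
```

On this modeling choice the failure of transitivity belongs to the restricted relation alone, which is the diagnosis given above.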

Finally, concerning the antisymmetry postulate (18), two sorts of worry are 
worth mentioning. On the one hand, some authors maintain that the relation- 
ship between an object and the stuff it is made of provides a perfectly ordinary 
example of symmetric parthood: according to Thomson (1998), for example, 
a statue and the clay it is made of are part of each other, yet distinct. This 
is highly controversial and there is a large philosophical literature devoted to
this topic (see e.g. the papers in Rea 1997). For the moment, let us simply 
observe that the example trades once again on the ambiguity of ‘part’. We have 
already mentioned that material constitution is best regarded as a sui generis, 
non-mereological relation. Whether this relation may obtain between two spa- 
tially coincident objects is an interesting question, but we should postpone its 
discussion to where it belongs: the theory of spatial location (Sec. 3.3). On the 
other hand, one may wonder about the possibility of unordinary cases of sym- 
metric parthood relationships. Sanford (1993, p. 222) refers to Borges’s Aleph 
as a case in point: ‘I saw the earth in the Aleph and in the earth the Aleph once 
more and the earth in the Aleph ...’. In this case, a plausible reply is simply 
that fiction delivers no guidance to conceptual investigations: conceivability 
may well be a guide to possibility, but literary fantasy is by itself no evidence 
of genuine conceivability (van Inwagen 1993, p. 229). Still, one may observe 
that the possibility of mereological loops is not pure fantasy. In view of certain 
developments in non-well-founded set theory (Aczel 1988), one might indeed 
suggest building mereology on the basis of a notion of parthood that may violate 
(18). This is particularly significant insofar as set theory itself may be refor- 
mulated in mereological terms—a possibility that is explored in the works of 
Bunt (1985) and especially Lewis (1991). At present, however, no systematic 
study of non-well-founded mereology has been put forward in the literature. 
Moreover, we are interested here in mereology as a tool for spatial reasoning, 
and in this regard the possibility of symmetric loops does indeed appear to be 
pure fantasy. In the following we shall therefore confine ourselves to theories 
that accept the antisymmetry postulate along with reflexivity and transitivity: 
parthood is a partial ordering. 


1.2.2 Other mereological concepts. It is convenient at this point to 
introduce some degree of formalization. Let us use ‘P’ for the binary predi- 
cate constant ‘...is part of ...’. Taking the underlying logic to be a standard 
predicate calculus with identity, the above minimal requisites on parthood may 
then be regarded as forming a first-order theory characterized by the following 
proper axioms for ‘P’: 


(P.1) $Pxx$    Reflexivity 
(P.2) $Pxy \land Pyz \to Pxz$    Transitivity 
(P.3) $Pxy \land Pyx \to x = y$    Antisymmetry 


(Here and in the following we simplify notation by dropping all initial universal 
quantifiers. Unless otherwise specified, all formulas are to be understood as 
universally closed.) A number of additional mereological predicates can then 
be introduced by definition. For example: 





(19) $EQxy =_{df} Pxy \land Pyx$    Equality 
(20) $PPxy =_{df} Pxy \land \lnot Pyx$    Proper Parthood 
(21) $PExy =_{df} \lnot Pxy \land Pyx$    Proper Extension 
(22) $Oxy =_{df} \exists z(Pzx \land Pzy)$    Overlap 
(23) $Uxy =_{df} \exists z(Pxz \land Pyz)$    Underlap 


An intuitive model for these relations, with ‘P’ interpreted as spatial inclusion, 
is given in the diagram of Fig. 15.1. 
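
The definitions (19)-(23) can be tried out on a small finite model. The following is only a minimal sketch, assuming a hypothetical three-point ‘space’ whose regions are its non-empty subsets and reading ‘P’ as set inclusion; the names `POINTS`, `DOMAIN` and `is_partial_order` are ours, not part of the theory.

```python
from itertools import combinations

# Assumed toy model: regions are the non-empty subsets of three points,
# and 'P' is interpreted as inclusion between regions.
POINTS = (1, 2, 3)
DOMAIN = [frozenset(c) for r in range(1, len(POINTS) + 1)
          for c in combinations(POINTS, r)]

def P(x, y):   # x is part of y
    return x <= y

def EQ(x, y):  # (19) Equality
    return P(x, y) and P(y, x)

def PP(x, y):  # (20) Proper Parthood
    return P(x, y) and not P(y, x)

def PE(x, y):  # (21) Proper Extension
    return not P(x, y) and P(y, x)

def O(x, y):   # (22) Overlap: some region is part of both
    return any(P(z, x) and P(z, y) for z in DOMAIN)

def U(x, y):   # (23) Underlap: both are part of some region
    return any(P(x, z) and P(y, z) for z in DOMAIN)

def is_partial_order():
    """Check reflexivity, transitivity and antisymmetry of P on DOMAIN."""
    refl = all(P(x, x) for x in DOMAIN)
    trans = all(P(x, z) for x in DOMAIN for y in DOMAIN for z in DOMAIN
                if P(x, y) and P(y, z))
    antisym = all(x == y for x in DOMAIN for y in DOMAIN
                  if P(x, y) and P(y, x))
    return refl and trans and antisym

a, b, ab = frozenset({1}), frozenset({2}), frozenset({1, 2})
print(is_partial_order())      # the three basic requisites on parthood
print(PP(a, ab), PE(ab, a))    # a is a proper part of ab; ab properly extends a
print(O(a, b), U(a, b))        # a and b are disjoint but underlap
```

Since the empty set is excluded from the domain, two disjoint regions share no part, whereas the whole three-point region guarantees that any two regions underlap.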

Note that ‘Uxy’ is bound to hold if we assume the existence of a ‘universal 
entity’ of which everything is part. Conversely, ‘Oxy’ would always hold if 
we assumed the existence of a ‘null entity’ that is part of everything. In the 
domain of spatial entities, the latter assumption is of course implausible (Geach 
1949).4 The former assumption may be challenged, too (Simons 2003, Varzi 
2006), but it seems reasonable, if not obvious, in case the only spatial entities 
countenanced by the theory are regions of space. We shall come back to these 
issues in Sec. 1.4. 





4 In other contexts one may feel differently: see Martin (1965) and Bunt (1985) for theories with a null 
individual, and Bunge (1966) for a theory with several null individuals. 
Figure 15.1. Basic mereological relations. (Shaded cells indicate parthood.) 


Note also that the definitions imply (by pure logic) that EQ, O, and U are 
reflexive and symmetric; in addition, EQ is also transitive—an equivalence 
relation. By contrast, PP and PE are irreflexive and asymmetric, and it follows 
from (P.2) that both are transitive. Since the following biconditional is also a 
straightforward consequence of the axioms: 


(24) $Pxy \leftrightarrow (PPxy \lor x = y)$ 


it should now be obvious that one could in fact use proper parthood as an 
alternative starting point for the development of mereology, using the right- 
hand side of (24) as a definiens for ‘P’. This is, for instance, the option followed 
in Simons (1987), where the partial ordering axioms for ‘P’ are replaced by the 
strict ordering axioms for ‘PP’: 


(25) $\lnot PPxx$ 
(26) $PPxy \land PPyz \to PPxz$ 
(27) $PPxy \to \lnot PPyx$ 


Ditto for ‘PE’, which was in fact the primitive relation in Whitehead’s (1919) 
semi-formal treatment of the mereology of events. Other options may be con- 
sidered, too. For example, Goodman (1951) used ‘O’ as a primitive and Leonard 
and Goodman (1940) used its opposite: 


(28) $Dxy =_{df} \lnot Oxy$    Disjointness 


However, the relations corresponding to such predicates are weaker than PP 
and PE and no biconditional is provable from (P.1)-(P.3) that would yield a 
corresponding definiens of ‘P’ (though one could of course define ‘P’ in terms 
of ‘O’ or ‘D’ in the presence of further axioms; see below ad (45)). Thus, 
other things being equal, ‘P’, ‘PP’, and ‘PE’ appear to be the only reasonable 
options. Here we shall stick to ‘P’. 
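
For the record, the biconditional (24), on which the interdefinability of ‘P’ and ‘PP’ rests, can be spelled out as follows. This is just one way of arranging the argument; it uses only the definition of proper parthood (20) together with reflexivity (P.1) and antisymmetry (P.3).

```latex
% Derivation of (24): Pxy <-> (PPxy \lor x = y)
\begin{align*}
&(\Rightarrow)\ \text{Assume } Pxy. \\
&\quad \text{Case } Pyx:\ x = y \text{ by (P.3).} \\
&\quad \text{Case } \lnot Pyx:\ Pxy \land \lnot Pyx, \text{ i.e. } PPxy \text{ by (20).} \\
&(\Leftarrow)\ \text{Assume } PPxy \lor x = y. \\
&\quad \text{Case } PPxy:\ Pxy \text{ by (20).} \\
&\quad \text{Case } x = y:\ Pxx \text{ by (P.1), hence } Pxy.
\end{align*}
```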

Finally, note that Identity could itself be introduced by definition, due to the 
following corollary of the antisymmetry postulate (P.3): 
(29) $x = y \leftrightarrow EQxy$ 


Accordingly, the theory could be formulated in a pure first-order language by 
assuming (P.1) and (P.2) and replacing (P.3) with the following variant of the 
standard axiom schema for ‘=’ (where φ is any formula): 


(P.3’) $EQxy \to (\phi x \leftrightarrow \phi y)$    Indiscernibility 


One may in fact argue on these grounds that parthood is in some sense con- 
ceptually prior to identity (as in Sharvy 1983, p. 234), and since ‘EQ’ is not 
definable in terms of ‘PP’ or ‘PE’ without resorting to ‘=’, the argument would 
also provide evidence in favor of ‘P’ as the most basic primitive. As we shall 
see in Sec. 1.3.2, however, the link between parthood and identity is philosoph- 
ically problematic. In order not to compromise our discussion, in the following 
we shall therefore continue to work with a language with both ‘P’ and ‘=’ as 
primitives. 


1.3 Decomposition principles 


Let M be the theory defined by the three basic principles (P.1)-(P.3). M may 
be viewed as embodying the common core of any mereological theory. Not 
just any partial ordering qualifies as a part-whole relation, though, and deciding 
what further principles should be added to (P.1)-(P.3) is precisely the question 
a good mereological theory is meant to answer. It is here that philosophical 
issues begin to arise. 

Generally speaking, such refinements may be divided into two main groups. 
On the one hand, one may extend M by means of decomposition principles that 
take us from a whole to its proper parts. For example, one may consider the idea 
that whenever something has a proper part, it has more than one, i.e., that there 
is always some mereological difference (a remainder) between a whole and its 
proper parts. This need not be true in every model for M: a world with only 
two items, only one of which is part of the other, would be a counterexample, 
though not one that could be illustrated with the sort of geometric diagram used 
in Fig. 15.1. On the other hand, one may extend M by means of composition 
principles that go in the opposite direction—from the parts to the whole. For 
example, one may consider the idea that whenever there are some things there 
exists a whole that consists exactly of those things, i.e., that there is always 
a mereological sum (or fusion) of two or more parts. Again, this need not be 
true in a model for M, and it is a matter of controversy whether the idea should 
hold unrestrictedly. 


1.3.1 Parts and remainders. Let us begin with the first sort of extension. 
And let us start by taking a closer look at the intuition according to which a 
whole cannot be decomposed into a single proper part. There are various ways 
in which one can try to capture this basic intuition. Consider the following 
(from Simons 1987, pp. 26-28): 


(P.4_w) $PPxy \to \exists z(PPzy \land \lnot z = x)$    Weak Company 
(P.4_s) $PPxy \to \exists z(PPzy \land \lnot Pzx)$    Strong Company 
(P.4) $PPxy \to \exists z(Pzy \land \lnot Ozx)$    Supplementation 


Figure 15.2. Three unsupplemented models. (Parthood relationships are represented by connecting lines going uphill.) 


The first principle, (P.4_w), is a literal rendering of the idea in question: every 
proper part must be accompanied by another. However, there is an obvious 
sense in which (P.4_w) only captures the letter of the idea, not the spirit: it rules 
out the unintended model mentioned above (see Fig. 15.2, left) but not, for 
example, an implausible model with an infinitely descending chain in which 
the additional proper parts do not leave any remainder (Fig. 15.2, center). 

The second principle, (P.4_s), is stronger: it rules out both models as unacceptable. 
However, (P.4_s) is still too weak to capture the intended idea. For 
example, it is satisfied by a model in which a whole can be decomposed into 
several proper parts all of which overlap one another (Fig. 15.2, right), and it 
may be argued that such models do not do justice to the meaning of ‘proper 
part’: after all, the idea is that the removal of a proper part should leave a 
remainder, but it is by no means clear what would be left of z once x (along 
with its parts) is removed. 

It is only the third principle, (P.4), that appears to provide a full formulation of 
the idea that nothing can have a single proper part. According to this principle, 
every proper part must be ‘supplemented’ by another, disjoint part, and it is this 
last qualification that captures the notion of a remainder. 
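
The difference in strength between Weak Company, Strong Company, and Supplementation can be checked by brute force on finite models. The sketch below is offered under stated assumptions: the two-element chain is the unintended model mentioned in the text, whereas the five-element model `FIVE` and the three-element model `TWO_ATOMS` are our own illustrations (they are not the models of Fig. 15.2, two of which cannot be finite), and the helper names `closure` and `principles` are hypothetical.

```python
def closure(domain, covers):
    """Reflexive-transitive closure of the 'immediate part' pairs: the relation P."""
    P = {(x, x) for x in domain} | set(covers)
    changed = True
    while changed:
        changed = False
        for a, b in list(P):
            for c, d in list(P):
                if b == c and (a, d) not in P:
                    P.add((a, d))
                    changed = True
    return P

def principles(domain, covers):
    """Return (weak company, strong company, supplementation) for the model."""
    P = closure(domain, covers)
    part = lambda x, y: (x, y) in P
    pp = lambda x, y: part(x, y) and not part(y, x)
    overlap = lambda x, y: any(part(z, x) and part(z, y) for z in domain)
    pairs = [(x, y) for x in domain for y in domain if pp(x, y)]
    weak = all(any(pp(z, y) and z != x for z in domain) for x, y in pairs)
    strong = all(any(pp(z, y) and not part(z, x) for z in domain) for x, y in pairs)
    supp = all(any(part(z, y) and not overlap(z, x) for z in domain) for x, y in pairs)
    return weak, strong, supp

# A world with only two items, one of which is part of the other.
CHAIN = (["x", "y"], [("x", "y")])
# Assumed illustration: two distinct wholes x1, x2 built from the same atoms
# a, b, both of which are proper parts of y.
FIVE = (["a", "b", "x1", "x2", "y"],
        [("a", "x1"), ("b", "x1"), ("a", "x2"), ("b", "x2"), ("x1", "y"), ("x2", "y")])
# Assumed illustration: a whole y with two disjoint atomic parts.
TWO_ATOMS = (["a", "b", "y"], [("a", "y"), ("b", "y")])

for name, model in [("chain", CHAIN), ("five", FIVE), ("two atoms", TWO_ATOMS)]:
    print(name, principles(*model))
```

In `FIVE` every proper part has company, and even company that is not part of it, yet x1 has no disjoint supplement in y, since every part of y shares an atom with x1. This is the sense in which only the third principle captures the notion of a remainder.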

Should (P.4) be incorporated into M as a further fundamental principle on the 
meaning of ‘part’? Most authors (beginning with Simons himself) would say so. 
Yet here there is room for disagreement. In fact, it is not difficult to conceive of 
mereological scenarios that violate not only (P.4), but also (P.4_s) and even (P.4_w). 
For example, in Brentano’s (1933) theory of accidents, a soul is a proper part 
of a thinking soul even though there is nothing to make up for the difference 
(see Chisholm 1978; Baumgartner and Simons 1994). Similarly, in Fine’s 
(1982) theory of qua objects, every basic object (John) qualifies as the only 
proper part of its incarnations (John qua philosopher, John qua husband, etc.). 
Now, such putative counterexamples are controversial and, more importantly 
for our present concerns, they appear to be of little significance if mereology 
is to be thought of as a theory of space. The spatial relations illustrated by 
our initial examples (1)-(7) all seem to satisfy (P.4) and, a fortiori, (P.4_s) and 
(P.4_w). Nonetheless there are counterexamples also in the realm of truly spatial 
mereologies. The best illustration comes from Whitehead’s (1929) theory of 
extensive connection: on this theory, a topologically closed region includes 
its open interior as a proper part in spite of there being no boundary elements 
to distinguish them—the domain only consists of extended regions. Whether 
the omission of boundary elements such as points, lines, and surfaces is a 
reasonable thing to do when it comes to the task of modeling our understanding 
of space, and whether in the absence of such elements the distinction between 
open and closed regions is still legitimate, are questions that every theory of 
space must of course address. In Sec. 2.4 we shall see that answering in the 
affirmative involves serious philosophical and technical complications. But we 
shall also see that several theories are available to do the job, including theories 
that occupy a prominent role in the current literature on qualitative spatial 
reasoning. One may rely on the intuitive appeal of (P.4) to discard such theories 
as implausible, but one may as well turn things around and regard the adequacy 
of such theories as a good reason not to accept (P.4) unrestrictedly. As things 
stand, it therefore seems appropriate to regard such a principle as providing a 
minimal but substantive addition to (P.1)-(P.3), one that goes beyond the mere 
lexical characterization of ‘part’ provided by M. For future reference, let us 
label the resulting mereological theory MM (for Minimal Mereology). 


1.3.2 Supplementation, extensionality, identity. There is another 
way of expressing the supplementation intuition that is worth considering. It 
corresponds to the following axiom, which differs from (P.4) in the antecedent: 





(P.5) $\lnot Pyx \to \exists z(Pzy \land \lnot Ozx)$    Strong Supplementation 


Intuitively, this says that if an object fails to include another among its parts, 
then there must be a remainder. It is easily seen that (P.5) implies (P.4), so 
any theory rejecting at least (P.4) will a fortiori reject (P.5). (For instance, on 
Whitehead’s boundary-free theory of extensive connection, a closed region is 
not part of its interior even though each part of the former overlaps the latter.) 
However, the converse does not hold. The diagram in Fig. 15.3 illustrates a 
model in which (P.4) is true, since each proper part counts as a supplement of 
the other; yet (P.5) is false. 
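
A model of this kind is easily checked mechanically. The following sketch assumes a four-element model in which two distinct wholes, x and y, have exactly the same two atomic proper parts; this matches the description just given, but the concrete encoding (the names `DOMAIN`, `PART`, `proper_parts`) is our own and is only meant as an illustration.

```python
# Assumed model: atoms z1, z2; distinct wholes x, y with the same proper parts.
DOMAIN = ["z1", "z2", "x", "y"]
PART = {(e, e) for e in DOMAIN} | {("z1", "x"), ("z2", "x"), ("z1", "y"), ("z2", "y")}

def P(a, b):
    return (a, b) in PART

def PP(a, b):
    return P(a, b) and not P(b, a)

def O(a, b):
    return any(P(c, a) and P(c, b) for c in DOMAIN)

def weak_supplementation():
    # (P.4): every proper part a of b is supplemented by a part of b disjoint from a.
    return all(any(P(z, b) and not O(z, a) for z in DOMAIN)
               for a in DOMAIN for b in DOMAIN if PP(a, b))

def strong_supplementation():
    # (P.5): if b is not part of a, some part of b is disjoint from a.
    return all(any(P(z, b) and not O(z, a) for z in DOMAIN)
               for a in DOMAIN for b in DOMAIN if not P(b, a))

def proper_parts(a):
    return {c for c in DOMAIN if PP(c, a)}

print(weak_supplementation())                    # each atom supplements the other
print(strong_supplementation())                  # y is not part of x, yet all of y overlaps x
print(proper_parts("x") == proper_parts("y"))    # same proper parts, distinct wholes
```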

Figure 15.3. A weakly supplemented model. 

The theory obtained by adding (P.5) to (P.1)-(P.3) is thus a proper extension 
of MM. Let us label this stronger theory EM, for Extensional Mereology, 
the attribute ‘extensional’ being justified precisely by the exclusion of countermodels 
that, like the one just mentioned, contain distinct objects with the same 
proper parts. In fact, the following is a theorem of EM: 


(30) $\exists z PPzx \to (\forall z(PPzx \to PPzy) \to Pxy)$ 


from which it follows that no composite objects with the same proper parts can 
be distinct: 


(31) $(\exists z PPzx \lor \exists z PPzy) \to (x = y \leftrightarrow \forall z(PPzx \leftrightarrow PPzy))$ 


(The analogue for ‘P’ is, of course, already provable in M, since parthood is 
reflexive and antisymmetric.) Thus, EM is truly an extensional theory incorporating 
the view that an object is exhaustively defined by its constituent parts. 
This goes far beyond the intuition that lies behind the weak supplementation 
principle (P.4). Does it go too far? 

On the face of it, it is not difficult to envisage scenarios that would correspond 
to the diagram in Fig. 15.3. For example, we can obtain a counterexample to 
(P.5) by identifying x and y with the sets {{z₁}, {z₁, z₂}} and {{z₂}, {z₁, z₂}} 
(i.e., with the ordered pairs ⟨z₁, z₂⟩ and ⟨z₂, z₁⟩, respectively), interpreting ‘P’ 
as the ancestral of the improper membership relation (i.e., of the union of ∈ 
and =). But sets are abstract entities; can we also envisage similar scenarios in 
the spatial domain? 
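
The set-theoretic counterexample can be made concrete. The sketch below assumes that the two elements are urelements, represented here by the strings "z1" and "z2", and it computes parthood as the ancestral of improper membership by simply collecting an entity together with its members, their members, and so on. The function names are ours.

```python
def parts(e):
    """All d such that d bears the ancestral of improper membership to e."""
    found = {e}
    if isinstance(e, frozenset):      # urelements (strings) have no members
        for member in e:
            found |= parts(member)
    return found

def pair(a, b):
    """The ordered pair of a and b encoded as the set {{a}, {a, b}}."""
    return frozenset({frozenset({a}), frozenset({a, b})})

def P(a, b):
    return a in parts(b)

def O(a, b):
    return bool(parts(a) & parts(b))

z1, z2 = "z1", "z2"
x, y = pair(z1, z2), pair(z2, z1)

print(x != y)                             # the two pairs are distinct
print(P(y, x))                            # y is not part of x
print(all(O(z, x) for z in parts(y)))     # yet no part of y is disjoint from x
```

So the antecedent of the strong supplementation principle holds for x and y while its consequent fails: every part of y, down to the two urelements, overlaps x.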

Here is a case where the answer may differ crucially depending on whether 
we are interested in modeling a domain of concrete spatial entities or just the 
domain of the regions of space that they occupy. In the latter case there is little 
room for controversy: spatial regions are extensional, if anything is, unless 
of course we favor a Whiteheadian conception of space. In the former case, 
however, the answer is controversial. There are two sorts of objection worth 
considering. On the one hand, it is sometimes argued that sameness of parts is 
not sufficient for identity, as some entities may differ exclusively with respect to 
the arrangement of their parts. For example, it is sometimes argued that: (i) two 
words can be made up of the same letters, as with ‘fallout’ and ‘outfall’; (ii) the 
same flowers can compose a nice bunch or a scattered bundle, depending on the 
arrangements of the individual flowers; (iii) a cat can survive the annihilation of 
its tail, but the amount of feline tissue consisting of the cat’s tail and the rest of the 
cat’s body cannot survive the annihilation of the tail, hence they have different 
properties and must be distinct by Leibniz’s law in spite of their sharing exactly 
the same ultimate mereological constituents. (See Hempel 1953: 110, Eberle 
1970: §2.10, and Wiggins 1968, respectively; variants of (iii) may also be found 
in Doepke 1982, Lowe 1989, Johnston 1992, and Baker 1999, inter alia.) On 
the other hand, it is sometimes argued that sameness of parts is not necessary for 
identity, as some entities may survive mereological change. If a cat survives the 
annihilation of its tail, then the tailed cat (before the accident) and the tailless 
cat (after the accident) are one and the same in spite of their having different 
proper parts (Wiggins 1980). If any of these arguments is accepted, then clearly 
(31) is too strong a principle to be imposed on the parthood relation. And since 
(31) follows from (P.5), it might be concluded that EM is on the wrong track. 

Let us look at these objections separately. Concerning the necessity of 
mereological extensionality, i.e., the left-to-right conditional in the consequent 
of (31): 


(32) $x = y \to \forall z(PPzx \leftrightarrow PPzy)$ 


it is perhaps enough to remark that the difficulty is not peculiar to extensional 
mereology. The objection proceeds from the consideration that ordinary entities 
such as cats and other living organisms (and possibly other entities as well, such 
as cars and houses) survive all sorts of gradual mereological changes. Yet the 
same can be said of other types of change as well: bananas ripen, houses 
deteriorate, people sleep at night and eat at lunch. How can we say that they are 
the same things, if they are not quite the same? Indeed, (32) is just an instance 
of the identity axiom 


(ID) $x = y \to (\phi x \leftrightarrow \phi y)$ 


and it is well known that this axiom calls for revisions when ‘=’ is given a 
diachronic reading. Arguably, any such revisions will affect the case at issue 
as well, and in this sense the above-mentioned objection to (32) can be disre- 
garded. For example, if the basic parthood predicate were reinterpreted as a 
time-indexed relation (Thomson 1983), then the problem would disappear as 
the tensed version of (P.5) would only warrant the following variant of (32): 


(32’) $x = y \to \forall t \forall z(PP_t zx \leftrightarrow PP_t zy)$ 


Similarly, the problem would disappear if the variables in (32) were taken 
to range over four-dimensional entities whose parts may extend in time as 
well as in space (Heller 1984, Sider 2001), or if identity itself were construed 
as a contingent relation that may hold at some times but not others (Gallois 
1998). Such revisions may be regarded as an indicator of the limited ontological 
neutrality of extensional mereology. But their independent motivation also 
bears witness to the fact that controversies about the necessity of extensionality 
stem from larger and more fundamental philosophical conundrums and cannot 
be assessed by appealing to our intuitions about the meaning of ‘part’. 

The worry about the sufficiency of mereological extensionality, i.e., the right- 
to-left conditional in the consequent of (31): 


(33) $\forall z(PPzx \leftrightarrow PPzy) \to x = y$ 


is more to the point. However, there are various ways of resisting such counterexamples 
as (i)-(iii) on behalf of EM. Consider (i): two words made up 
of the same letters. Insofar as we are dealing with truly spatial entities, this 
is best described as a case of different word tokens made up of distinct tokens 
of the same letter types. There is, accordingly, no genuine violation of (33) in 
the opposition between ‘fallout’ and ‘outfall’ (for instance), hence no reason 
to reject (P.5) on these grounds. (Besides, even with respect to abstract types, 
it could be pointed out that ‘fallout’ and ‘outfall’ do not share all their proper 
parts: the string ‘lo’, for instance, is only included in the first word.) What 
if one of the two word tokens is obtained from the other by rearranging the 
same letter tokens? In that case the reply misfires, but so does the objection: the 
issue becomes once again one of diachronic non-identity, with all that it entails 
(Lewis 1991, pp. 78ff). 

Case (ii)—the flowers—is not significantly different. The same, concrete 
flowers cannot compose a nice bunch and a scattered bundle at the same time. 
Case (iii), however, is more delicate. There is a strong intuition that a cat 
really is something over and above the amount of feline tissue consisting of its 
tail and the rest of its body—that they have different survival conditions and, 
hence, different properties—so it may be thought that here we have a genuine 
counterexample to mereological extensionality. On behalf of EM, it should 
nonetheless be noted that the appeal to Leibniz’s law in this context is debatable. 
Let ‘Tibbles’ name our cat and ‘Tail’ its tail, and let us grant the truth of 


(34) Tibbles can survive the annihilation of Tail. 


There is, indeed, an intuitive sense in which the following is also true: 


(35) The amount of feline tissue consisting of Tail and the rest of 
Tibbles’s body cannot survive the annihilation of Tail. 


However, this intuitive sense corresponds to a de dicto reading of the modality, 
where the description in (35) has narrow scope: 


(35a) Necessarily, the amount of feline tissue consisting of Tail and 
the rest of Tibbles’s body has Tail as a proper part. 


On this reading (35) is hardly negotiable (in fact, logically true). Yet this is 
irrelevant in the present context, for (35a) does not amount to an ascription of a 
modal property and cannot be used in connection with Leibniz’s law. (Compare 
the following fallacious argument: The number of planets might have been 
even; 9 is necessarily odd; hence the number of planets is not 9.) On the other 
hand, consider a de re reading of (35): 


(35b) The amount of feline tissue consisting of Tail and the rest of 
Tibbles’s body necessarily has Tail as a proper part. 


On this reading the appeal to Leibniz’s law would be legitimate (modulo any 
concerns about the status of modal properties) and one could rely on the truth of 
(34) and (35), i.e., (35b), to conclude that Tibbles is distinct from the relevant 
amount of feline tissue. However, there is no obvious reason why (35) should 
be regarded as true on this reading. That is, there is no obvious reason to 
suppose that the amount of feline tissue that in the actual world consists of Tail 
and the rest of Tibbles’s body—that amount of feline tissue that is now resting 
on the carpet—cannot survive the annihilation of Tail. Indeed, it would appear 
that any reason in favor of this claim vis-à-vis the truth of (34) would have to 
presuppose the distinctness of the entities in question, so no appeal to Leibniz’s 
law would be legitimate to establish the distinctness on pain of circularity (Varzi 
2000). This is not to say that the putative counterexample to (33) is wrong-headed. 
But it requires genuine metaphysical work to defend it and it makes 
the rejection of the strong supplementation principle (P.5) a much harder task. 


1.3.3 Complementation. There is a way of expressing the supplemen- 
tation intuition that is even stronger than (P.5). It corresponds to the following 
thesis, which differs from (P.5) in the consequent: 





(P.6) $\lnot Pyx \to \exists z \forall w(Pwz \leftrightarrow (Pwy \land \lnot Owx))$    Complementation 


This says that if y is not part of x, there exists something that comprises exactly 
those parts of y that are disjoint from x, something that we may call the difference 
or relative complement between y and x. It is easily checked that this 
principle implies (P.5). On the other hand, the diagram in Fig. 15.4 shows that 
the converse does not hold: there are two parts of y disjoint from x, namely 
z₁ and z₂, but there is nothing that consists exactly of such parts, so we have a 
model of (P.5) in which (P.6) fails. 
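
The failure of complementation in a strongly supplemented model can again be verified by brute force. The sketch below assumes a six-element model built from three atoms x, z1, z2, in which x together with z1 and x together with z2 form parts of the whole y, but z1 and z2 have no sum of their own. This is our reading of the situation just described, not a transcription of the figure, and the function names are hypothetical.

```python
# Assumed model: all non-empty sets of the atoms x, z1, z2 except {z1, z2}.
DOMAIN = [frozenset(s) for s in (
    {"x"}, {"z1"}, {"z2"}, {"x", "z1"}, {"x", "z2"}, {"x", "z1", "z2"})]

def P(a, b):
    return a <= b

def O(a, b):
    return any(P(c, a) and P(c, b) for c in DOMAIN)

def strong_supplementation():
    # (P.5): if b is not part of a, some part of b is disjoint from a.
    return all(any(P(z, b) and not O(z, a) for z in DOMAIN)
               for a in DOMAIN for b in DOMAIN if not P(b, a))

def has_difference(b, a):
    """Is there a z whose parts are exactly the parts of b disjoint from a?"""
    wanted = {w for w in DOMAIN if P(w, b) and not O(w, a)}
    return any({w for w in DOMAIN if P(w, z)} == wanted for z in DOMAIN)

def complementation():
    # (P.6): whenever b is not part of a, the difference between b and a exists.
    return all(has_difference(b, a)
               for a in DOMAIN for b in DOMAIN if not P(b, a))

x, z2 = frozenset({"x"}), frozenset({"z2"})
y = frozenset({"x", "z1", "z2"})

print(strong_supplementation())   # (P.5) holds in the model
print(has_difference(y, z2))      # the difference between y and z2 exists
print(has_difference(y, x))       # the difference between y and x does not
print(complementation())          # hence (P.6) fails
```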

Figure 15.4. A strongly supplemented model violating complementation. 

Figure 15.5. Does the difference between y and x exist? 

Any misgivings about (P.5) may of course be raised against (P.6). But what 
if we agree with the above arguments in support of (P.5)? Do they also give us 
reasons to accept the stronger principle (P.6)? The answer is in the negative. 
Plausible as it may sound, (P.6) has consequences that even an extensionalist 
may not be willing to accept. For example, Fig. 15.5 depicts a scenario that, 
it may be argued, corresponds exactly to the model of Fig. 15.4. It may be 
argued that although x and z₁ jointly constitute a larger part of y (the difference 
between y and z₂), and similarly for x and z₂ (the difference between y and 
z₁), there is nothing consisting of z₁ and z₂ (the difference between y and 
x), since these two pieces are disconnected. More generally, it appears that 
(P.6) would force us to accept the existence of scattered entities, such as the 
‘sum’ of your left and right arms, or the ‘sum’ of Canada and Mexico, and 
since Lowe (1953) many authors have objected to this thought regardless of 
how one feels about extensionality. (One philosopher who explicitly agrees 
to extensionality while distrusting scattered entities is Chisholm 1987.) As it 
turns out, the extra strength of (P.6) is therefore best appreciated in terms of 
the sort of mereological aggregates that this principle would entail, aggregates 
that are composed of two or more parts of a given whole. This suggests that 
any additional misgivings about (P.6), besides its extensional implications, are 
truly misgivings about matters of composition. We shall accordingly postpone 
their discussion to Sec. 1.4, where we shall attend to these matters more fully. 
For the moment, let us simply say that (P.6) is, on the face of it, not a principle 
that can be added to M without further argument. 


1.3.4 Atomism and other options. One last important family of de- 
composition principles concerns the question of atomism. Mereologically, an 
atom (or ‘simple’) is an entity with no proper parts: 





(36) $Ax =_{df} \lnot \exists y PPyx$    Atom 


Are there any such entities? And if there are, is everything entirely made up 
of atoms? Does everything comprise at least some atoms? Or is everything 
made up of atomless gunk? These are deep and difficult questions, which have 
been the focus of philosophical investigation since the early days of philosophy 
and have been center stage also in many recent disputes in mereology (see, 
for instance, van Inwagen 1990, Sider 1993, Zimmerman 1996a, and the pa- 
pers collected in Hudson 2004). Here we shall confine ourselves to a brief 
examination. 

The two main options, to the effect that there are no atoms at all, or that ev- 
erything is ultimately made up of atoms, correspond to the following postulates, 
respectively: 


(P.7) $\exists y PPyx$    Atomlessness 
(P.8) $\exists y(Ay \land Pyx)$    Atomicity 





These postulates are mutually incompatible, but taken in isolation they can consistently 
be added to any mereological theory X considered here. Adding (P.8) 
yields a corresponding Atomistic version, AX; adding (P.7) yields an Atomless 
version, ÃX. Since finitude together with the antisymmetry of parthood 
(P.3) jointly imply that mereological decomposition must eventually come to 
an end, it is clear that any finite model of M (and a fortiori of any extension of 
M) must be atomistic. Accordingly, an atomless mereology ÃX admits only 
models of infinite cardinality. (A world containing such wonders as Borges’s 
Aleph, where parthood is not antisymmetric, might by contrast be finite and 
yet atomless.) An example of such a model, establishing the consistency of the 
atomless version of most mereological theories considered in this chapter, 
is provided by the regular open sets of a Euclidean space, with ‘P’ interpreted 
as set-inclusion (Tarski 1935). 
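
Both observations can be illustrated in a few lines. The sketch below makes two simplifying assumptions: the finite model is the familiar one of the non-empty subsets of three points, and the atomless case is represented only by open intervals with rational end points (a fragment of the regular open sets of the real line, not the full model). The helper names are ours.

```python
from fractions import Fraction
from itertools import combinations

# Assumed finite model: non-empty subsets of three points under inclusion.
FINITE = [frozenset(c) for r in (1, 2, 3) for c in combinations((1, 2, 3), r)]

def subset(a, b):
    return a <= b

def is_atom(x, domain, P):
    """(36): x has no proper parts in the domain."""
    return not any(P(y, x) and not P(x, y) for y in domain)

def atomistic(domain, P):
    """(P.8): every element has an atom among its parts."""
    return all(any(is_atom(y, domain, P) and P(y, x) for y in domain)
               for x in domain)

# Open intervals (lo, hi) with rational end points, lo < hi, under inclusion.
def interval_part(a, b):
    return b[0] <= a[0] and a[1] <= b[1]

def left_half(iv):
    """A witness for (P.7): the left half of an interval is a proper part of it."""
    lo, hi = iv
    return (lo, (lo + hi) / 2)

print(atomistic(FINITE, subset))   # finite decomposition comes to an end

iv = (Fraction(0), Fraction(1))
for _ in range(5):                 # decomposition can always be continued
    half = left_half(iv)
    print(half, interval_part(half, iv) and not interval_part(iv, half))
    iv = half
```

The loop never reaches an interval without a proper part, which is why no finite collection of such intervals closed under `left_half` can exist.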

Now, one thing to notice is that, independently of their motivations, atomistic 
mereologies admit of significant simplifications in the axioms. For instance, 
AEM can be simplified by replacing (P.5) and (P.8) with 





(P.5’) $\lnot Pyx \to \exists z(Az \land Pzy \land \lnot Pzx)$ 


which in turn implies the following atomistic variant of the extensionality 
thesis (31): 


(37) $x = y \leftrightarrow \forall z(Az \to (Pzx \leftrightarrow Pzy))$ 


Thus, any atomistic extensional mereology is truly ‘hyperextensional’ in 
Goodman’s (1958) sense: things built up from exactly the same atoms are 
identical. An interesting question, discussed at some length in the late 1960’s 
(Yoes 1967, Eberle 1968, Schuldenfrei 1969) and taken up more recently by 
Simons (1987: 44f) and Engel and Yoes (1996), is whether there are atomless 
analogues of (37). Is there any predicate that can play the role of ‘A’ in an 
atomless mereology? Such a predicate would identify the ‘base’ of the system 
and would therefore enable mereology to cash out Goodman’s hyperextensional 
intuitions even in the absence of atoms. This question is particularly significant 
from a nominalistic perspective, but it also bears on our present concerns. For 
example, it is a relevant question to ask in connection with the Whiteheadian 
conception mentioned in Sec. 1.3.1, according to which space contains no parts 
of lower dimensions such as points or lines (see Forrest 1996, Roeper 1997). 
In special cases there is no difficulty in providing a positive answer. For example, 
in the ÃEM model consisting of the open regular subsets of the real line, 
the open intervals with rational end points form a base in the relevant sense. 
It is unclear, however, whether a general answer can be given that applies to 
any sort of domain. If not, then the only option would appear to be an account 
where the notion of a ‘base’ is relativized to entities of a given sort. In Simons’s 
terminology, we could say that the ψ-ers form a base for the φ-ers if and only 
if the following variants of (P.5’) and (P.8) are satisfied: 





(P.5*) $\phi x \land \phi y \to (\lnot Pyx \to \exists z(\psi z \land Pzy \land \lnot Pzx))$ 
(P.8*) $\phi x \to \exists y(\psi y \land Pyx)$ 





An atomistic mereology would then correspond to the limit case where ‘ψ’ is 
identified with ‘A’ for every choice of ‘φ’. In an atomless mereology, by contrast, 
the choice of the base would depend each time on the level of ‘granularity’ 
set by the relevant specification of ‘φ’. 

A second important consideration concerns the possibility of theories that lie 
between the two extreme options afforded by Atomicity and Atomlessness. For 
instance, it can be held that there are atoms, though not everything need have a 
complete atomic decomposition, or it can be held that there is atomless gunk, 
though not everything need be gunky (Zimmerman 1996a). Again, formally 
this amounts to endorsing a restricted version of either (P.7) or (P.8) in which 
the variables are suitably restricted so as to range over entities of a certain sort: 


(P.7_φ) $\phi x \to \exists y PPyx$ 
(P.8_φ) $\phi x \to \exists y(Ay \land Pyx)$ 





At present, no thorough formal investigation of such options has been enter- 
tained (but see Masolo and Vieu 1999). Yet the issue is particularly significant 
from the perspective of a mereological theory aimed at modeling the spatial 
world, especially if the theory is to countenance concrete spatial entities along 
with the regions of space that such entities may occupy. It is, after all, a plau- 
sible thought that while the question of atomism may be left open with regard 
to the mereological structure of material objects (pending empirical findings 
from physics, for example), it must receive a definite answer with regard to the 
structure of space itself. This would amount to endorsing a version of either 
(P.7_φ) or (P.8_φ) in which ‘φ’ is understood as a condition that is satisfied exclusively 
by regions of space. Such a condition, of course, cannot be formulated 
in the language of a purely mereological theory, but we shall see in Sec. 3 
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that a suitably enriched theory, in which the relation of location is explicitly 
articulated, can do the job properly. (Actually, it is hard to conceive of a world 
in which an atomistic space is inhabited by entities that can be decomposed 
indefinitely, so in this case it is reasonable to suppose that any theory accepting 
(P.84) for regions would also endorse the stronger principle (P.8). However, 
(P.74) would be genuinely independent of (P.7), unless it is assumed that every 
mereologically atomic entity should also be spatially atomic, i.e., unextended.) 

Similar considerations apply to other decomposition principles that may 
come to mind at this point. For example, one may consider adding a require- 
ment to the effect that ‘PP’ forms a dense ordering, as already Whitehead (1919) 
had it: 





(P.9) $PPxy \to \exists z(PPxz \wedge PPzy)$    Density


As a general decomposition principle, (P.9) might be deemed too strong, es- 
pecially in an atomistic setting. However, it is plausible to suppose that (P.9) 
should hold at least in the domain of spatial regions, regardless of whether these 
are construed as atomless gunk or as aggregates of spatial atoms. Evidently 
much depends on the link one establishes between the mereology of an ob- 
ject and that of its spatial location and this, again, is a question to which we 
attend more fully in Sec. 3. For the moment, let us simply observe that the 
sort of philosophical issues that lie behind these options is significantly differ- 
ent from those considered in the previous sections. Whether something can 
have a single proper part, whether parthood is extensional, or even whether it 
satisfies the complementation principle (P.6) are issues that depend greatly on 
our understanding of the parthood relation. They are, in an important sense, 
conceptual questions. Whether there are mereological atoms, by contrast, or 
whether mereological decomposition should obey a density principle, are sub- 
stantive questions that have nothing to do with our understanding of parthood as 
such. (For more on these questions, and on their general historical background, 
see Pyle 1995 and Holden 2004.) 
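
The decomposition principles discussed in this section can be tested mechanically on small finite models. The following is a minimal sketch, not part of the original text: it assumes that parthood is modelled as inclusion among the non-empty subsets of a three-element set, and the helper names (`nonempty_subsets`, `atomicity`, `atomlessness`, `density`) are hypothetical. It illustrates why (P.9) may look too strong in an atomistic setting: such a model satisfies Atomicity but not Density.

```python
from itertools import combinations


def nonempty_subsets(points):
    """All non-empty subsets of `points`, as frozensets."""
    pts = list(points)
    return [frozenset(c) for r in range(1, len(pts) + 1)
            for c in combinations(pts, r)]


def P(x, y):
    """Parthood, modelled here as set inclusion (an assumption of this sketch)."""
    return x <= y


def PP(x, y):
    """Proper parthood: part of, but not identical to."""
    return x < y


def is_atom(x, domain):
    """Ax: x has no proper parts in the domain."""
    return not any(PP(y, x) for y in domain)


def atomicity(domain):
    """(P.8): everything has an atomic part."""
    return all(any(is_atom(y, domain) and P(y, x) for y in domain)
               for x in domain)


def atomlessness(domain):
    """(P.7): everything has a proper part."""
    return all(any(PP(y, x) for y in domain) for x in domain)


def density(domain):
    """(P.9): whenever PPxy, some z lies strictly between x and y."""
    return all(any(PP(x, z) and PP(z, y) for z in domain)
               for x in domain for y in domain if PP(x, y))


domain = nonempty_subsets({"a", "b", "c"})
print(atomicity(domain), atomlessness(domain), density(domain))
```

In this model the singletons play the role of atoms, so nothing lies strictly between a singleton and a two-element set containing it.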


1.4 Composition principles 


Let us now consider the second way of extending M mentioned at the beginning 
of Sec. 1.3. Just as we may want to fix the logic of P by means of decompo- 
sition principles that take us from a whole to its proper parts, we may look 
at composition principles that go in the opposite direction—from the parts to 
the whole. More generally, we may consider the idea that the domain of the 
theory ought to be closed under mereological operations of various sorts: not 
only fusions, but also products, differences, and more. Here, again, there is 
room for several philosophical considerations, some of which are particularly 
indicative of the tension between space-oriented and object-oriented intuitions. 
Figure 15.6. A fusion_a that is not a fusion, and a fusion that is not a fusion_b.


1.4.1 Bounds and fusions. Conditions on composition are many. 
Beginning with the weakest, consider the claim that any two suitably related 
entities have an upper bound, i.e., underlap: 





(P.10ψ) $\psi xy \to \exists z(Pxz \wedge Pyz)$    Boundedness


Exactly how ‘ψ’ should be construed is an important question by itself: a
version of what van Inwagen (1990) calls the ‘special composition question’.
Perhaps the most natural choice is to identify ‘ψ’ with mereological overlap, the
rationale being that such a relation establishes an important tie between what
may count as two distinct parts of a larger whole. As we shall see momentarily,
with ‘ψ’ so construed, (P.10ψ) is indeed uncontroversial. However, regardless of
any specific choice, it is apparent that (P.10ψ) is pretty weak, as it holds trivially
in any domain with a universal entity of which everything is part. 

A somewhat stronger condition would be to require any pair of suitably 
related entities to have a smallest underlapper—something composed exactly 
of them and nothing else. This requirement is sometimes stated by saying 
that such entities must have a mereological ‘sum’, or ‘fusion’, though it is not 
immediately obvious how that should be formulated in the formal language. 
Consider: 


(P.11ψa) $\psi xy \to \exists z(Pxz \wedge Pyz \wedge \forall w(Pxw \wedge Pyw \to Pzw))$    Fusion_a
(P.11ψb) $\psi xy \to \exists z(Pxz \wedge Pyz \wedge \forall w(Pwz \to Owx \vee Owy))$    Fusion_b
(P.11ψ) $\psi xy \to \exists z\forall w(Owz \leftrightarrow Owx \vee Owy)$    Fusion


In a way, (P.11ψa) would seem the obvious choice, corresponding to the idea
that the fusion of two objects is just their least upper bound relative to P. (See
e.g. Bostock 1979, van Benthem 1983.) However, this condition is too weak
to capture the intended notion of a mereological fusion. For example, with
‘ψ’ construed as overlap, (P.11ψa) is satisfied by the model of Fig. 15.6, left:
here the least upper bound of x and y is z, yet z hardly qualifies as something
‘made up’ of x and y since its parts also include a third, disjoint item w. In fact,
it is a simple fact about partial orderings that among finite models (P.11ψa) is
equivalent to (P.10ψ), hence just as weak. By contrast, (P.11ψb) corresponds to
a notion of fusion (to be found e.g. in Tarski 1929) that may seem too strong:
it rules out the model on the left of Fig. 15.6; but it also rules out the model
on the right, which depicts a situation in which z may be viewed as an entity
truly made up of x and y insofar as it is ultimately composed of atoms to
be found either in x or in y. Of course, such a situation violates the strong
supplementation principle (P.5), but that’s precisely the sense in which (P.11ψb)
is too strong: an anti-extensionalist might want to have a notion of fusion
that does not presuppose strong supplementation. The formulation in (P.11ψ)
is the natural compromise: it is strong enough to rule out the model on the
left, but weak enough to be compatible with the model on the right. This is,
in fact, the formulation that best reflects the notion of fusion to be found in
standard treatments of mereology, and in the sequel we shall mostly stick to
it. Note, however, that if (P.5) holds, then (P.11ψ) is equivalent to (P.11ψb).
Moreover, it turns out that if the stronger complementation axiom (P.6) holds,
then all of these principles are trivially satisfied in any domain in which there
is a universal entity: in that case, regardless of ‘ψ’, the fusion of any two entities
is just the complement of the difference between the complement of one minus 
the other. (Such is the strength of (P.6), a genuine cross between decomposition 
and composition principles.) 
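
The contrast between the three notions of fusion can be checked on small finite models. The sketch below is only illustrative: the two models are hypothetical reconstructions in the spirit of the situations described for Fig. 15.6 (the figure itself is not reproduced here), and the names `make_model` and `fusion_tests` are assumptions introduced for this example.

```python
def make_model(elements, proper_parts):
    """Build (domain, parthood), with parthood the reflexive-transitive closure."""
    part = {(e, e) for e in elements} | set(proper_parts)
    changed = True
    while changed:
        changed = False
        for a, b in list(part):
            for c, d in list(part):
                if b == c and (a, d) not in part:
                    part.add((a, d))
                    changed = True
    return set(elements), part


def fusion_tests(model):
    """Return the three candidate definitions as predicates on (z, x, y)."""
    dom, part = model

    def P(a, b):
        return (a, b) in part

    def O(a, b):
        return any(P(v, a) and P(v, b) for v in dom)

    def fusion_a(z, x, y):  # least upper bound, consequent of (P.11ψa)
        return P(x, z) and P(y, z) and all(
            P(z, w) for w in dom if P(x, w) and P(y, w))

    def fusion_b(z, x, y):  # upper bound whose parts all overlap x or y, (P.11ψb)
        return P(x, z) and P(y, z) and all(
            O(w, x) or O(w, y) for w in dom if P(w, z))

    def fusion(z, x, y):  # z overlaps exactly what overlaps x or y, (P.11ψ)
        return all(O(w, z) == (O(w, x) or O(w, y)) for w in dom)

    return fusion_a, fusion_b, fusion


# Hypothetical "left" model: x and y share the part a; z also has the part w.
left = make_model("axywz", [("a", "x"), ("a", "y"),
                            ("x", "z"), ("y", "z"), ("w", "z")])
# Hypothetical "right" model: z is built from the atoms of x and y,
# but x and y themselves are not parts of z.
right = make_model("abcxyz", [("a", "x"), ("b", "x"), ("b", "y"), ("c", "y"),
                              ("a", "z"), ("b", "z"), ("c", "z")])

for name, model in (("left", left), ("right", right)):
    fa, fb, f = fusion_tests(model)
    print(name, fa("z", "x", "y"), fb("z", "x", "y"), f("z", "x", "y"))
```

In the first model z is a fusion_a of x and y without being a fusion; in the second, z is a fusion without being a fusion_b, and strong supplementation fails because x is not part of z although every part of x overlaps z.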

We can further strengthen these principles by considering infinitary bounds 
and fusions. For example, (P.10ψ) can be generalized to a principle to the effect
that any non-empty set of entities satisfying a suitable condition ξ has an upper
bound. Strictly speaking there is a difficulty in expressing such a principle in a
language without set variables. We can, however, achieve a sufficient degree of
generality by relying on an axiom schema where classes are identified by open
formulas. Since an ordinary first-order language has a denumerable supply
of formulas, at most denumerably many sets (in any given domain) can be
specified in this way. But for most purposes this limitation is negligible, as
normally we are only interested in those sets of objects or regions that we are
able to specify. Thus, the following axiom schema will do, where ‘φ’ is any
formula in the language and ‘ξ’ expresses the condition in question:








(P.12ξ) $(\exists w\,\varphi w \wedge \forall w(\varphi w \to \xi w)) \to \exists z\forall w(\varphi w \to Pwz)$


Likewise, the fusion axiom (P.11ψ) can be strengthened as follows:











(P.13ξ) $(\exists w\,\varphi w \wedge \forall w(\varphi w \to \xi w)) \to \exists z\forall w(Owz \leftrightarrow \exists v(\varphi v \wedge Owv))$





and similarly for (P.11ψa) and (P.11ψb). (The condition ‘∃wφw’ guarantees that
‘φ’ picks out a non-empty set, so there is no danger of asserting the unconditional
existence of ‘null entities’, a mereological fiction that we have already
mentioned as implausible in the context of spatial ontology.) It can be checked
that these generalized formulations include the corresponding finitary principles
as special cases, taking ‘φw’ to be the formula ‘(w = x ∨ w = y)’ and
‘ξw’ the condition ‘(w = x → ψwy) ∧ (w = y → ψxw)’.

Finally, we get the strongest version of these composition principles by asserting
them as axiom schemas that hold for every condition ξ, i.e., effectively,
by forgoing any reference to ξ altogether. Formally this amounts in each case
to dropping the second conjunct of the antecedent. For example, the following
schema is the unrestricted version of (P.13ξ), to the effect that every specifiable
non-empty set of entities has a fusion:











(P.13) $\exists w\,\varphi w \to \exists z\forall w(Owz \leftrightarrow \exists v(\varphi v \wedge Owv))$    Unrestricted Fusion


The extension of EM obtained by adding every instance of this schema has 
a distinguished pedigree and is known as General Extensional Mereology, or 
GEM. It corresponds to the classic systems of Leśniewski and of Leonard and 
Goodman. In fact, it turns out that adding (P.13) to MM yields the same theory 
GEM, since (P.13) implies that every pair of overlapping things has a maximal 
common part (a product): 





(38) $Oxy \to \exists z\forall w(Pwz \leftrightarrow (Pwx \wedge Pwy))$


which, in turn, implies the equivalence between the weak supplementation 
principle (P.4) and the stronger (P.5) (Simons 1987, p. 31). This is by itself 
remarkable, for it might be thought that a composition principle such as (P.13) 
should be compatible with the rejection of a decomposition principle that is 
committed to extensionality. On the other hand, mereological extensionality is 
really a double-barreled thesis: it says that two wholes cannot be decomposed 
into the same proper parts but also, by the same token, that two wholes cannot 
be composed out of the same proper parts. So it is not entirely surprising that as 
long as PP is well behaved, as per (P.4), extensionality might pop up like this in 
the presence of substantive composition statements. (It is, however, noteworthy 
that it pops up as soon as (P.4) is combined with a seemingly innocent thesis 
such as (38), so the anti-extensionalist should keep that in mind.) 

The intuitive idea behind all these principles is in fact best appreciated in the 
presence of extensionality, for in that case the relevant fusions must be unique. 
Just to confine ourselves to GEM, it is natural to consider the following fusion 
operator (where ‘ι’ is the definite descriptor):





(39) $\sigma x\,\varphi x =_{df} \iota z\,\forall w(Owz \leftrightarrow \exists v(\varphi v \wedge Owv))$    fusion


Then (P.13) and (P.5) can be simplified to a single axiom schema: 








(P.14) $\exists x\,\varphi x \to \exists z(z = \sigma x\,\varphi x)$    Unique Unrestricted Fusion





5 We assume a classical logical background, with ‘ι’ defined as usual. Much of what follows, however, would
also apply in case a free logic were used instead, with ‘ι’ assumed as part of the logical vocabulary proper.
(See Simons 1991b for a free formulation of mereology.) 
and the full strength of the theory can be seen by considering that its models
are all closed under the following functors, modulo the absence of a null entity: 


(40) $x + y =_{df} \sigma z(Pzx \vee Pzy)$    sum
(41) $x \times y =_{df} \sigma z(Pzx \wedge Pzy)$    product
(42) $x - y =_{df} \sigma z(Pzx \wedge Dzy)$    difference
(43) $\sim x =_{df} \sigma z\,Dzx$    complement
(44) $U =_{df} \sigma z\,Pzz$    universe


(Absent the null entity, U has no complement while products are defined only 
for overlapping pairs and differences for pairs that leave a remainder). Since 
these functors are the natural mereological analogue of the familiar Boolean 
operators, with fusion in place of set abstraction, it follows that the parthood 
relation axiomatized by GEM has the same properties as the set-inclusion 
relation. More precisely, it is isomorphic to the inclusion relation restricted 
to the set of all non-empty subsets of a given set, which is to say a complete 
Boolean algebra with the zero element removed—a fact that has been known 
since Tarski (1935). 
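
The correspondence with Boolean algebras can be illustrated on a small finite case. The following sketch is an illustration under stated assumptions rather than a proof: it takes the non-empty subsets of a three-element set as the domain, reads parthood as inclusion, and checks by brute force that every non-empty collection of entities has exactly one fusion in the sense of (P.13), namely its union. It also checks that parthood coincides with inclusion of overlappers. The function names are hypothetical.

```python
from itertools import combinations


def nonempty_subsets(points):
    """All non-empty subsets of `points`, as frozensets."""
    pts = list(points)
    return [frozenset(c) for r in range(1, len(pts) + 1)
            for c in combinations(pts, r)]


def P(x, y):
    """Parthood as inclusion (an assumption of this sketch)."""
    return x <= y


def O(x, y):
    """Overlap: x and y share a non-empty common part."""
    return bool(x & y)


def fusions(phi_ers, domain):
    """All z such that, for every w, Owz iff w overlaps some phi-er."""
    return [z for z in domain
            if all(O(w, z) == any(O(w, v) for v in phi_ers) for w in domain)]


domain = nonempty_subsets({1, 2, 3})

# Every non-empty collection of entities has exactly one fusion: its union.
unique_fusion = all(
    fusions(list(sel), domain) == [frozenset().union(*sel)]
    for r in range(1, len(domain) + 1)
    for sel in combinations(domain, r)
)

# Parthood coincides with inclusion of overlappers.
overlappers_ok = all(
    P(x, y) == all((not O(z, x)) or O(z, y) for z in domain)
    for x in domain for y in domain
)

print(unique_fusion, overlappers_ok)
```

The empty set is left out of the domain on purpose, mirroring the absence of a null entity.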

There are other equivalent formulations of GEM that are noteworthy. For 
instance, it is a theorem of every extensional mereology that parthood amounts 
to inclusion of overlappers: 


(45) $Pxy \leftrightarrow \forall z(Ozx \to Ozy)$


This means that in an extensional mereology ‘O’ could be used as a primitive 
and ‘P’ defined accordingly, and it can be checked that the theory defined by 
postulating (45) together with the unrestricted fusion principle (P.13) and the 
antisymmetry axiom (P.3) is equivalent to GEM. Another elegant axiomati- 
zation of GEM, due to an earlier work of Tarski (1929), is obtained by taking 
just the transitivity axiom (P.2) and the unique unrestricted fusion axiom (P.14). 


1.4.2 Composition, existence, and identity. Arguably, the algebraic 
strength of GEM speaks in favor of this theory as an account of the struc- 
ture of space, since it is rather intuitive (and common practice) to understand 
spatial regions in terms of sets of points mereologically related by set-inclusion. 
As a general theory of the mereology of all spatial entities, however, GEM 
reflects substantive postulates whose philosophical underpinnings are contro- 
versial. Indeed, all composition principles turn out to be controversial, just 
as the decomposition principles examined in Sec. 1.3. For, on the one hand, 
it appears that the weaker, conditional formulations, from (P.10ψ) to (P.13ξ),
are just not doing enough work: not only do they depend on the specification
of the limiting conditions expressed by the predicates ‘ψ’ and ‘ξ’; they also
treat such conditions as merely sufficient for the existence of bounds and fusions,
whereas ideally we are interested in conditions that are both sufficient and
necessary. On the other hand, the stronger, unconditional formulations, most
notably (P.13), appear to go too far, not only because they tend to obliterate
any difference between weak and strong supplementation, i.e., extensional- 
ity, but because they commit the theory to the existence of a large variety of 
prima facie implausible mereological composites. So what is the right way to 
go, at least insofar as we are interested in the compositional structure of the 
spatial realm? 

Concerning the first sort of worry, one could of course strengthen every
conditional formulation to a biconditional expressing both a necessary and 
sufficient condition for the existence of an upper bound, or a fusion. But then 
the question of how such conditions should be construed becomes crucial. For 
example, in connection with (P.10ψ) we have mentioned the idea of construing
‘ψ’ as ‘O’, the rationale being that mereological overlap establishes an important
connection between what may count as two distinct parts of a larger whole. 
However, as a necessary condition overlap is arguably too stringent. We may 
have misgivings about the existence of scattered entities consisting of spatially 
unrelated parts, such as the top of my body and the bottom of yours, or the 
collection of my umbrellas and your left shoes. But in some cases no such 
misgivings arise. In some cases it appears perfectly natural to countenance 
wholes that are composed of two or more disjoint entities—a bikini, the solar 
system, a printed inscription consisting of separate letter tokens (Cartwright 
1975). More generally, intuition and common sense suggest that some and 
only some mereological composites exist, not all; yet it is doubtful whether 
the question of which composites exist—van Inwagen’s ‘special composition 
question’—can be answered successfully. Consider a series of almost identical 
mereological aggregates that begins with a case where composition appears to 
obtain (e.g., the body cells that currently make up my body) and ends in a case 
where composition would seem not to obtain (e.g., the same body cells after 
their relative distance has been gradually increased to a huge extent). Where 
should we draw the line? It may well be that whenever some entities compose 
a bigger one, it is just a brute fact that they do so (Markosian 1998b). But if we 
are unhappy with brute facts, if we are looking for a principled way of drawing 
the line so as to specify the circumstances under which the facts obtain, then 
the question is truly challenging. As Lewis (1986, p. 213) put it, no restriction 
on composition can be vague, since existence cannot be a matter of degree; but 
unless it is vague, it cannot fit the intuitive desiderata. 

For these reasons, although the axiom of unrestricted fusion has been a major 
source of complaint since the early days of mereology (see again Lowe 1953 
and Rescher 1955, with replies in Goodman 1956, 1958), it is a fact that most 
formally accomplished theories accept unrestricted composition principles of 
some sort. Apart from whatever algebraic considerations might motivate them, 
such principles suggest themselves as the only non-arbitrary ways of answering 
the composition question. Besides, it might be observed that any complaints
against such principles rest on psychological biases that have no bearing on 
how the world is actually structured. In the words of van Cleve (1986, p. 145), 
little would follow even if we did manage to come up ‘with a formula that jibed 
with all ordinary judgments’ about what counts as a unit and what does not, for 
such judgments need not have any ontological transparency. 

All of this speaks in favor of (P.13) and the like against their weaker, conditional
formulations, providing also an answer to the second worry mentioned
above: the prima facie ontological extravagance of a theory such as GEM is 
not by itself a sign that the theory has gone too far. There is, however, another 
worry that is worth mentioning in this connection, and this further worry con- 
cerns the ontological exuberance—if not the extravagance—of the theory. For 
even granting the impossibility of drawing a principled line between natural 
fusions and unnatural ones, one could still object that positing every conceiv- 
able fusion is utterly unjustified. Why should mereology be committed to the 
existence of all such things over and above their constituent proper parts? 

There are two lines of response to this question. First, it could be observed 
that the ontological exuberance associated with the relevant composition prin- 
ciples is not substantive. This is obvious in the case of a modest principle in 
the spirit of (P.10ψ), to the effect that entities of the right sort have an upper
bound. After all, there are small things (my fingers) and large things (my body), 
and it is just a fact that the latter encompass the former. But the same could be 
said with respect to those stronger principles that require the large thing to be 
composed exactly of the small things—to be their mereological fusion. For one 
could argue that even a fusion is, in an important sense, nothing over and above 
its constituent parts. The fusion is just the parts ‘taken together’ (Lewis 1991, 
p. 81); it is the parts ‘counted loosely’ (Baxter 1988, p. 580); it is, effectively, the
same portion of reality, which is strictly a multitude and loosely a single thing.
This thesis, known in the literature as ‘composition as identity’, is by no means
undisputed (see e.g. van Inwagen 1994, Yi 1999, Merricks 1999). Nonetheless 
it should be carefully evaluated in connection with any worry about the onto- 
logical exuberance of fusion principles. And if the thesis is accepted, then the 
charge of ontological extravagance loses its force, too. If a fusion is nothing 
over and above its constituent parts, and if the latter are all right, there can be 
nothing particularly extravagant in countenancing the former: it just is them. 

Secondly, one may observe that the worry in question bites at the wrong 
level. If, given two entities, positing their sum were to count as further onto- 
logical commitment, then, given any mereologically composite entity, positing 
its proper parts should also count as further commitment. After all, every entity 
is distinct from its proper parts. But then the worry has nothing to do with the 
composition axioms; it is, rather, a question of whether there is any point in 
countenancing a whole along with its parts, or vice versa. And if the answer is 
in the negative, then there seems to be no use for mereology tout court. From
the point of view of the present worry, it would seem that the only truly par- 
simonious account would be one that rejects, not only some, but all logically 
admissible fusions—in fact, all mereological composites whatsoever. Philo- 
sophically such an account is defensible (see Rosen and Dorr 2002) and the 
corresponding axiom is compatible with M: 


(P.15) Ax Strong Atomicity 


The following immediate corollary, however, says it all: nothing would be part 
of anything else and parthood would collapse to identity. 


(46) $Pxy \leftrightarrow x = y$


(This account is known as mereological nihilism, in contrast to the mereological 
universalism expressed by (P.13); see van Inwagen 1990, pp. 72ff.) 

In recent years, further worries have been raised concerning mereological 
theories with non-trivial composition principles—especially concerning the full 
strength of GEM. It has been argued that unrestricted composition does not sit 
well with certain intuitions about persistence through time (van Inwagen 1990, 
75ff), that it requires every entity to necessarily have the parts it has (Merricks 
1999), or that it leads to paradoxes similar to the ones afflicting naive set theory 
(Bigelow 1996). Such arguments are still the subject of on-going controversy 
and a detailed examination is beyond the scope of this chapter. Some discussion 
of the first point, however, is already available in the literature: see especially 
Rea (1998), McGrath (1998), and Hudson (2001, pp. 93ff). Hudson (2001, 
pp. 95ff) also contains a discussion of the last point. 


1.5 The problem of vagueness


Let us conclude this discussion of mereology by considering a question that is 
not directly related to specific mereological principles but, rather, to the under- 
lying notion of parthood that mereology seeks to systematize. All the theories 
examined so far, from M to GEM, assume that parthood is a perfectly deter- 
minate relation: given any two entities x and y, there is always a definite fact 
of the matter as to whether or not x is part of y. However, it may be argued that
this is a simplification. Perhaps there is no room for vagueness in the idealized 
mereology of pure space, but what about the real world? Think of a cloud, a 
forest, a pile of trash. What parts do they have, exactly? What are the mereo- 
logical boundaries of a desert, a river, a mountain? Some stuff is positively part 
of Mount Everest and some stuff is positively not part, yet there is borderline 
stuff whose mereological relationship to Everest seems indeterminate. Even 
living organisms may, on closer look, give rise to vagueness issues. Surely 
John’s body comprises his heart and does not comprise mine. But what about 


Figure 15.7. Objects with indeterminate parts (in grey). 


the candy he is presently chewing: Is it part of John? Will it be part of John 
only after he swallowed it? After he started digesting it? After he digested it 
completely? 

In the face of such examples, it might be thought that the conceptual apparatus 
on which M and its extensions are based is too rigid. It might be thought that the 
world includes various sorts of vague entities, and that relative to such entities 
the parthood relation need not be fully determined (van Inwagen 1990: Ch. 13, 
Parsons and Woodruff 1995). There are, in fact, various ways one could seek 
greater flexibility. One could leave everything as is but change the underlying 
logic (and semantics), for instance by allowing statements of the form ‘Pxy’ to
receive no determinate truth-value (as in Tye 1990), or to receive truth-values 
that are intermediate between classical truth and falsity (as in Copeland 1995). 
Or one could change the very basic apparatus of mereology, replacing the ‘part 
of’ predicate with a new primitive ‘part of to a degree’: this is, for example, 
the approach that led to the development of Polkowsky and Skowron’s (1994) 
‘rough mereology’, where parthood undergoes a fuzzification parallel to the 
fuzzification of membership in Zadeh’s (1965) fuzzy set theory. No matter how 
exactly one proceeds, obviously many among the principles discussed above 
would have to be reconsidered, not because of what they say but because of their 
classical, bivalent presuppositions. For example, the extensionality theorem of 
EM, (31), says that composite things with the same proper parts are identical, 
and this would call for qualifications: the model in Fig. 15.7, left, depicts x 
and y as non-identical by virtue of their having distinct determinate parts; yet 
one might prefer to describe a situation of this sort as one in which the identity 
between x and y is itself indeterminate, since it is indeterminate whether they
really have distinct parts. Conversely, the model on the right depicts x and y 
as non-identical in spite of their having the same determinate proper parts; yet 
again one might prefer to suspend judgment owing to the indeterminacy of the
middle element.

That there are vague objects in this sense, however, i.e., objects whose mereological
composition may to some extent be objectively indeterminate, is far
from obvious. Surely a statement such as


(47) x is part of Everest


may lack a definite truth-value, if x lies somewhere in the borderline area. 
But—it could be argued—this need not be due to the way the world is. The 
indeterminacy of (47) may be due exclusively to semantic factors: not to the
vagueness of Everest but to the vagueness of ‘Everest’. When the members 
of the Geodetic Office of India baptized the mountain after the name of their 
British founder, they simply did not specify exactly which parcel of land they 
were referring to (or which parcel of land constituted the mountain they meant 
to name). The referent of their term was vaguely fixed and, as a consequence, 
the truth conditions of a statement such as (47) are not fully determined; yet 
this is not to say that the stuff out there is mereologically vague. Each one 
of a large variety of slightly distinct parcels of land has an equal claim to the 
vaguely introduced name ‘Everest’, and each such thing has a perfectly precise 
mereological structure. To put it differently, a statement such as 


(48) It is indeterminate whether x is part of Everest

admits of a de re reading, as in (48a), but also of a de dicto reading, as in (48b): 


(48a) Everest is a y such that: it is indeterminate whether x is part
of y. 

(48b) It is indeterminate whether: Everest is a y such that x is part 
of y. 


The first reading corresponds to the initial thought, to the effect that Everest’s 
parts are indeed indeterminate, with the consequence that mereology ought to be 
revised as seen above. The second reading, by contrast, corresponds to the idea 
that it is the semantics of ‘Everest’ that is indeterminate, and there is no reason to 
suppose that this is due to some objective deficiency in the parthood relation— 
hence no reason to require revisions in the apparatus of mereology itself. (Ditto 
for the other cases mentioned above. The reason why it’s indeterminate whether 
a certain molecule is part of a cloud, a tree part of a forest, or the candy part of 
John, is not that such things are mereologically indeterminate; rather, on a de 
dicto understanding the indeterminacy lies entirely in our words, in the terms 
we use to pick out such things from a multitude of slightly distinct but perfectly 
determinate potential referents.) 

If the semantic conception is accepted, then, the problem of vagueness dis- 
solves. Or rather: it ceases to be an issue for mereology and it becomes a 
problem for semantics broadly understood—a problem that manifests itself in 
many contexts besides those under consideration. (How much money do you 
need to be rich? How slowly can you run? How late can I call you?) Again, 
there are many things one could do at this point. A favored option is afforded 





6 That mountains are just parcels of land is, of course, a substantive assumption: an anti-extensionalist may
want to deny it, as with Tibbles and the relevant amount of feline tissue (Sec. 1.3.2). On the ontology of 
topographic entities, see e.g. Smith and Mark (2003). 
by so-called supervaluational semantics, whose first application to vagueness
can be traced back to Fine (1975). According to such semantics, the truth-value 
of a sentence involving vague terms is a function of its truth-values under the 
admissible precisifications of those terms: the sentence is true if it is true under 
every precisification, false if false under every precisification, and indetermi- 
nate otherwise. Thus, if x is in the borderline area, then the indeterminacy of 
(47) is explained by the fact that among the many admissible ways of precisify- 
ing the term ‘Everest’, some would pick out a referent that extends far enough 
to include x among its parts whereas others would not, which is to say that (47) 
would be true on some but not all precisifications. By contrast, if x were clearly 
part of Everest given the way the name is used in ordinary circumstances, or if 
it were clearly not part of Everest, then (47) would have a definite truth-value, 
for every precisification would yield the same response (always true and always 
false, respectively). We need not go into the details here. But three things are 
worth noting. 

First, none of this will have any impact on the mereological axioms consid- 
ered so far. For those axioms are expressed as (implicitly) universally quantified 
formulas involving no singular terms except for variables, and variables are not 
the sort of expression that can suffer from the phenomenon of vagueness. Vari- 
ables range over all entities included in the domain of quantification and pick 
out their values independently of any vagueness that may affect the non-logical 
vocabulary. 

Second, any model that satisfies a given axiom or theorem satisfies also any 
substitution instance that can be obtained by replacing one or more variables 
with corresponding names or descriptive terms. For example, the following 
sentence is a substitution instance of (P.1): 


(49) Everest is part of Everest. 


and it is easily verified that a supervaluational semantics will make (49) true in 
every model of M. For insofar as reflexivity is meant to hold for every entity 
in the domain, the truth of (49) is guaranteed no matter which entity we elect 
as the referent of ‘Everest’. Likewise, the following is a substitution instance 
of (31), the extensionality theorem of EM: 


(50) As long as they are non-atomic, Everest and Sagarmatha are 
the same if and only if they have the same proper parts. 


(‘Sagarmatha’ is the Nepalese translation of ‘Everest’, though there is no guarantee
that both names admit of the same precisifications.) Again, it is easily
verified that a supervaluational semantics will make (50) true in every EM 
model. For no matter how we precisify the terms ‘Everest’ and ‘Sagarmatha’ 
by tracing a precise boundary around their referents, the extensionality of parthood
will guarantee that the referents coincide just in case their proper parts
coincide too. 

Finally, it is worth emphasizing that a supervaluational semantics is perfectly 
adequate to classical logic (Fine 1975, McGee 1997, Varzi 2001). For example, 
although it does not obey the semantic principle of bivalence, as shown by various
instances of 


(51) Either ‘x is part of Everest’ is true or ‘x is part of Everest’ is false,


it certainly satisfies the logical law of excluded middle: any instance of


(52) Either x is part of Everest or x is not part of Everest


is bound to be true, for it is true on any precisification of ‘Everest’. Things would 
change, however, if the language were enriched by adding an explicit senten- 
tial operator to express indeterminacy. In that case, the following principle 
would give expression to the assumption that parthood admits of no objective 
borderline cases: 


(P.16) It is determinate whether Pxy,          Determinacy


though it is obvious that this principle may have invalid instances as soon as 
‘x’ or ‘y’ is replaced by a vague singular term such as ‘Everest’, as in (48). 
At the moment, the logic of determinacy operators is an open area of research, 
especially in the context of a supervaluational semantics. (See Keefe 2000, 
§7.4 and references therein.) It is, however, an important tool that any good
semantic theory of vagueness should countenance. And it is bound to play a sig- 
nificant role in any application of the theory to mereology and spatial reasoning 
broadly understood. 
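
The contrast between bivalence and excluded middle can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the spot names, the two precisifications of ‘Everest’, and the helper names are hypothetical, and a sentence is taken to be super-true (super-false) when it is true (false) on every precisification.

```python
# Toy supervaluation; all names and data are illustrative assumptions.
# Each admissible precisification of 'Everest' is a precise set of spots.
PRECISIFICATIONS = [
    frozenset({"summit", "north_face"}),
    frozenset({"summit", "north_face", "foothill"}),
]

def part_of_everest(spot, everest):
    """The atomic sentence 'spot is part of Everest' on one precisification."""
    return spot in everest

def supervalue(sentence):
    """Return True if super-true, False if super-false, None if neither."""
    values = {sentence(p) for p in PRECISIFICATIONS}
    if values == {True}:
        return True
    if values == {False}:
        return False
    return None  # a truth-value gap: bivalence fails for this sentence

def atomic(spot):
    """Build the sentence 'spot is part of Everest'."""
    return lambda p: part_of_everest(spot, p)

def excluded_middle(spot):
    """Build the sentence 'spot is part of Everest or it is not'."""
    return lambda p: part_of_everest(spot, p) or not part_of_everest(spot, p)

borderline = supervalue(atomic("foothill"))      # gap: varies across precisifications
lem = supervalue(excluded_middle("foothill"))    # holds on every precisification
```

On these assumptions the atomic sentence about the borderline spot receives no truth value, while the corresponding instance of (52) comes out super-true, which is the pattern described above for (51) and (52).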


2. Philosophical issues in topology 


Let us now move on to the second major ingredient of a comprehensive theory 
of spatial reasoning—topology. There are many reasons for this move, but 
the main one is simply this: one need go beyond the bounds of a pure theory 
of parthood to come out with a true theory of parts and wholes. For as we 
have already mentioned, mereology by itself cannot do justice to the notion 
of a whole (a one-piece, self-connected whole such as a stone or a whistle, as 
opposed to a scattered entity made up of several disconnected parts, such as a 
bikini or a broken glass). Parthood is a relational concept, wholeness a global 
property, and the latter just runs afoul of the former. 

Whitehead’s early attempts to characterize his ontology of events, as pre- 
sented at length in his Enquiry (1919) and in The Concept of Nature (1920), 
exemplify this difficulty most clearly. The mereological system underlying

Figure 15.8. A connected sum (left) and two disconnected sums (middle, right).


Whitehead’s ontology was not meant to admit of arbitrary wholes, but only 
of wholes made up of parts that are ‘joined’ or connected to one another— 
specifically, finitary sums of such parts. Thus, Whitehead was working with a 
composition principle patterned after (P.11ψ), in fact with the corresponding
biconditional, with ‘ψ’ understood as a predicate expressing the relevant relation
of connection. And Whitehead’s characterization of this relation was purely
mereological:⁷





(53) ψxy =df ∃z(Ozx ∧ Ozy ∧ ∀w(Pwz → Owx ∨ Owy))


Looking at the spatial patterns in Fig. 15.8, we can see how this definition is 
intended to work. What distinguishes the connected sum x + y on the left 
from the disconnected sum in the middle? Well, in the former case it is easy 
to find regions, such as z, that overlap both x and y without outgrowing the 
sum: regions that lie entirely within x + y. By contrast, in the middle pattern
it would seem that every z overlapping both x and y will also overlap their 
complement—the entity that surrounds x + y. Thus, only the left pattern 
satisfies the condition expressed by ‘ψ’; the pattern in the middle violates it.
However, Fig. 15.8 also shows why this account is defective. For nothing 
guarantees that the item z overlapping two ‘joined’ items x and y be itself 
in one piece, so the pattern on the right depicts two entities that satisfy the 
condition expressed by ‘ψ’, too. Yet this is a case where we should like to
say that x and y are not connected. Of course, Whitehead would disqualify 
the counterexample because his ontology does not contain any disconnected 
zs—but this is plain circularity. The account works on the assumption that only 
self-connected entities can inhabit the domain of discourse, yet that is precisely 
the assumption that (53) is meant to characterize. 
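
The circularity can be made concrete in a finite model. The sketch below is only an illustration under stated assumptions: space is a strip of five atomic cells, regions are non-empty sets of cells, parthood is set inclusion, and the function name `joined` is a hypothetical stand-in for the relation defined in (53). The same condition is evaluated twice, once over all regions and once over one-piece regions only.

```python
from itertools import combinations

CELLS = range(5)  # a one-dimensional strip of five atomic cells (an assumption)

def nonempty_subsets(cells):
    """All non-empty regions that can be built from the given cells."""
    cells = list(cells)
    return [frozenset(c) for r in range(1, len(cells) + 1)
            for c in combinations(cells, r)]

def is_interval(region):
    """A region is in one piece if its cells are consecutive."""
    return max(region) - min(region) + 1 == len(region)

def overlaps(a, b):
    return bool(a & b)

def joined(x, y, domain):
    """Condition (53), with both quantifiers ranging over `domain`."""
    return any(
        overlaps(z, x) and overlaps(z, y)
        and all(overlaps(w, x) or overlaps(w, y)
                for w in domain if w <= z)      # w is part of z
        for z in domain
    )

ALL_REGIONS = nonempty_subsets(CELLS)            # scattered regions allowed
ONE_PIECE = [r for r in ALL_REGIONS if is_interval(r)]

x, near, far = frozenset({0}), frozenset({1}), frozenset({2})
```

In this model `joined(x, far, ALL_REGIONS)` holds, because the scattered region made of cells 0 and 2 serves as z, whereas `joined(x, far, ONE_PIECE)` fails. The definition separates the two cases only when self-connectedness is already built into the domain.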

These considerations apply mutatis mutandis to other attempts to subsume 
topological connectedness within a bare mereological framework (see e.g. 
Bostock 1979, Needham 1981, Ridder 2002). Nor is this exclusively an onto- 
logical concern. These limits show up in any attempt to account for a number 
of important spatial concepts besides connectedness, such as the distinction 





⁷ The definition below corresponds to the formulation given in Whitehead (1920, p. 76). Whitehead’s earlier
definition (1919, p. 102) is slightly different but essentially equivalent.

between a completely interior part and a tangential part that is connected with
the exterior, or the difference between an open entity and a closed one. All of
these—and many others indeed—are relations that any theory concerned with 
the spatial structure of the world should supply and which cannot, however, be 
defined directly in terms of plain mereological primitives. 


2.1 ‘Contact’ and connection 


It is here that topology comes into the picture. The connection relation that 
Whitehead was seeking to characterize is a topological relation. And if it cannot 
be defined in mereological terms, it must be formally treated on independent 
grounds. 

Before looking at how this can be done, it is important again to begin with 
a couple of terminological caveats. As with ‘part’, the term ‘connection’, and 
cognate terms such as ‘contact’ and ‘touching’, have different meanings in 
ordinary language, only some of which correspond to the intended relation. 
Most notably, in ordinary language we do not draw a clear distinction between 
a truly topological notion of connection and a merely metric notion of contact. 
Consider: 


(54) The handle is attached to the mug. 
(55) The table is touching the wall. 


The relation exemplified by (54) is topological: the handle and the rest of 
the mug form a unitary whole. For practical purposes there may be room for 
free rein, depending on whether the handle is glued to the rest of the mug (what 
Galton 2000, §4.2, calls ‘adhesion’) or truly continuous with it (‘cohesion’), but
either way there is an obvious sense in which we are dealing with a single, one- 
piece object. By contrast, the relation exemplified by (55) is not topological 
but metric: the table is so close to the wall that we are inclined to say they 
are connected. If space were discrete, this might be the right thing to say. 
But if space is dense, as we may plausibly assume, then the surfaces of two 
bodies can never truly be connected, short of overlapping: there will always 
be a narrow gap separating them. The narrower this gap, the easier it is to 
disregard it for practical purposes, but genuine topological connection can only 
obtain when the gap is reduced to zero. We shall see in Sec. 2.5.1 that some 
interesting topological relations may be introduced to capture at least some uses 
of the metric relation of contact. Overall, however, metric relations cannot be 
squeezed into the conceptual apparatus of topology without the help of strongly 
simplifying assumptions on the structure of space. (In this sense, a diagram 
such as Fig. 15.8, left, is ambiguous, since one might think of the two squares 
x and y as being merely close to each other, as when we draw a picture of a 
table against a wall.)

A related issue concerns the distinction between connection patterns that
involve a single point of contact, as in (56), or an extended boundary portion, 
as in (57), if not of an entire boundary, as in (58): 


(56) Colorado is connected to Arizona. 
(57) France is connected to Germany. 
(58) The Vatican is connected to Italy. 


To some extent this is a matter of convention. We may disregard (56) as irrel- 
evant, or we may treat it as an acceptable case. Since most theories go for the 
second option, we shall follow them on this score. As it turns out, under suitable 
conditions one can on such basis draw all the relevant distinctions (Sec. 2.5), 
so the choice proves to be a convenient one. 

Finally, let us just mention the fact that topological connection is, in a way, an 
idealized relation. Physically, as we know, the world consists of objects that are 
not continuous (or dense) in the relevant sense, and speaking of their boundaries 
is like speaking of the ‘flat top’ of a fakir’s bed of nails (Simons 1991a, p. 91).
Physically, a mug is just a swarm of subatomic particles whose exact shape and 
extension involves the same degree of arbitrariness as a mathematical graph 
smoothed out of scattered data. In this sense, the intuition behind the claim 
that (54) provides a good example of topological connection betrays a naive 
conception of the mid-size world. However, this is not to say that topology is 
inadequate as a tool for practical spatial reasoning. For, on the one hand, we 
are generally interested in describing the spatial structure of the world precisely 
insofar as objects are conceptualized as finite chunks of dense matter with 
closed, continuous boundaries. Even if talk of boundaries and contact were 
deemed unsuited to the ontology of the physical sciences, one would therefore 
need it when it comes to the dense entities carved out by ordinary discourse 
and to the spatial regions that these occupy. On the other hand, the geographic 
examples in (56)-(58) illustrate that at least some entities countenanced by 
common sense measure up to the strict standards of topological connection. We 
do want to say that a geopolitical unit occupies a region of space that is strictly 
dense in the relevant sense, even though the underlying territory may consist 
of material stuff that on closer inspection is best described as a gerrymandered 
aggregate of zillions of disconnected subatomic particles. 


2.2 Basic principles and definitions 


We are now in a position to take a closer look at the idea of a topological exten- 
sion of mereology—an extension that would take us beyond the prospects of 





⁸ This may also depend on context. In chess, for example, the choice depends on whether we are a rook or a
bishop.

a pure theory of parthood to deliver a genuine theory of parts and wholes. To
this end, let us expand our formal language by adding a second distinguished 
predicate constant, ‘C’, to be understood intuitively as the relation of topo- 
logical connection. The question of how mereology can actually be expanded 
to a richer part-whole theory may then be addressed by investigating how a 
P-based mereological system of the sort outlined in Sec. 1 can be made to 
interact with a C-based topological system. 

Again, we may distinguish for this purpose ‘lexical’ from substantive postu- 
lates for ‘C’, regarding the former as embodying a set of minimal prerequisites 
that any system purporting to explicate the meaning of the concept of ‘connec- 
tion’ must satisfy. And a natural starting point is to assume that such lexical 
principles include at least the twofold requirement that ‘C’ be reflexive and 
symmetric: 


(C.1) Cxx                Reflexivity
(C.2) Cxy → Cyx          Symmetry


There is little room for controversy concerning the intuitive adequacy of (C.1)- 
(C.2), provided that we take ‘C’ to express, not just the relation of external 
connection that may obtain between two disjoint entities that share a common 
boundary (in some intuitive sense to be made precise), but the relation of con- 
nection that may obtain between any two entities that share at least a boundary. 
In this sense, mereological overlap qualifies as connection, too. We shall come 
back to this idea shortly. First, let us note that ‘C’ need not, on the intended 
interpretation, express a transitive relation. France is connected to Germany 
and Germany to Poland, but France and Poland are not connected—they do 
not share any common boundary. We can, however, consider a notion of con- 
nection that captures the fact that France is connected to Poland by Germany. 
De Laguna (1922), a forerunner in the area of qualitative topological reason- 
ing, actually based his account on a three-place primitive corresponding to this 
relation.⁹ In terms of ‘C’, De Laguna’s primitive is easily defined:


(59) BCxyz =df Cxz ∧ Czy          By-Connection


and we can accordingly introduce the desired notion of (possibly) indirect or 
mediate connection as follows: 





(60) MCxy =df ∃z BCxyz          Mediate Connection


By an obvious generalization, we can also define: 





⁹ Strictly speaking, De Laguna’s primitive is interpreted as ‘x can be connected to y by z’, so it involves a
modal ingredient. For a formal treatment, see Giritli (2003).

(61) MCⁿxy =df ∃z₁...zₙ(Cxz₁ ∧ ... ∧ Czₙy)          n-Connection


It follows immediately from (C.1) and (C.2) that each MCⁿ is reflexive and
symmetric, and the union of all such relations is transitive. In the absence of 
further principles, however, e.g., principles guaranteeing the existence of an 
entity connected to all the intermediate links, such a transitive union cannot be 
defined in the object language—unless we allow for quantification over positive 
integers: 





(62) TCxy =df ∃n MCⁿxy          Transitive Connection
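
As a rough illustration of (59)-(62), the following sketch evaluates the defined relations over a tiny hand-made domain. The border data and the inclusion of an isolated fourth country are assumptions introduced only for the example, and the bound on chain length in `TC` is justified only because the domain is finite.

```python
# Hypothetical adjacency data; overlap is ignored, so C holds only between
# a country and itself or a country it borders.
BORDERS = {("France", "Germany"), ("Germany", "Poland")}
COUNTRIES = {"France", "Germany", "Poland", "Iceland"}

def C(x, y):
    """Connection: reflexive (C.1) and symmetric (C.2) by construction."""
    return x == y or (x, y) in BORDERS or (y, x) in BORDERS

def BC(x, y, z):
    """By-connection (59): x is connected to y by z."""
    return C(x, z) and C(z, y)

def MC(x, y):
    """Mediate connection (60): some z connects x to y."""
    return any(BC(x, y, z) for z in COUNTRIES)

def MCn(n, x, y):
    """n-connection (61): a chain x, z1, ..., zn, y of connected links."""
    if n == 0:
        return C(x, y)
    return any(C(x, z) and MCn(n - 1, z, y) for z in COUNTRIES)

def TC(x, y):
    """Transitive connection (62). On a finite domain it suffices to try
    chains no longer than the number of entities."""
    return any(MCn(n, x, y) for n in range(1, len(COUNTRIES) + 1))
```

With this data France and Poland are not connected, yet they are mediately connected through Germany, and the isolated country is transitively connected to nothing but itself.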


Now, let T be the first-order theory defined by the two basic axioms (C.1) 
and (C.2), in analogy with the theory M defined by the basic mereological 
axioms (P.1)-(P.3). T is, of course, extremely weak and a lot will have to be 
added before we can say we have an interesting topology. In particular, a model 
of T can be obtained simply by interpreting ‘C’ as mereological overlap, and 
what further principles should be added to T so as to distinguish C from O is 
precisely one of the questions a good topological theory is meant to answer. For 
instance, should one assume that connection is extensional, i.e., that things that 
are connected exactly to the same entities are identical? Should one assume that 
any two connected entities satisfy at least some form of Whitehead’s account 
in (53)? Or consider the binary relation defined by 


(63) Exy =df ∀z(Czx → Czy)          Enclosure


It follows from (C.1)-(C.2) that this relation is reflexive and transitive, and 
if C is extensional, then E is also antisymmetric, hence a partial ordering. Should
one assume this relation to satisfy any analogues of the axioms for parthood? 
For each mereological predicate defined in Sec. 1 using ‘P’ one could now 
introduce a corresponding topological predicate using ‘E’ instead. Should one 
assume any corresponding axioms? 
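
The observation that ‘C’ can be read as overlap, and the question of how E then behaves, can be checked mechanically in a small model. The sketch assumes a three-atom domain whose regions are the non-empty sets of atoms, with ‘C’ interpreted as overlap; the variable names are hypothetical.

```python
from itertools import combinations

ATOMS = (0, 1, 2)  # an assumed three-atom universe
REGIONS = [frozenset(c) for r in range(1, len(ATOMS) + 1)
           for c in combinations(ATOMS, r)]

def C(x, y):
    """Connection read as mereological overlap (one admissible model of T)."""
    return bool(x & y)

def E(x, y):
    """Enclosure (63): whatever is connected to x is connected to y."""
    return all(C(z, y) for z in REGIONS if C(z, x))

reflexive = all(E(x, x) for x in REGIONS)
transitive = all(E(x, z) for x in REGIONS for y in REGIONS for z in REGIONS
                 if E(x, y) and E(y, z))
antisymmetric = all(x == y for x in REGIONS for y in REGIONS
                    if E(x, y) and E(y, x))
# In this particular model enclosure coincides with parthood (set inclusion).
matches_parthood = all(E(x, y) == (x <= y) for x in REGIONS for y in REGIONS)
```

In this model E is a partial ordering and coincides with parthood, which shows why further principles are needed if C is to be distinguished from O.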

As it turns out, it is difficult to answer these questions in an abstract setting 
(see Cohn and Varzi 2003). Obviously, much depends on how exactly ‘C’ 
is interpreted, and that in turn may depend on how one thinks ‘C’ and ‘P’ 
should interact. Rather than pursuing these questions in isolation, then, let us 
proceed immediately to examining the main options for combining mereology 
and topology. 


2.3 Bridging principles 


The simplest option is just to append the T-axioms to our preferred mereological
theory, X, to obtain a corresponding ‘mereotopology’ X + T, which can then
be strengthened by supplying further axioms for ‘C’. However, this would
be of little interest unless one also adds some mixed principles to establish an
explicit ‘bridge’ between X and T.

2.3.1 Parts and wholes. There is one sort of bridging principle that
most theories, if not all, accept: it centers around the intuition that no matter 
how P and C are fully characterized, they must be related in such a way that 
a whole and its parts are tightly connected. Here are three ways one can try to 
capture this intuition: 


(C.3a) Pxy → Cxy          Integrity
(C.3b) Oxy → Cxy          Unity
(C.3)  Pxy → Exy          Monotonicity


The first principle, (C.3a), is perhaps the most immediate: just as everything
is connected to itself by (C.1), everything must be connected to its constitutive 
proper parts. This is not to say that the parts must all be connected to one 
another: the two main parts of a bikini are not. But they are, in an obvious 
sense, connected to their sum; they are detached from each other but not from 
the whole bikini. 

As it stands, however, (C.3a) is extremely weak. It doesn’t even capture
the idea that if something is part of two things, then those things are thereby 
connected. This is not to say that they are connected by that common part, in the 
sense defined in (59); they are connected because of that common part. In other 
words, if sharing a common boundary is to count as sufficient for connection, 
then a fortiori sharing a common part ought to be sufficient, too. It is in this 
sense that overlap is to be regarded a special (and somewhat trivial) case of 
connection. And the second principle, (C.3b), makes this explicit.

(C.3b) is stronger than (C.3a), since parthood implies overlap. Moreover,
since the converse need not hold (on pain of trivializing the notion of
connection), (C.3b) provides the intuitive grounds for defining a non-trivial notion of
external connection, or touching, that can only hold between mereologically 
disjoint entities: 


(64) ECxy =df Cxy ∧ Dxy          External Connection


(Note that this relation is still symmetric, but not reflexive; it is actually irreflex- 
ive, due to the irreflexivity of D.) This is an important notion, which makes 
all the difference between mereology and mereotopology. Yet (C.3b) is still
too weak to capture the fundamental intuition that we are after. For while this 
principle guarantees that overlapping a part is sufficient for being connected 
to the whole, it doesn’t secure that touching a part is also sufficient. Surely 
something can touch a mug (say) just by touching its handle. So it is only 
with the third principle, (C.3), that we get a plausible formulation of the basic 
idea. Connection, if it is to behave properly, must be monotonic with respect 
to parthood. 
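
The three candidate principles can be tested against a simple discrete model. The sketch below rests on assumptions that are not part of the text: regions are non-empty sets of cells on a strip of four cells, parthood is inclusion, and two regions count as connected when they share a cell or have neighbouring cells. The `mug`, `handle`, and `spoon` regions are hypothetical.

```python
from itertools import combinations

CELLS = range(4)  # an assumed strip of four atomic cells
REGIONS = [frozenset(c) for r in range(1, len(CELLS) + 1)
           for c in combinations(CELLS, r)]

def P(x, y):
    return x <= y                      # parthood as inclusion

def O(x, y):
    return bool(x & y)                 # overlap: a shared cell

def C(x, y):
    """Connection: the regions share a cell or have neighbouring cells."""
    return any(abs(a - b) <= 1 for a in x for b in y)

def E(x, y):
    """Enclosure (63)."""
    return all(C(z, y) for z in REGIONS if C(z, x))

def EC(x, y):
    """External connection (64): connected but disjoint."""
    return C(x, y) and not O(x, y)

pairs = [(x, y) for x in REGIONS for y in REGIONS]
integrity = all(C(x, y) for x, y in pairs if P(x, y))      # (C.3a)
unity = all(C(x, y) for x, y in pairs if O(x, y))          # (C.3b)
monotonicity = all(E(x, y) for x, y in pairs if P(x, y))   # (C.3)

mug, handle, spoon = frozenset({0, 1}), frozenset({1}), frozenset({2})
```

All three principles hold in this model, and the spoon touches the mug precisely because it touches the handle: both `EC(spoon, handle)` and `EC(spoon, mug)` come out true.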
It is easily checked that (C.3) implies (C.3b), hence (C.3a), so let us just focus
on (C.3), and let us call MT (for Minimal (mereo)Topology) the corresponding
extension of T.¹⁰ Whether (C.3) is to be classified as a ‘lexical’ principle may
be controversial and will depend, in an obvious sense, on the underlying axioms 
for ‘P’. Nonetheless, the principle itself is part of virtually every mereotopolog- 
ical theory in the literature, either as an axiom (Varzi 1996a, Donnelly 2004) 
or as a theorem. And although MT is still far from providing an adequate 
characterization of the relation of topological connection, it provides the basis 
for the definition of a number of important spatial relations which, like EC, 
cannot be distinguished within a purely mereological setting. In particular, we 
can now express the difference between a proper part that lies entirely within 
the interior of the whole and a proper part that is connected with the exterior: 


(65) IPPxy =df PPxy ∧ ∀z(Czx → Ozy)          Interior PP
(66) TPPxy =df PPxy ∧ ¬IPPxy                 Tangential PP
(67) EPExy =df PExy ∧ ∀z(Czy → Ozx)          Exterior PE
(68) TPExy =df PExy ∧ ¬EPExy                 Tangential PE


Note that, given (64), interior parts satisfy the following: 
(69) IPPxy ↔ (PPxy ∧ ¬∃z(ECzx ∧ ECzy))





Thus, tangential parts are those parts that reach far enough to touch something 
with which the whole itself is just in touch. Similarly for proper extensions. The 
diagram in Fig. 15.9 indicates how these predicates may represent a genuine 
addition to the mereological vocabulary introduced in (19)—(23) and illustrated 
in Fig. 15.1. We shall see in Sec. 2.4 that this diagram may actually be mislead- 
ing, owing to the delicate role played by boundaries in the proper understanding 
of the connection relation; but for the moment we take the intuitive, geometric 
interpretation of the diagram to be adequate enough to serve its purpose. 
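
The definitions in (65)-(66) and the equivalence in (69) can be checked in the same kind of discrete setting. As before, this is a sketch under assumptions of our own: a strip of five cells, parthood as inclusion, and connection as sharing or neighbouring cells. `IPP_alt` is a hypothetical name for the right-hand side of (69).

```python
from itertools import combinations

CELLS = range(5)  # an assumed strip of five atomic cells
REGIONS = [frozenset(c) for r in range(1, len(CELLS) + 1)
           for c in combinations(CELLS, r)]

def PP(x, y):
    return x < y                       # proper parthood as proper inclusion

def O(x, y):
    return bool(x & y)

def C(x, y):
    return any(abs(a - b) <= 1 for a in x for b in y)

def EC(x, y):
    return C(x, y) and not O(x, y)

def IPP(x, y):
    """Interior proper part (65)."""
    return PP(x, y) and all(O(z, y) for z in REGIONS if C(z, x))

def TPP(x, y):
    """Tangential proper part (66)."""
    return PP(x, y) and not IPP(x, y)

def IPP_alt(x, y):
    """Right-hand side of (69): no z touches both x and y from outside."""
    return PP(x, y) and not any(EC(z, x) and EC(z, y) for z in REGIONS)

whole = frozenset({1, 2, 3})
middle, edge = frozenset({2}), frozenset({1})
agree = all(IPP(x, y) == IPP_alt(x, y) for x in REGIONS for y in REGIONS)
```

Here the middle cell is an interior proper part of the three-cell whole, the edge cell is a tangential one (cell 0 touches it without overlapping the whole), and the two formulations of interior parthood agree on every pair of regions in the model.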


2.3.2 Parthood vs. enclosure. Things begin to be controversial as 
soon as we consider the possibility of stronger bridging principles. Consider 
again the three principles above. Clearly the converse of Integrity, (C.3a),
is unacceptable. The converse of Unity, (C.3b), is unacceptable as well, for
then connection would collapse into the relation of mereological overlap and the
definitions in (64)-(68) would lose their intuitive appeal. On the other hand, 
the converse of the Monotonicity principle (C.3) is not obviously unreasonable: 


(C.4) Exy → Pxy          Converse Monotonicity





¹⁰ In fact, the result of adding (C.3) to T yields a slightly redundant theory; a more elegant formulation can be
obtained by dropping (C.2) and replacing (C.3) with the following variant: (C.3′) Pxy ∧ Cxz → Czy.

Figure 15.9. Basic mereotopological relations. (Shaded cells indicate connection; darker
shading stands for parthood.)


This says that a sufficient condition for one thing to be part of another is that 
whatever is connected to the former is also connected to the latter. This sounds
intuitive, and several authors would actually include (C.4) as a further bridging 
principle on top of MT. Indeed, a principle along these lines may already be 
found in Whitehead’s latest work, Process and Reality (1929)—a remarkable 
fact, since the conjunction of (C.3) and (C.4) yields a biconditional that would 
allow one to define parthood in terms of connection: 


(70) Pxy ↔ Exy


Thus, if (53) turned out to be a defective attempt to reduce topological concepts 
to purely mereological ones, (C.4) (together with (C.3)) reflects a reductionist 
attempt in the opposite direction, to the effect that mereological concepts can be 
defined in terms of purely topological ones. And although few have followed 
Whitehead on the first route, it is a fact that many authors have taken the 
second strategy into serious consideration. Clarke (1981) provides the most 
influential example, and the so-called ‘Region Connection Calculus’ originating
with Randell et al. (1992) is the best case in point when it comes to theories 
designed specifically for applications to spatial reasoning (see also Gotts et 
al. 1996 and Cohn et al. 1997). So the question deserves close scrutiny: Is 
(C.4) a reasonable addition to the basic postulates of MT? 

Never mind the fact that working with just one primitive may be mathemati- 
cally attractive. As it turns out, it is equally possible to rely on a single primitive 
even in the absence of (C.4). For instance, one can rely on the ternary relation 
‘x and y are connected parts of z’ (Varzi 1994). Writing this as ‘CPxyz’, one 
could define ‘P’ and ‘C’ as follows: 


(71) Pxy =df ∃z CPxzy
(72) Cxy =df ∃z CPxyz





and then go on to develop a mereotopological theory based on the relative 
irreducibility of these two predicates. (Note that (71) and (72) only carry

Figure 15.10. Two counterexamples to converse monotonicity.


minimal presuppositions: that parthood be reflexive and that connected entities 
have an upper bound.) So the issue is not formal economy—the use of a single 
primitive. It is, rather, conceptual economy. In the extension of MT obtained by
adding (C.4)—henceforth RMT, for Reductive Mereotopology—the notion of 
parthood is fully subsumed under that of connection, and the limits of mereology 
are overcome by turning the original problem upside down: parthood cannot 
deliver the full story, but connection can. However, there are at least two worries 
here. 

The first worry concerns the material adequacy of the reduction. As Masolo 
and Vieu (1999) have observed (but the point goes back to Randell et al. 1992: 
§5.1), (C.4) appears to have implausible consequences if the domain contains
entities with atomic proper parts. Consider an extended region (or object) x, 
and let y be x minus an atomic part z (Fig. 15.10, left). On any reasonable 
understanding of ‘C’, everything connected to x is connected to y, since z is 
connected to both. So (C.4) would force x to be part of y. Yet, intuitively, x 
should count as an extension of y: it is bigger, it contains z, it contains y as a 
further proper part. Things get worse if we consider that (C.4) forces z itself to 
count as part of y, since z is connected to y and anything else is connected to z 
only if it overlaps y. Yet y was defined as x minus z. Of course, such models 
would be ruled out if RMT were strengthened by adding an atomlessness 
postulate such as (P.7). But this is precisely the point: (C.4) does not merely 
reinforce the bridge between P and C; it actually embodies more substantive 
views about the mereological structure of space. Besides, even the atomless 
variant ARMT would be open to counterexamples. For we have the same 
sort of problem if we suppose that z is a non-atomic proper part of x with no 
interior proper parts of its own (Fig. 15.10, right). To rule out this model, a 
stronger assumption than (P.7) would be needed, corresponding to the thesis 
that everything has interior proper parts: 





(C.5) ∃x IPPxy          Boundarylessness


In fact, it is precisely with the help of an axiom like (C.5) that we can give 
expression to a Whiteheadian, boundary-free conception of space (see below, 
Sec. 2.4.3). However, this is a controversial conception. Why should our
analysis of parthood force upon us a rebuttal of the boundary concept? Why
should we assume (C.5) in order to ensure a coherent implementation of a basic 
bridging principle like (C.4)? 
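
The first worry can be reproduced in a discrete model. The sketch is an illustration under assumptions of our own rather than the construction of Fig. 15.10: space is a strip of seven atomic cells, parthood is inclusion, and regions are connected when they share or neighbour a cell. The region x has an atomic part z in its middle, and y is x minus z.

```python
from itertools import combinations

CELLS = range(7)  # a strip of seven atomic cells; an illustrative assumption
REGIONS = [frozenset(c) for r in range(1, len(CELLS) + 1)
           for c in combinations(CELLS, r)]

def P(x, y):
    return x <= y                      # parthood as inclusion

def C(x, y):
    """Connection: the regions share a cell or have neighbouring cells."""
    return any(abs(a - b) <= 1 for a in x for b in y)

def E(x, y):
    """Enclosure (63): whatever is connected to x is connected to y."""
    return all(C(w, y) for w in REGIONS if C(w, x))

x = frozenset({2, 3, 4})   # an extended region
z = frozenset({3})         # an atomic part in the middle of x
y = x - z                  # x minus z
```

In this model `E(x, y)` and `E(z, y)` both hold although neither x nor z is part of y, so adding (C.4) would force exactly the unwelcome parthood claims discussed above. Since `E(y, x)` also holds while x and y are distinct, the model likewise shows connection failing to be extensional.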

The second worry is more general. Consider an object and the stuff it is 
made of—for instance, a statue and the corresponding amount of clay. As 
we have seen, few would regard the relationship of material constitution that 
holds between such entities as a case of proper parthood. And there are many 
philosophers for whom constitution is not identity (improper parthood) either: 
see again the papers in Rea 1997. This is by no means an uncontroversial thesis, but never
mind. The point is simply that the relationship between the clay and the statue 
is not obviously an instance of parthood. Yet, on any plausible understanding 
of ‘C’, whatever is connected to the clay is bound to be connected to the statue, 
too, so (C.4) would immediately settle the issue: the clay is part of the statue. 
Indeed, since it is equally plausible to suppose that the same applies in the 
opposite direction—whatever is connected to the statue is connected to the 
clay—(C.4) implies that the statue and the clay are one and the same thing. 
And this is a substantive tenet which, as such, ought not to be built into the 
basic apparatus of mereotopology at the outset. 

This worry is perhaps best appreciated by noting that the following theorem 
is an immediate consequence of (C.1)-(C.4), provided P satisfies the basic 
mereological axioms (P.1)-(P.3): 


(73) ∀z(Czx ↔ Czy) → x = y


In fact, (P.1) and (P.2) (reflexivity and transitivity) are derivable from (70), and 
RMT turns out to be equivalent to the theory defined by taking (73) as an axiom 
along with (C.1) and (C.2). (That is actually the customary axiomatization since 
Clarke 1981; see also Biacino and Gerla 1991.) Now, with parthood construed 
as enclosure, (73) is nothing but the antisymmetry principle (P.3). Yet (73) 
does not merely assert the antisymmetry of parthood; it says that connection is 
extensional—that different things cannot connect to the same things. And this 
is just as controversial as the thesis that parthood is extensional. 

It could be replied that the analogy with mereological extensionality is in 
fact helpful, since the original arguments in support of (P.5) (Sec. 1.3.2) could 
now be offered on behalf of (C.4). Indeed, if a statue and the clay are construed 
as things that, at some level of decomposition, share the same constituents— 
e.g., the same molecules—the question of whether constitution is identity just 
is the question of whether parthood is extensional. However, the worry does 
not only apply to cases of material constitution. Consider a shadow cast onto a 
wall. The shadow is not part of the wall, yet anything connected to the shadow 
is—arguably—connected to the wall. Or consider a stone inside a hole. The 
stone is not part of the hole, yet one could argue that anything connected to the 
stone is connected to the hole. Broadly speaking, the problem arises as soon as
we allow for the possibility that distinct entities occupy the same space (Casati
and Varzi 1999). That this is a real possibility is by itself contentious and is 
one of the questions to be addressed by an explicit theory of location (Sec. 3). 
But precisely for this reason, ruling it out on mereotopological grounds would 
be utterly inappropriate. 

This last point is particularly worth stressing, for it shows once again how 
the choice of a suitable set of principles may depend crucially on whether we 
are interested in a theory aimed at modeling the domain of all spatial entities 
or just a domain of pure spatial regions. The worry mentioned above arises 
forcefully in the context of theories of the first sort. It does, however, lose its 
force in relation to the second sort of theory, since two regions cannot overlap 
spatially without overlapping mereologically. Now, it is a fact that most authors 
committed to (C.4) have been working on such a narrower understanding of 
their theory. Whitehead’s own account was explicit in this regard: in the theory 
put forward in Process and Reality (as in Clarke’s 1981 reformulation), the 
field of C was meant to consist exclusively of spatial regions. On the other 
hand, it is also a fact that a major motivation for developing a theory of this 
sort has been the assumption that connection thus understood is all that matters 
for practical purposes. For one can always treat the relation of connection as 
the ‘shadow’ (in De Laguna’s apt terminology; 1922, p. 450) of the relation of
physical contact or overlap that may obtain between actual, concrete entities. In 
other words, such theories have typically been developed on the assumption that 
the following principle provides the necessary and sufficient link between the 
mereotopology of pure space and the mereotopology of spatial entities broadly 
understood: 


(74) x is connected to y if and only if the region occupied by x
is connected to the region occupied by y. 


If so, however, then the problems mentioned above resurface even for theories 
of this sort. For (74) will deliver an acceptable account if, and only if, spatial 
co-location is regarded as metaphysically impossible. 


2.3.3 Self-connectedness. There are other ways of supplementing MT 
with bridging principles that go beyond (C.3). In particular, consider again 
Whitehead’s early attempts to characterize topological connection in terms of 
parthood, i.e. (53), with ‘ψ’ understood as ‘C’. As a definition, this was found
defective. However, one may certainly consider adding the corresponding 
biconditional as an axiom—or at least adding one of the two conditionals: 





(C.6) Cxy → ∃z(Ozx ∧ Ozy ∧ ∀w(Pwz → Owx ∨ Owy))          Left Join
(C.7) ∃z(Ozx ∧ Ozy ∧ ∀w(Pwz → Owx ∨ Owy)) → Cxy          Right Join

Would this be a good way of tightening the conceptual link between the mere-
ological and the topological ingredients of MT?

Consider (C.6). As it turns out, its status depends significantly on the under- 
lying axioms for ‘P’. If our mereological theory is strong enough to warrant 
the existence of a fusion for any pair of connected entities, i.e., if it contains 
the relevant instance of (P.11,,) as an axiom or (as in GEM) as a theorem, then 
(C.6) itself is derivable as a theorem, since the fusion of x and y is sure to qual- 
ify as a z satisfying the consequent. If, however, our theory does not warrant 
all the relevant fusions, then (C.6) may still be regarded as a plausible addition 
to MT. Perhaps the fusion of any two connected entities turns out to be too
large or gerrymandered to be acceptable, but we may still think of connection 
as sufficient for the existence of smaller fusions encompassing those portions 
of x and y that are sufficiently close to their common boundary. For example, 
with reference to Fig. 15.11, left, suppose we are only willing to acknowledge 
the existence of entities that are composed of at most two disconnected parts. 
Then the fusion of x and y is out, but a fusion of x2 and y2, as well as fusions 
of x and y2 and of x2 and y, would fit the bill. All of this speaks in favor of 
(C.6), though it may be argued that the existential import of this principle goes 
beyond the task of establishing a necessary conceptual link between P and C. 

As for (C.7), the picture is different. Assuming this principle is virtually 
tantamount to excluding disconnected entities from the domain—not all of 
them, to be sure, but many of them. For example, if the underlying mereology 
is sufficiently weak, (C.7) is compatible with the existence of a disconnected 
composite such as c in Fig. 15.11, right: on the assumption that c has no further
proper parts besides a1, a2, and a3 (and parts thereof), the antecedent is false 
so (C.7) is vacuously satisfied. But consider a bikini, or a printed inscription 
consisting of separate letter tokens. As we have already noted, one need not 
buy into unrestricted composition to appreciate the dignity of such things. Yet 
their existence would be banned by (C.7). If x and y are the two main parts of a 
bikini, then the consequent of (C.7) is false even though the antecedent is made 
true by the bikini as a whole. So, on the face of it, this direction of Whitehead’s 
biconditional is definitely too strong as a general bridging principle and there is 
no philosophically neutral reason to add it to MT. One can, however, consider 
weaker versions, to the effect that the consequent of the conditional must hold 
whenever the antecedent is made true by the right sort of entity: 





(C.7φ) $\exists z(\varphi z \wedge Ozx \wedge Ozy \wedge \forall w(Pwz \rightarrow (Owx \vee Owy))) \rightarrow Cxy$


In particular, one can take ‘φ’ to express the property of being self-connected.
After all, this was precisely the intended import of Whitehead’s flawed
definition. And we have seen that the flaw of the definition, in the if direction,
was not conceptual but formal: it lay exclusively in the impossibility of
expressing the relevant restriction in mereological terms. By making the
restriction explicit, (C.7φ) overcomes the difficulty and suggests itself as a
natural bridging principle.

Figure 15.11. Left joining with partial fusions; right joining without connection.
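
The bikini objection to (C.7), and the effect of the restriction in (C.7φ), can be checked mechanically on a toy model. The following is only a sketch under stated assumptions: entities are modeled as nonempty sets of atoms, parthood as set inclusion, overlap as nonempty intersection, and the extension of ‘φ’ is stipulated by hand rather than defined. The names `nonempty_subsets`, `make_connection` and `right_join_violations` are hypothetical helpers introduced for this illustration.

```python
from itertools import combinations


def nonempty_subsets(atoms):
    """All nonempty sets of atoms, i.e. every fusion in a toy atomistic model."""
    atoms = sorted(atoms)
    return [frozenset(c) for r in range(1, len(atoms) + 1)
            for c in combinations(atoms, r)]


def make_connection(adjacent_pairs):
    """Build C: overlap, or contact between some atom of x and some atom of y."""
    adjacent = {frozenset(pair) for pair in adjacent_pairs}

    def connected(x, y):
        if x & y:
            return True
        return any(frozenset((a, b)) in adjacent for a in x for b in y)

    return connected


def right_join_violations(domain, connected, phi=lambda z: True):
    """Triples (x, y, z) making the antecedent of (C.7φ) true while Cxy fails."""
    violations = []
    for x in domain:
        for y in domain:
            for z in domain:
                antecedent = (
                    phi(z)
                    and bool(z & x) and bool(z & y)
                    # every part w of z overlaps x or y
                    and all((w & x) or (w & y) for w in domain if w <= z)
                )
                if antecedent and not connected(x, y):
                    violations.append((x, y, z))
    return violations


top, bottom = frozenset({"top"}), frozenset({"bottom"})
bikini = top | bottom
domain = nonempty_subsets({"top", "bottom"})
no_contact = make_connection([])  # the two pieces do not touch

# With phi trivially true this is plain (C.7): the bikini is a counterexample.
unrestricted = right_join_violations(domain, no_contact)

# Stipulated assumption: only the two pieces count as self-connected.
restricted = right_join_violations(
    domain, no_contact, phi=lambda z: z in {top, bottom})
```

On this model the unrestricted check reports the two triples in which z is the whole bikini, while the restricted check reports none, since the bikini no longer qualifies as a witness for the antecedent.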

Surprisingly, it is not easy to express the property of self-connectedness 
even in the extended language of mereotopology. If the axioms on P are strong 
enough, we can follow the ordinary set-theoretic definition—something is self- 
connected if it doesn’t consist of disconnected parts: 


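(75) $SCx =_{df} \forall y \forall z(\forall w(Owx \leftrightarrow (Owy \vee Owz)) \rightarrow Cyz)$    Self-Connectedness

In particular, in MT + GEM this becomes:

(76) $SCx \leftrightarrow \forall y \forall z(x = y + z \rightarrow Cyz)$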


This is a common definition in the literature, both among theorists subscribing 
to the converse monotonicity principle (C.4) (see e.g. Clarke 1981, Randell et 
al. 1992) and among theorists rejecting it (Tiles 1981, Varzi 1994, Smith 1996). 
However, if the axioms on P do not secure the necessary composition patterns, 
the definition is inadequate. For example, the object c in Fig. 15.11, right, is 
anything but self-connected, yet it (vacuously) satisfies the definiens of (75) 
unless we assume the existence of at least one sum consisting of a1 and a2, a2
and a3, or a1 and a3. In the finitary case, the difficulty could be met by relying
on the notion of mediate connection: any two parts of a self-connected entity 
must be at least n-connected for some n. More generally: 


(77) $PCx =_{df} \forall y \forall z(Pyx \wedge Pzx \rightarrow TCyz)$    Path Connectedness


This, however, involves quantification over numbers (see (62)), which just 
confirms the expressivity limits in question. Moreover, (77) does not work in 
the infinitary case: the unit interval on the real line is connected, but we cannot 
account for this fact in terms of the relationships between the reals themselves; 
reference to subintervals is necessary, specifically reference to a subinterval 
and its relative complement. So, overall it appears that the notion of self- 
connectedness can be adequately grasped, via (75), only by theories that are at 
least as strong as MT + (P.6), the complementation principle, though this is at 
present an open question. 
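
The dependence of (75) on the available sums can be made concrete with a small finite check. This is a sketch under assumptions that are not in the text: entities are nonempty sets of atoms, overlap is nonempty intersection, and all quantifiers in the definiens range over an explicitly listed domain. The helper names are hypothetical, and the code does not verify that the listed domain satisfies any particular mereological axioms.

```python
from itertools import product


def overlaps(x, y):
    return bool(x & y)


def make_connection(adjacent_pairs):
    """Build C: overlap, or contact between some atom of x and some atom of y."""
    adjacent = {frozenset(pair) for pair in adjacent_pairs}

    def connected(x, y):
        if x & y:
            return True
        return any(frozenset((a, b)) in adjacent for a in x for b in y)

    return connected


def self_connected(x, domain, connected):
    """Definiens of (75), with all quantifiers ranging over `domain` only."""
    for y, z in product(domain, repeat=2):
        same_overlappers = all(
            overlaps(w, x) == (overlaps(w, y) or overlaps(w, z))
            for w in domain
        )
        if same_overlappers and not connected(y, z):
            return False
    return True


a1, a2, a3 = frozenset({1}), frozenset({2}), frozenset({3})
c = a1 | a2 | a3
scattered = make_connection([])          # a1, a2, a3 pairwise disconnected

weak_domain = [a1, a2, a3, c]            # no pairwise sums exist
richer_domain = weak_domain + [a1 | a2]  # one pairwise sum added

sc_in_weak = self_connected(c, weak_domain, scattered)
sc_in_richer = self_connected(c, richer_domain, scattered)
```

In the weak domain the scattered object c passes the test, because no admissible pair y, z ever divides it; once the sum of a1 and a2 is admitted, the pair consisting of that sum and a3 exposes the disconnection and the test fails.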

Figure 15.12. An implausible model of MT + GEM.


2.3.4 Fusions. We conclude this discussion of bridging principles by 
noting that even a theory as strong as MT + GEM + any of the above axioms is
incapable of capturing all the relevant links between mereology and topology. 
In particular, such a theory is consistent with the following implausible thesis 
(Tsai 2005, p. 137): 





(78) $\exists z(Cz(x + y) \wedge \neg Czx \wedge \neg Czy)$


A model is given in Fig. 15.12, where the curved line indicates the relevant
connection relationship (besides the obvious ones imposed by (C.3)). 

Clearly, this is a sign that some additional bridging principle is called for.
The following option suggests itself: 





(C.8) $z = \sigma x\,\varphi x \rightarrow \forall y(Cyz \rightarrow \exists x(\varphi x \wedge Cyx))$    Fusion Connection


Whether this is enough to establish a good correlation between the mereological 
structure of composite objects and their topological behavior is a question that 
can hardly be addressed in general terms. The plausibility of (C.8), however, 
seems obvious. Since we have found good reasons to also accept (C.4) and 
(C.7_SC), the theory resulting from adding these three principles to MT + GEM
suggests itself as the natural topological extension of GEM. (Recall that (C.6)
is already provable in MT + GEM.) For future reference, we shall call this
theory GEMT, for General Extensional Mereotopology.


2.4 Extensions and restrictions 


As it turns out, GEMT does not officially appear in the literature, mostly
due to the limited study of such principles as (C.7_SC) and (C.8). (The closest
relatives are the axiomatic systems advocated by Smith 1996 and Casati and
Varzi 1999. In Casati and Varzi (1999), GEMT is identified with MT + GEM.
Smith’s (1996) version is based on a primitive ‘IP’ for (possibly improper)
interior parthood. See also Pianesi and Varzi (1996a, 1996b) for similar
formulations based on the predicate ‘B’ and the operator ‘c’ defined below,
respectively.) A good thing about this theory, as well as about the core
fragment afforded by MT + GEM, is that it makes it possible to supplement the
mereotopological predicates and operators discussed so far with a number of
additional operators that mimic the standard operators of point-set topology.
For example:


(79) $ix =_{df} \sigma z\,\forall y(Czy \rightarrow Oxy)$    interior
(80) $ex =_{df} i({\sim}x)$    exterior
(81) $cx =_{df} {\sim}(ex)$    closure
(82) $bx =_{df} {\sim}(ix + ex)$    boundary


Like the mereological operators in (40)—(44), these operators are partially de- 
fined in view of the lack of a null entity that is part of everything. For instance, 
if x is a boundary, then it has no interior, and if x is the universal entity U, it has 
no exterior. Even so, in GEMT all of these operators are rather well-behaved.
In particular, we can get closer to standard topological theories by explicitly 
adding the mereologized analogues of the standard Kuratowski (1922) axioms 
for topological closure: 


(C.9) $Px(cx)$    Inclusion
(C.10) $c(cx) = cx$    Idempotence
(C.11) $c(x + y) = cx + cy$    Additivity


(These axioms are to be read as holding whenever c is defined for its arguments. 
Here and below we omit the relevant existential conditions to improve read- 
ability.) Indeed, (C.9) and (C.11) turn out to be provable in MT + GEM; see 
Tsai (2005, p. 141). 
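
For readers who want to see the point-set counterparts of (C.9)–(C.11) at work, the following sketch checks them by brute force on a three-point topological space. The space and its open sets are an illustrative choice that does not come from the text, regions are read as nonempty sets of points, parthood as inclusion and sum as union; this is the standard set-theoretic reading, not a model of GEMT itself.

```python
from itertools import combinations

POINTS = frozenset({1, 2, 3})
# A small topology on POINTS (an illustrative choice, not from the text).
OPEN_SETS = [frozenset(), frozenset({1}), frozenset({1, 2}), POINTS]


def closure(region):
    """Smallest closed set containing `region` (closed = complement of open)."""
    result = POINTS
    for open_set in OPEN_SETS:
        closed_set = POINTS - open_set
        if region <= closed_set:
            result &= closed_set
    return result


def regions():
    """Nonempty sets of points; there is no null entity, as in the text."""
    pts = sorted(POINTS)
    return [frozenset(c) for r in range(1, len(pts) + 1)
            for c in combinations(pts, r)]


def check_kuratowski():
    """Return the truth values of the analogues of (C.9), (C.10), (C.11)."""
    inclusion = all(x <= closure(x) for x in regions())
    idempotence = all(closure(closure(x)) == closure(x) for x in regions())
    additivity = all(closure(x | y) == closure(x) | closure(y)
                     for x in regions() for y in regions())
    return inclusion, idempotence, additivity


kuratowski_results = check_kuratowski()
```

Since the chosen family of open sets is a genuine topology, all three checks succeed; replacing `OPEN_SETS` with a family that is not closed under union and intersection is an easy way to see the conditions come apart.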

The possibility of supporting such developments is of course a good indi- 
cation of the strength of GEMT. Philosophically, however, this strength may 
be regarded with suspicion, and several complaints have been raised in the 
literature. 


2.4.1 The open/closed distinction. The main sort of complaint con- 
cerns the very notion of connection that the theory is meant to characterize. So 
far we have worked mostly with an intuitive notion in mind but obviously more 
can and must be said—and GEMT says a lot. In particular, the Kuratowski
extension of GEMT (KGEMT for short) yields a full account of the intended 
interpretation of ‘C’: two things are (externally) connected if and only if they 
share (only) a boundary, i.e., if and only if the closure of one overlaps the other, 
or vice versa: 


(83) $Cxy \leftrightarrow (Ox(cy) \vee O(cx)y)$
(84) $ECxy \leftrightarrow (Cxy \wedge \neg C(ix)(iy))$


Now, this shows in what sense the behavior of C in this theory closely approx- 
imates that of standard set-theoretic topological connection; just let ‘x’ and ‘y’ 
range over sets of points and interpret ‘O’ as set intersection. On the other
hand, one aspect in which ordinary point-set topology appears to conflict with 
common sense—an aspect that has been emphasized by authors interested in a 
mereotopological characterization of qualitative spatial reasoning, such as
Randell et al. (1992) and Gotts et al. (1996)—is precisely the distinction between
‘open’ and ‘closed’ entities on which it rests, and which GEMT preserves 
holus bolus: 


(85) $OPx =_{df} x = ix$    Open
(86) $CLx =_{df} x = cx$    Closed


This distinction goes at least as far back as Bolzano (1851, §66ff). But already 
Brentano (1906, p. 146) regarded it as ‘monstrous’, and we have already seen 
that the sort of idealization it embodies does not sit well with the way we 
ordinarily speak. We may intuitively grasp the difference between an open 
and a closed interval on the real line—the objection goes—and we may even 
understand how this difference applies to ideal three-dimensional manifolds 
such as Euclidean space. But what does it mean to draw a similar distinction 
in the realm of concrete spatial entities, where the very notion of a boundary 
is the result of a conceptual idealization? What does it mean to say that some 
objects are closed and some are not, and that contact is only possible between 
objects of one type and objects of the other? 
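
The set-theoretic reading of (83)–(86) on which the objection turns can be sketched for the simplest case, intervals on the real line. Everything in the block is an assumption made for illustration: the class `Interval`, the restriction to non-degenerate intervals (lo < hi), and the reading of ‘O’ as sharing at least one point.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """A non-degenerate real interval; the flags say which endpoints it owns."""
    lo: float
    hi: float
    lo_closed: bool = True
    hi_closed: bool = True

    def contains(self, p):
        if p == self.lo:
            return self.lo_closed
        if p == self.hi:
            return self.hi_closed
        return self.lo < p < self.hi


def closure(x):      # c x
    return Interval(x.lo, x.hi, True, True)


def interior(x):     # i x
    return Interval(x.lo, x.hi, False, False)


def overlaps(x, y):  # O: the two intervals share at least one point
    lo, hi = max(x.lo, y.lo), min(x.hi, y.hi)
    if lo < hi:
        return True
    return lo == hi and x.contains(lo) and y.contains(lo)


def connected(x, y):             # (83)
    return overlaps(x, closure(y)) or overlaps(closure(x), y)


def externally_connected(x, y):  # (84)
    return connected(x, y) and not connected(interior(x), interior(y))


def is_open(x):                  # (85)
    return x == interior(x)


def is_closed(x):                # (86)
    return x == closure(x)


closed_left = Interval(0, 1)                # [0, 1]
open_left = Interval(0, 1, False, False)    # (0, 1)
open_right = Interval(1, 2, False, False)   # (1, 2)
```

On this reading the closed interval [0, 1] is externally connected to the open interval (1, 2), whereas the two open intervals (0, 1) and (1, 2) are not connected at all, although they leave no gap of positive length between them. This is the kind of discrimination that the objection finds hard to transfer to concrete bodies.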

Besides, even if common sense and ordinary language were put aside, the 
open/closed distinction seems to yield genuine paradoxes as soon as we move 
from the realm of pure space to its worldly population. Consider (i) what
happens when a solid body splits into two halves. Before the splitting the two 
halves were in contact, so we are to suppose that one was closed and the other 
open, at least in the relevant contact area. But then, after the splitting, only one 
of the two halves will have a complete boundary. This may not be ‘monstrous’, 
but it certainly seems implausible: the two halves—one should think—are 
perfectly indistinguishable. On the other hand, consider (ii) what happens 
when two bodies come into contact. We may imagine the same experiment 
performed twice. First we take an open cube and push it toward a closed cube 
until they touch. Then we do the same with two closed cubes. What reason 
can we offer to explain the fact that in the latter case the two cubes will not 
come into contact? As Zimmerman (1996a, p. 12) put it, what sort of ‘repulsive 
forces’ can be posited to explain such deferential behavior? 

Figure 15.13. Wrong (left) and right (right) topological models of splitting.

There is no quarrel that these are pressing questions. (For more examples,
see Kline and Matheson 1987.) Nonetheless, there are various things one can
say in reply to such worries. Concerning (i), for example, one could say that the
paradox is grounded on a questionable model of what happens when a process
of ‘splitting’ takes place (Varzi 1997). Topologically, this is no bloodstained
business. Dissecting a solid body does not ‘bring to light’ (Adams 1984, p. 400)
a surface that was trapped inside and that must by necessity belong to one of two
severed halves. Rather, the topological model is one of gradual deformation.
Think of a splitting oil drop. The drop grows longer and, as it grows, the middle 
part shrinks and gets thinner and thinner. Eventually the right and left portions 
come apart and we have two drops, each with its own complete boundary. 
Ditto with any splitting object. A long, continuous process suddenly results in 
an abrupt topological change: there was one surface, one closed body, and now 
there are two (Fig. 15.13). Of course, one can still raise a question about the 
last point of separation: Where does this one point belong—to the left half or 
to the right half? However, this is just a sign of the magic that surrounds any 
sort of topological change, as when you drill a hole through an object. The 
instantaneous event of a sphere turning into a torus is just as magical, and this 
magic has nothing to do with the open/closed opposition. It simply reflects the 
fact that topological change marks one point at which common sense reaches the 
limits of its theoretical competence, and a complete assessment would require a 
step beyond pure part-whole theorizing. It would require a step into the territory 
of qualitative kinematics, for example (as in Davis 1993), if not an account in 
terms of the microscopic analysis of matter. (What is—physically—the ‘last 
point of separation’ involved in the splitting?) 

This line of reply on behalf of KGEMT applies to (ii) as well—the ‘merging’
puzzle (Casati and Varzi 1999). Surely the positing of repulsive forces to explain 
the peculiar behavior of the two closed cubes would be utterly and unagreeably 
ad hoc. But there are other possibilities. For instance, perhaps the two closed 
cubes will indeed come into contact. From the fact that two closed entities
cannot be in contact it does not follow that they cannot come into contact, just
as from the fact that two parts are connected it does not follow that they cannot 
be separated. Only, the coming into contact (just as the separation) determines a 
true topological catastrophe: there is a breaking through the relevant boundary 
parts and the two objects become one. (Think also of the two drops of oil 
merging into each other.) The two processes are dual: merging is the reverse 
of splitting. And both involve a seemingly magic moment that runs afoul of 
the confines of extensional mereotopology and calls for a thorough kinematic 
account. 

One could still press the objection here by noting that the puzzles admit 
of perfectly static variants, where the appeal to kinematics would be out of 
place. Consider the dilemma raised by Leonardo in his Notebooks: What is it 
that divides the atmosphere from the water? Is it air or is it water? Or consider 
Peirce’s puzzle: What color is the line of demarcation between a black spot and 
its white background? (1893, p. 98). More generally, given any object, x, does 
the boundary belong to x or to its complement? Does it inherit the properties— 
e.g., color properties—of x or of ~ x? There is no kinematic story to tell here. 
But how can one answer without selecting one candidate at random? 

Here one might reply that figure/ground considerations will help. Accord- 
ing to Jackendoff (1987, Appendix B), for example, normally a figure owns its 
boundary—the background is topologically open. This may well be the right 
thing to say vis-à-vis Peirce’s puzzle: the black spot is closed, so the line is 
black. But what is figure and what is ground when it comes to Leonardo’s case?
We do talk about the surface of the water, not of the air. But what goes on at 
the seashore? Three things meet—water, air, soil; how can figure/ground con- 
siderations help in such contexts? Perhaps such dilemmas are not real. Galton 
(2003, pp. 167ff), for example, argues that they arise as an artifact of the mod- 
eling process: surely properties like color or material constitution only apply 
to extended bodies, so it wouldn’t make sense to ask whether a boundary-like 
entity is air, water, or colored. There is, however, a less dismissive way to meet 
the challenge on behalf of KGEMT. For one may acknowledge that such 
dilemmas are real and yet insist on a friendly attitude towards the open/closed 
distinction. The actual ownership of a boundary—one might argue—is not an 
issue that a mereotopological theory must be able to settle. The theory only 
needs to explain what it means for two things to be connected. Which things are 
open and which are closed is a metaphysical question that, plausibly enough,
goes beyond the concerns of the theory. If the ocean is a closed body, then it 
can only touch the air if the latter is open. If it isn’t, then it can only touch the 
air if the latter is closed. And if both water and air are open, they cannot truly 
touch, though they may touch a closed piece of land. That’s all the theory says, 
and there is no reason to think that the theory is wrong just because it is difficult 
to classify actual things into open and closed. 

One last problem is worth mentioning. Consider again the cutting of a solid
object in half. We have said that this process does not bring to light a new 
surface. But, of course, we can conceptualize a new, potential surface right 
there where the cut would be. In fact, we can conceptualize as many boundaries 
as we like, even in the absence of any corresponding discontinuity or qualitative 
heterogeneity among the parts. Think of John’s waist, the equator, the Mason- 
Dixon line between Maryland and Pennsylvania. As Smith (1995, 2001) has 
pointed out, such ‘fiat’ boundaries are a major ingredient of our picture of the 
world. Even the surfaces of ordinary objects, as we have seen, may involve a 
certain degree of fiat owing to the microscopic scatteredness of the underlying
stuff. Yet no fact of the matter can support the ownership of boundaries such 
as these by one side rather than the other, hence there is no point in deferring 
to a metaphysical theory of the extended entities at issue. Isn’t this enough to 
give rise to the demarcation puzzle? 

Once again the answer is in the negative. Fiat boundaries are not physical 
boundaries in potentia. They are not the boundaries that would envelop the 
interior parts to which they are associated in case those parts were actually cut 
off. To think so would take us back to the wrong topological model illustrated 
in Fig. 15.13, top. On a better model, fiat boundaries are just placeholders for 
genuine physical boundaries and the demarcation puzzle need not, therefore, 
lead to ontological anxiety. We can say that a boundary of this sort stands for 
two boundaries, one for each side. Or we can say that in drawing it we leave the 
question of its belongingness (hence the open/closed distinction) indeterminate. 
We do so because it is a question of no practical relevance. But precisely for 
this reason the indeterminacy is innocuous: it is pragmatic, perhaps semantic, 
not ontological—just like the sort of indeterminacy that afflicts the vagueness 
of parthood (Sec. 1.5). (It is in this sense that the diagram in Fig. 15.9 is partly 
indeterminate: in saying that x2 is externally connected to y, for example, we 
did not specify which of these two regions owns the boundary in the relevant 
contact area. Ditto for all other cases of external connection, as in Fig. 15.8, 
left.) 


2.4.2 Connection by coincidence. All of this, of course, is subject to 
controversy. If the foregoing remarks are found compelling, then the strength 
of KGEMT is vindicated and such theorems as (83) and (84) deliver a full and
correct understanding of ‘C’. If not, however, then KGEMT will be deemed
inadequate and the intended interpretation of ‘C’ remains unsettled. Are there 
any other options? We may distinguish two main alternatives, depending on 
whether or not a rejection of the open/closed distinction is taken to be compatible 
with a realist attitude towards the ontological status of boundaries. 

The realist option finds its best expression in those theories that attempt to 
provide a detailed reconstruction of the view Brentano put forward in reaction 
to Bolzano’s ‘monstrous doctrine’, as in Chisholm (1984, 1993) and Smith
(1997). According to this view, boundaries are genuine denizens of the world of 
spatial entities, but their lack of proper interior parts makes them peculiar in two 
important respects (Brentano 1976, part I). First, they can never exist except as 
belonging to entities of higher dimension. There are, in other words, no isolated 
points, lines, or surfaces, for boundaries are, in Chisholm’s terms, dependent 
entities. Second, and more to the point, insofar as boundaries are not possessed 
of divisible bulk, they do not occupy any space and can therefore share the same 
location with other boundaries. They can coincide, and the topological relation 
of external connection is to be explained, not via the open/closed opposition, 
but in terms of genuine boundary coincidence. Thus, we can speak of the 
surface of an object. But this single surface is to be recognized as being made 
up of two parts, two perfectly coinciding boundaries bounding the object and 
its complement, respectively. 

As is clear, a rigorous formulation of such theories is no straightforward 
business. For one thing, it is not immediately obvious how to formulate the 
dependence thesis, both because of the modal ramifications that a good theory of 
ontological dependence would require (see e.g. Correia 2005) and because the 
relevant notion of dimension is by itself hard to characterize mereotopologically 
(Chisholm 1984). Ignoring such complications, and assuming GEM, one can 
capture the gist of the thesis as follows (Smith 1996): 








(C.12) $SCx \wedge Bxy \rightarrow \exists z(SCz \wedge BPxz \wedge \neg\exists w\,Bzw)$    Dependence


where 
(87) $Bxy =_{df} Px(by)$    Boundary
(88) $BPxy =_{df} Bxy \wedge Pxy$    Boundary Part


In other words, every self-connected boundary is part of some self-connected 
entity which it bounds and which is not itself a boundary. (The restriction to 
self-connected entities is to avoid that (C.12) be trivially satisfied by a scattered 
z containing x as an isolated proper part.) Without the full mereological support 
of GEM, however, things are significantly more complex, among other reasons 
because of the apparent elusiveness of the self-connectedness predicate ‘SC’ 
(Sec. 2.3.3).

Secondly, and more to the point, a lot depends on how exactly one under- 
stands the relation of spatial coincidence invoked by such theories to explain 
the phenomenon of (external) connection. Chisholm and Smith treat it as an 
undefined primitive, suitably axiomatized so as to guarantee that coinciding 
entities have coinciding parts. (See also Smith and Varzi 2000 for a similar 
treatment of the relation of coincidence between fiat boundaries.) Alterna- 
tively, one can embed the theory of coincidence into a theory of spatial location 
broadly construed: to say that things coincide is to say that they literally share
the same location. Clearly, the choice between these two options is not just 
a matter of taste. Treating coincidence as a primitive is in principle compat- 
ible with different metaphysical conceptions of the nature of space, whereas 
the second option is best understood within the framework of a substantivalist 
(Newtonian) conception, i.e., a conception according to which space is an entity 
in its own right. In any event, it is apparent that both options yield theories 
that are not strictly mereotopological, since a third primitive—coincidence or 
location, respectively—needs to be brought into the picture to provide a full 
account of the connection relation. We shall not go into the details of the first 
option here, but we shall have more to say about the second option in Sec. 3. 
For the moment, let us just observe that construing coincidence explicitly in 
terms of spatial co-location amounts to a partial reduction of topology to mere- 
ology: connection between entities of a kind (space occupiers) is reduced to 
overlap between entities of a different kind (their spatial receptacles), as per the 
following principle: 


(89) x is connected to y if and only if the region occupied by x
overlaps the region occupied by y. 


This is by itself interesting, though we are obviously left with the task of provid- 
ing an account of the topology of space as such. And if the account is to match 
the strength of a theory such as KGEMT, then the open/closed distinction 
will at least be partially preserved. It will be obliterated from the Brentanian 
realm of space occupiers, but space itself would be Bolzanian. (Compare the 
initial worry: we can grasp how the distinction applies to ideal manifolds such 
as the real line or Euclidean space; it is when it comes to the realm of ordinary 
objects that their classification into ‘open’ and ‘closed’ is problematic.) 


2.4.3 Omitting boundaries. The alternative route is to avoid the 
puzzles raised by the open/closed distinction by dismissing boundary talk alto- 
gether. This is the anti-realist option. 

Philosophically, this route is often motivated on its own grounds, for instance 
because of the dubious ontological status of boundaries vis-à-vis the micro- 
scopic analysis of the physical world (Stroll 1988), or because of their suspect 
nature qua lower-dimensional entities (Zimmerman 1996b). In the context of 
formal theories, however, the main motivation for doing away with boundaries 
is precisely the rejection of the open/closed distinction vis-à-vis common sense.
To use an example from Gotts et al. (1996, p. 57), Fig. 15.14 depicts a disc with
and without its boundary, and with just part of its boundary. Of course, the
depiction has to show the boundary as having some finite thickness, which strictly
speaking it does not possess. But this is the very point that appears counter to
common sense: all three discs, if superimposed, would cover exactly the same
area; yet the second disc includes unextended parts that the others do not,
while the third includes some that the first does not. Such discriminations—it
is argued—are not warranted.

Figure 15.14. Open, closed, and semi-closed discs with the same area.

There are radical as well as moderate variants of this view. The radical 
variants are represented by those theories that follow Whitehead (1929) in doing 
away with all boundaries. This amounts to assuming the boundarylessness 
axiom (C.5) in its full strength: everything has interior proper parts. The 
moderate variants, by contrast, only assume some weaker version of the axiom 
in which the variable is suitably restricted so as to range over entities of a certain 
sort: 





(C.5φ) $\varphi x \rightarrow \exists y\,IPPyx$    Restricted Boundarylessness


In particular, relative to our present concerns it is natural to construe ‘φ’ as a
distinguished property of all concrete spatial bodies. This would allow for the 
possibility that space as such include points and other boundary-like elements, 
which means that the open/closed distinction would be partially preserved. But 
as we have just seen, restricting the distinction in this way may be enough to 
bypass the intuitive puzzles that it raises, so this may well be a good compro- 
mise. For example, Cartwright (1975) holds that concrete spatial bodies are 
the material content of (regular) open regions of space, connection relations 
between the former being explained in terms of overlap relations between the 
closures of the latter: 


(90) x is connected to y if and only if the closure of the region
occupied by x overlaps the closure of the region occupied by y. 


As a mereotopological theory, this is of course another hybrid—just as the 
theory behind (89)—for it requires an explicit treatment of locative relations. 
Still, there is no question that (90) allows for a systematic boundary-free account 
of the mereotopology of concrete spatial bodies. (The real challenge, rather, 
is to justify the claim that only some regions are receptacles, e.g., only open 
regular regions; see Hudson 2002.)
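
The difference between the overlap criterion (89) and the closure-overlap criterion (90) shows up as soon as bodies are taken, with Cartwright, to occupy open regions. The sketch below uses one-dimensional open intervals as regions; the body names and the assignment `region_of` are hypothetical, introduced only for the illustration.

```python
def regions_overlap(r, s):
    """Overlap of two open intervals r = (a, b) and s = (c, d)."""
    return max(r[0], s[0]) < min(r[1], s[1])


def closures_overlap(r, s):
    """Overlap of the closed intervals [a, b] and [c, d]."""
    return max(r[0], s[0]) <= min(r[1], s[1])


# Hypothetical location assignment: each body occupies an open interval.
region_of = {
    "left_block": (0.0, 1.0),
    "right_block": (1.0, 2.0),
    "far_block": (3.0, 4.0),
}


def connected_89(x, y):
    """Connection as overlap of the occupied regions, as in (89)."""
    return regions_overlap(region_of[x], region_of[y])


def connected_90(x, y):
    """Connection as overlap of the closures of the occupied regions, (90)."""
    return closures_overlap(region_of[x], region_of[y])
```

Two abutting blocks come out disconnected under (89) applied to their open receptacles but connected under (90), while a block that lies at a distance is disconnected on both criteria. No boundary is ever attributed to the bodies themselves; the closures belong to the regions.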

Let us focus on the radical variants. We have seen that positing (C.5) is a 
necessary move for any reductive mereotopology based on the converse
monotonicity axiom (C.4), and it is a fact that most theories that accept one axiom
accept the other as well. But let us put that aside for a moment and let us just
focus on (C.5). Where X is any theory including MT, let BX be the corre- 
sponding boundaryless extension obtained by adding this axiom. What sort of 
mereotopology do we get? 

As it turns out, the number of options is significantly constrained, both mere- 
ologically and topologically. For example, surely X cannot be atomistic, since 
(C.5) implies the atomlessness axiom (P.7). So any model of BX is perforce 
infinitary. And surely the interaction between compositional and decomposi- 
tional principles will have to be carefully re-examined. In particular, it is easy 
to verify that in BMT the weak supplementation principle (P.4) is incompatible 
with the unrestricted fusion axiom (P.13) and, more generally, with any version 
of the strong fusion axiom (P.13φ) in which the condition ‘φ’ is satisfied by all
interior and tangential proper parts of any given thing. For suppose we allow 
for such fusions. Then every entity would have an interior as well as a closure, 
and the following would hold: 


(91) $PP(ix)(cx)$.


By (P.4), this would imply 





(92) $\exists z(Pz(cx) \wedge \neg Oz(ix))$,


which in turn would imply 





(93) $\exists z\,Pz(bx)$,


contradicting (C.5). Thus, BMT + (P.4) + (P.13) is inconsistent, as is any
theory BX including (P.4) along with (P.13φ) with ‘φ’ as indicated. This is
not surprising, since the whole point of going boundary-free is, in the present 
context, to avoid the open/closed distinction, hence the distinction between 
interiors and closures reflected in (91). However, this means that (C.5) prevents 
the formulation of any reasonably strong theory unless we are willing to give up 
weak supplementation, and this may certainly be regarded as a major drawback 
of the approach. 

In fact, one may consider both options here. One may (i) regard the compo- 
sitional weakness of the theory as a necessary price to pay to preserve mereo- 
logical supplementation and avoid the topological conundrums surrounding the 
open/closed distinction. But one may also (ii) go for a stronger theory with gen- 
eralized or even unrestricted fusions, dismissing the conundrums precisely by 
forgoing supplementation. After all, it could be argued that the open/closed dis- 
tinction is problematic only insofar as it is cashed out in terms of a discrimination 
between entities that do and entities that do not possess their boundaries, and in 
the absence of boundaries the discrimination dissolves. In the literature the first 
option is more widespread, its closest representative being the highly influential
Region Connection Calculus (RCC), which originated with Randell et al. (1992) as
a reductive extension of BMT + (P.5) with binary sums and complements.
But the second option, which is closer to Whitehead’s original approach, is also 
well represented, as evidenced by Clarke (1981), Biacino and Gerla (1991), and 
Asher and Vieu (1995) inter alia (the first two admitting unrestricted fusion, the 
latter admitting fusions of interior parts). Indeed, we have seen in Sec. 1.4.2 that 
the idea of restricting mereological fusion in order to avoid undesired entities is 
by itself suspicious. If we agree with the thought that a fusion is nothing over 
and above the things that compose it, then the intuitive problems raised by the 
open/closed distinction are hardly solved by eschewing formal commitment to 
such things as interiors and closures, i.e., fusions of interior and of tangential
parts. For such parts are all there already (and, of course, one needs IPP and 
TPP to be distinct in order to state (C.5) in the first place). In this sense, option 
(ii) might be regarded as preferable on philosophical grounds, though the failure 
of supplementation would remain a hindrance. 

Unfortunately, all of these theories include the monotonicity axiom as well as 
its converse, (C.4), i.e., they are all of the reductive sort, which makes it difficult 
to assess their relative pros and cons vis-à-vis the two options in question. In 
fact, not only do such theories include (C.4) and (C.5); they also rest on a sui 
generis characterization of the fusion operator in which C takes over the role 
of O, which makes it difficult to compare them to KGEMT. For example, in
Clarke’s theory, which goes as far as to include analogues of the Kuratowski 
axioms, the unrestricted fusion principle does not equal (P.13) but, rather, the 
following schema: 











(C.13) $\exists w\,\varphi w \rightarrow \exists z\,\forall w\,(Cwz \leftrightarrow \exists v\,(\varphi v \land Cwv))$    Topological Fusion


This leads to a correspondingly sui generis fusion operator: 





(94) $\Sigma^{*}x\,\varphi x =_{df} \iota z\,\forall w\,(Cwz \leftrightarrow \exists v\,(\varphi v \land Cwv))$    fusion*


(Recall that RMT treats C as extensional: see (73).) And it is easily checked 
that $\Sigma^{*}$ does not coincide with the operator $\Sigma$ defined in (39): if parthood
reduces to enclosure, then the interior of a closed entity x qualifies as the fusion 
of x’s proper parts, but not as their fusion*. (This is because in the absence 
of boundaries the interior is sure to overlap any y that x overlaps, and vice 
versa, though it will be disconnected from any z to which x is externally
connected; see Fig. 15.15). This amendment is plausible enough. But it means 
that the mereotopological operators defined in (40)—(44) and in (79)-(82) must 
be revised accordingly, and at the moment there is no systematic comparison 
between the behavior of such revised operators in a boundary-free theory and 
the behavior of the original operators in a boundary-tolerant theory. Just to give 
an example, note that re-defining ‘∼’ in terms of $\Sigma^{*}$:

Figure 15.15. The interior of a closed disc x is a fusion, but not a fusion*, of its parts.


(95) $\sim^{*}x =_{df} \Sigma^{*}z\,Dzx$    complement*


implies that nothing is connected to its own complement and, therefore, that 
the universe is bound to be disconnected. Of course we can rely on a different 
notion of complement (as suggested in Randell et al. 1992, p. 168), or one can 
change the definition of self-connectedness in such a way as to avoid at least 
the latter consequence (as suggested in Clarke 1985, p. 69). But this is playing 
with definitions. The ‘old’ notions, when revisited in terms of (94), continue 
to make good sense no matter how we change the official nomenclature, so we 
can hardly leave it at that. 

Perhaps the best way to assess the strengths and weaknesses of these theories 
is to note that the departure from the ordinary conception of fusion affects the 
very distinction between open and closed entities. Consider the following 
variants of (79) and (85): 


(96) $i^{*}x =_{df} \Sigma^{*}z\,\forall y\,(Czy \rightarrow Oxy)$    interior*
(97) $OP^{*}x =_{df} x = i^{*}x$    Open*


It is easily checked that any theory at least as strong as BRMT + (C.13) has 
the following theorem: 


(98) $OP^{*}x \rightarrow \neg ECxy$.


Thus, open* entities never touch anything. It is only closed* and semi-closed* 
entities (defined similarly) that can touch something without sharing any parts.¹²
And if the open/closed distinction is replaced by the open*/closed* distinction, 
then the intuitive import of the relevant misgivings is up for grabs, and the 
choice between a conservative attitude towards mereological supplementation 
(option (i)) and a liberal attitude towards mereological composition (option (ii)) 
calls for independent thinking. We are no longer dealing with a partitioning 
of the domain into entities that do and entities that do not possess a boundary; 





12 Actually, Clarke’s definition of the closure operator does not exactly parallel (81). In our notation it reads 
as follows: 


$c^{*}x =_{df} \Sigma^{*}z\,\neg Cz(i^{*}(\sim^{*}x))$.


However, this peculiarity does not affect the main point made in the text.

we are dealing with a partitioning into entities that do and entities that do not
connect externally. 

Be that as it may, all of this suggests that a thorough comparison between 
these two strategies for construing boundaryless mereotopologies is a challeng- 
ing task (Cohn and Varzi 2003). What is clear is that the strategies are mutually 
incompatible in spite of their common motivation and this, in all fairness to 
KGEMT, is disturbing. No matter how one feels about subtracting or adding 
elements to the domain, there is something puzzling in the thought that a topological
‘monstrosity’ should be cured through mereological surgery. Indeed,
philosophically this puzzling feature is especially striking when it comes to 
explaining the intended interpretation of these theories. As it turns out, both 
can be modeled on a domain with a standard point-set topology, interpreting 
‘C’ as in (99) for type-(i) theories, and as in (100) for type-(ii) theories: 


(99) x is connected to y if and only if the closure of x and the
closure of y have a point in common. 

(100) x is connected to y if and only if x and y have a point in 
common. 


(See e.g. Randell et al. 1992, p. 167, Gotts 1996b and Pratt and Schoop 2000 
vs. Clarke 1981, p. 205, Biacino and Gerla 1991, and Asher and Vieu 1995, 
respectively). There is, of course, nothing wrong with models of this sort
when it comes to proving the consistency or even the completeness of such 
theories. And there would be nothing wrong with (99) and (100) as genuine 
models if we were dealing with boundaryless theories of the moderate sort, as 
seen above. It is disturbing, however, that one can hardly do any better when
it comes to theories that are meant to be radically eliminativist, that is, when it
comes to explaining how contact relations may obtain in a world that is truly
lacking the topological glue provided by points, lines, and surfaces even in the 
realm of pure space. (Whether one can do better is an open question. For 
example, with reference to type-(i) theories, Bennett 1996a suggests that RCC 
can be interpreted by encoding it into the bimodal propositional logic
$S4_u$, though the encoding is imperfect, as shown in Aiello 2000, and its natural
canonical model is itself topological, as evidenced in Renz 1998 and Nutt 1999.
Likewise, Stell and Worboys 1997, Stell 2000, and Düntsch et al. 2001 provide
algebraic interpretations of RCC that dispense with any reference to point- 
based topologies, but the ontological transparency of such interpretations is 
itself a delicate matter.) 
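
The contrast between readings (99) and (100) can be made concrete with a small sketch. It assumes, purely for illustration, that regions are non-empty intervals of the real line; the names `Interval`, `connected_99` and `connected_100` are hypothetical and do not come from any of the theories cited.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """A non-empty interval of the real line (illustrative stand-in for a region)."""
    lo: float
    hi: float
    lo_closed: bool = False
    hi_closed: bool = False

    def closure(self) -> "Interval":
        # The closure adds both endpoints.
        return Interval(self.lo, self.hi, True, True)

    def contains(self, p: float) -> bool:
        above = p > self.lo or (p == self.lo and self.lo_closed)
        below = p < self.hi or (p == self.hi and self.hi_closed)
        return above and below


def share_point(a: Interval, b: Interval) -> bool:
    """True if the two intervals have at least one point in common."""
    lo, hi = max(a.lo, b.lo), min(a.hi, b.hi)
    if lo < hi:
        return True   # they overlap on a whole stretch
    if lo > hi:
        return False  # there is a gap between them
    # lo == hi: the only candidate is a single point, which must lie in both.
    return a.contains(lo) and b.contains(lo)


def connected_99(a: Interval, b: Interval) -> bool:
    """Reading (99): the closures have a point in common."""
    return share_point(a.closure(), b.closure())


def connected_100(a: Interval, b: Interval) -> bool:
    """Reading (100): the regions themselves have a point in common."""
    return share_point(a, b)
```

On this toy model the open intervals (0, 1) and (1, 2) count as connected under (99) but not under (100), which is the divergence between type-(i) and type-(ii) theories discussed in the text.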


2.5 Expressivity and ontology 


Let us conclude this philosophical excursus on topology with some general 
considerations concerning the delicate interplay between the expressive power 
of a theory and its ontological presuppositions. Regardless of whether we rely
on the full strength of KGEMT or on theories of weaker import, we have
seen that the move from mereology to mereotopology represents an important 
step towards the formulation of an adequate model of our spatial competence. 
The mereological distinction between a whole and its parts is crucial, but so is 
the distinction between interior and tangential proper parts, or the distinction 
between a connected whole and a scattered one, and such distinctions are intrin- 
sically topological. This is not to say that mereotopological concepts exhaust 
the picture; geometric and morphological considerations also play a significant 
role when it comes to practical matters. But there is no question that a great 
deal of our spatial competence is grounded on our capacity to ‘parse’ the world 
in terms of parthood and connection relationships. The interesting question, 
rather, is whether such relationships can be fully captured by the behavior of 
the binary predicates ‘P’ and ‘C’ when characterized by means of formal principles
of the sort that we have been discussing up to now, and to what extent the
answer depends on one’s specific views when it comes to matters of ontology. 
Here are some indicative examples. 


2.5.1 Modes of connection. So far we have followed the familiar 
course of explaining connection in general terms, i.e., irrespective of the size 
(dimension) of the relevant contact area. In introducing that notion, however, 
we have mentioned the possibility of distinguishing connection ties of different 
strength—e.g., ties involving a single point of contact (as between Colorado 
and Arizona), an extended portion of a common boundary (France and Ger- 
many), or an entire boundary (Vatican and Italy). Even without bringing in 
boundaries, one may want to draw such distinctions to fully grasp, for example, 
the difference between a whole consisting of two spheres that barely touch and
the whole consisting of two halves of a single sphere: both wholes are self- 
connected, but the second is surely more firmly connected than the first. And 
these distinctions have ramifications. For example, since tangential parthood 
is defined in terms of external connection, we may want to distinguish those 
proper parts that barely touch the exterior from those that firmly touch it—and 
so on. Moreover, the number of distinctions grows with the dimensionality of 
the entities we consider. Fig. 15.16 illustrates the four main patterns of external 
connection (no overlap) that can be distinguished in 2D space. But in 3D we 
might want to further distinguish, for example, two cubes barely touching at 
a vertex, two cubes barely touching along an edge, two cubes touching along 
a face, and so on. Now, can all such distinctions be expressed in terms of the 
mereotopological primitives ‘P’ and ‘C’ (or just ‘C’, if one goes reductive)? 
As it turns out, within a sufficiently rich mereotopological theory such as 
KGEMT the answer is in the affirmative. To illustrate, with reference to 
Fig. 15.16 the difference between the first two cases can be explained as follows. 
In both cases, x and y are connected; but whereas in the first case this simply

Figure 15.16. Four patterns of external connection, from weakest to strongest.


means that one can go from x to y without ever going through the exterior of 
the connected sum x + y, in the second case it is also possible to go from x to
y without ever leaving the interior of x + y. More precisely, let us distinguish 
between the connectedness of a whole (‘SC’) and the connectedness of its 
interior, and let us say that an object is firmly self-connected just in case the 
latter condition holds: 


(101) $FSCx =_{df} SCx \land SC(ix)$    Firm Self-Connectedness


Then we can say that two entities are firmly connected when they have parts 
that add up to a firmly self-connected sum: 





(102) $FCxy =_{df} \exists w\,\exists z\,(Pwx \land Pzy \land FSC(w + z))$    Firm Connection


This captures the difference between the second case of Fig. 15.16, where 
the relevant connection relationship is firm, and the first, where it isn’t. The 
stronger connection patterns corresponding to the third and fourth cases can 
then be defined by reference to the complement of x + y: 


(103) $CCxy =_{df} FCxy \land \neg FCx(\sim(x + y))$    Complete Connection
(104) $PCxy =_{df} FCxy \land \neg Cx(\sim(x + y))$    Perfect Connection


(Strictly speaking, (103) and (104) would call for refinements, owing to the 
possibility that x and y have internal holes; see Cohn and Varzi 2003 for a 
more general picture.) On this basis, the generalization to spaces of higher 
dimensionality is not difficult. For instance, in 3D space the difference between 
two entities touching at a point and two entities touching along an edge can 
be described by further distinguishing two patterns of non-firm connection, 
depending on whether the common boundary is atomic or a self-connected 
composite (a line segment, or curve). 
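
Definitions (101) and (102) can be illustrated on a finite raster model. The sketch below assumes (this is not part of the source) that a region is a non-empty set of closed unit squares indexed by integer pairs. On that assumption the region is self-connected when its cells are linked through shared points, while its interior is connected only when they are linked through shared edges. All function names are hypothetical.

```python
from collections import deque
from itertools import product

Cell = tuple[int, int]

# Steps to cells sharing an edge, and to cells sharing at least a corner.
EDGE_STEPS = [(1, 0), (-1, 0), (0, 1), (0, -1)]
ALL_STEPS = [(dx, dy) for dx, dy in product((-1, 0, 1), repeat=2) if (dx, dy) != (0, 0)]


def is_connected_set(cells: set[Cell], steps) -> bool:
    """Breadth-first search: is every cell reachable from one start cell via `steps`?"""
    if not cells:
        return False
    start = next(iter(cells))
    seen = {start}
    queue = deque([start])
    while queue:
        x, y = queue.popleft()
        for dx, dy in steps:
            nxt = (x + dx, y + dy)
            if nxt in cells and nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen == cells


def self_connected(cells: set[Cell]) -> bool:
    """SC: the union of closed cells is connected (corner contact suffices)."""
    return is_connected_set(cells, ALL_STEPS)


def firmly_self_connected(cells: set[Cell]) -> bool:
    """(101): the region and its interior are both connected (edge contact needed)."""
    return self_connected(cells) and is_connected_set(cells, EDGE_STEPS)


def connected(a: set[Cell], b: set[Cell]) -> bool:
    """C: some closed cell of a shares at least a point with some closed cell of b."""
    return any(abs(ax - bx) <= 1 and abs(ay - by) <= 1 for ax, ay in a for bx, by in b)


def firmly_connected(a: set[Cell], b: set[Cell]) -> bool:
    """(102): some part of a and some part of b add up to a firmly self-connected sum.
    In this raster model it is enough to try single cells as the parts."""
    return any(firmly_self_connected({w, z}) for w in a for z in b)
```

Two squares meeting at a corner, such as `{(0, 0)}` and `{(1, 1)}`, are connected but not firmly connected; two squares sharing an edge are both. This mirrors the first two patterns of Fig. 15.16.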

Now, all of this is easy in KGEMT. When it comes to weaker theo- 
ries, however, things get more difficult. For the definitions above involve the 
self-connectedness predicate along with the interior, complement, and fusion 
operators, all of which may be absent in a theory deprived of the necessary compositional
strength. For instance, in a boundaryless theory with no open/closed
(or open*/closed*) distinction, we can make up for the lack of the interior operator
by redefining firm self-connectedness as follows (Bennett 1996b, p. 345): 


(101′) $FSCx =_{df} \forall y\,(IPPyx \rightarrow \exists z\,(IPPzx \land Pyz \land SCz))$





This makes it possible to go ahead with definitions (102)-(104) and capture the 
relevant distinctions in the 2D case. (See also Borgo et al. 1996 for a boundary- 
less theory in which FC is treated as primitive.) Yet it is not clear how one can 
capture the further distinctions available in 3D and higher-dimensional spaces 
without appealing explicitly to the dimensionality of the relevant boundaries 
or to special assumptions concerning the structure of space (see Gotts 1994a). 
And of course things get worse in a theory that lacks the complementation prin- 
ciple (P.6), for in that case, as we have seen, the notion of self-connectedness 
is already problematic. In short, the expressive power of a theory depends cru- 
cially on the underlying ontology, which in turn is reflected in the strength of 
the relevant compositional and decompositional principles. 

Similar considerations apply to further conceptual distinctions that may 
be deemed relevant in the context of spatial reasoning. Consider again the 
common-sense notion of contact exemplified by such statements as (55): the 
table is touching the wall. We have said that this notion is not topological but 
metric. This is actually true in KGEMT, assuming that both entities—table 
and wall—are treated alike, i.e., as both closed or both open. For KGEMT 
has the following two theorems: 


(105) $ECxy \rightarrow (CLx \rightarrow \neg CLy)$
(106) $ECxy \rightarrow (OPx \rightarrow \neg OPy)$


By contrast, in a boundaryless theory the picture is different. For example, 
insofar as RCC admits of models satisfying (99), it can treat the table and the
wall as genuinely connected as long as their closures overlap. This may be 
at odds with physics, but it captures the common-sense intuition. (Ditto for 
a moderate variant such as Cartwright’s—see again (90).) So which of these 
accounts is better depends on the seriousness with which we handle the spatial 
ontology of common sense. On the other hand, none of this should be taken 
to imply that KGEMT lacks the resources to account for the loose notion 
of connection countenanced by common sense. Suppose we understand this 
notion in the following sense: the table is touching the wall insofar as nothing 
can be squeezed between them. The metric flavor of this notion lies in its 
modal ingredient: to say that nothing can be squeezed between two objects 
is to say that they are ‘vanishingly close’ to each other—that their relative 
distance is arbitrarily small. We can, however, define a predicate that captures 
this ingredient in mereotopological terms: we can say that two objects are 
at least loosely connected in the relevant sense when one is connected to the 
closure of every open neighborhood of the other:

Figure 15.17. An interior tangential part?


(107) $LCxy =_{df} \forall z\,(OPz \land Pyz \rightarrow Cx(cz))$    Loose Connection


(See Asher and Vieu 1995 for a similar definition.) This relation captures 
the intuition that nothing can lie between two entities that touch, even when 
those entities are closed. And surely enough the definition is consistent with 
KGEMT. So it is not that KGEMT lacks the conceptual resources to do 
justice to common sense. It is, rather, that the relation of loose connection is 
bound to be empty in those models of KGEMT where the open entities form 
a dense ordering, i.e., where the following holds: 





(C.14) $OPx \land OPy \land PPxy \rightarrow \exists z\,(OPz \land PPxz \land PPzy)$    Open Density


And whether all models should satisfy this axiom is, on the face of it, a question 
about the ontological make-up of the world. (One can argue that as long as the 
open/closed distinction holds, common sense only requires a denial of (C.14) 
when the variables are restricted to the range of ordinary entities, as opposed 
to their spatial receptacles. For a full account of the mereotopology of dis- 
crete space, see Galton 1999 and 2000, §2.6, and the generalizations in Li and 
Ying 2004.) 


2.5.2 Dimensionality. Consider a second example. We have seen 
that in a boundaryless theory everything must have interior proper parts, as 
per (C.5). However, the notion of an interior part is itself, in a way, a relative 
one, depending on the dimensionality of the space we are considering (Galton 
2004). In 1D space, the middle portion of a line segment y, or any portion 
x that does not extend to y’s extremities, would qualify as an interior part of 
y, so y itself would satisfy the axiom. As soon as y is embedded in 2D (or 
higher) space, however, all of its parts would be tangential, as they can all be 
connected to things to which y itself would be externally connected—e.g. a disc 
z (Fig. 15.17). Thus, in such higher spaces y would not satisfy (C.5). 

Now, this is not by itself a disturbing fact if we take boundaryless theories 
to reflect a general intuition to the effect that all entities in the domain are of 
equal dimensionality. On the other hand, this very intuition is philosophically 
problematic. It is puzzling that whether something exists (a line segment,
for instance) should depend on the dimension of space, for one may want to
declare one’s ontological commitments while remaining neutral with respect
to the difficult question of the dimensionality of space. Indeed, this becomes 
a necessity if one does not have the resources to address such a question in 
the first place. In ordinary point-set topology, one can say that a domain is of 
dimension $\le n$ if and only if every open cover $O_1, \dots, O_k$ can be refined to a
closed cover $C_1, \dots, C_k$ such that every point occurs in at most $n + 1$ of the $C_i$s.
This characterization is not available in mereotopology unless one goes second- 
order, and surely it cannot be mimicked in a boundaryless theory that eschews 
the open/closed distinction. Gotts (1994b) and Bennett (1996b) suggest a way 
to bypass such difficulties by means of a different characterization, which only 
requires P (or rather: E) to be closed under the operations of complementation 
and binary sum, but the adequacy of such proposals is an open question. (By 
contrast, Galton 1996 shows that an adequate characterization is available in a 
boundary-based theory such as KGEMT, or weaker variants thereof; see also
the layered mereotopologies of Donnelly and Smith 2003 and Donnelly 2004). 
So, again, we see here how the expressiveness of the theory depends crucially 
on the ontology it countenances, which in turn may be hard to specify within 
the theory itself. 


2.5.3 Counting the holes. Finally, consider an example that lies some- 
where between the previous ones—the notion of a simply connected whole, i.e., 
intuitively, a whole with no holes. Topologically, as also from the perspective of 
common sense, this notion is just as significant as the notion of a self-connected 
whole. Just as there is a big difference between an apple and a bikini, there 
is a big difference between an apple and a donut. Indeed, topology is often 
defined, intuitively, as a sort of rubber-sheet geometry that focuses precisely 
on these two differences, ignoring shape, size, and all sorts of other spatial 
properties that concern geometry broadly understood. Now, we have seen that 
self-connectedness is easily defined in mereotopological terms, though the ad- 
equacy of the definition presupposes the complementation principle (P.6) and 
is not, therefore, entirely neutral from an ontological standpoint. What about 
simple connectedness? More generally, the definition of self-connectedness 
can be extended so as to classify every object in terms of the (maximum) number
of self-connected parts of which it consists, which is sometimes called its
‘separation number’. This can be done by defining a corresponding sequence
of predicates, one for each positive integer, as follows:¹³


(108) $SN_{1}x =_{df} SCx$    Separation$_1$
(109) $SN_{n+1}x =_{df} \exists y\,\exists z\,(x = y + z \land \neg Cyz \land SCy \land SN_{n}z)$    Separation$_{n+1}$








13 Here ‘n + 1’ indicates arithmetical addition, as opposed to mereological summation.

Figure 15.18. Simple connectedness and genus classification in 2D.


Can we also provide a mereotopological characterization of the genus of an 
object, so as to classify it in terms of the number of holes it has? As it turns out, 
the answer to this question is in the affirmative, again provided that we assume 
the complementation principle (P.6), but this affirmative answer has interesting 
ontological ramifications. 
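
The separation-number predicates (108) and (109) have a direct computational reading on a finite model. The sketch below assumes, for illustration only, that a region is a set of closed unit squares on an integer grid and that two cells are connected when they share at least a point. Under that assumption, the n for which $SN_n$ holds is simply the number of maximal self-connected components. The function names are hypothetical.

```python
from collections import deque


def components(cells):
    """Split a set of grid cells into its maximal self-connected parts.
    Cells are (x, y) integer pairs; contact at a corner counts as connection."""
    remaining = set(cells)
    parts = []
    while remaining:
        start = remaining.pop()
        part = {start}
        queue = deque([start])
        while queue:
            x, y = queue.popleft()
            for dx in (-1, 0, 1):
                for dy in (-1, 0, 1):
                    nxt = (x + dx, y + dy)
                    if nxt in remaining:
                        remaining.remove(nxt)
                        part.add(nxt)
                        queue.append(nxt)
        parts.append(part)
    return parts


def separation_number(cells):
    """The n such that SN_n holds of the region, on the raster reading above."""
    return len(components(cells))
```

A one-piece region (the 'apple') gets separation number 1, and a region made of two pieces that do not touch (the 'bikini') gets 2, matching the recursive clause (109), which peels off one self-connected part at a time.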

Here is how the basic account goes (Gotts 1994a). Let us say that something 
has dissectivity n (n a positive integer) just in case it is self-connected and 
can be decomposed into n + 2 self-connected, disjoint parts with the following 
property: two of them, y and z, are not connected, whereas all the others are 
connected to both of them but disconnected from one another. Formally, this 
amounts to the requirement that there be two disconnected parts y and z that 
are connected by a remainder whose separation number is n (‘connected by’ in 
the sense of (59)): 





(110) $DS_{n}x =_{df} SCx \land \exists y\,\exists z\,\exists w\,(x - (y + z) = w$
$\land\ BCyzw \land \neg Cyz \land SN_{n}w)$    Dissectivity$_n$


Then we can say that something is simply connected just in case its maximum 
dissectivity equals 1: 


(111) $SSCx =_{df} DS_{1}x \land \neg DS_{2}x$    Simple Connectedness


With reference to Fig. 15.18, for example, only the left pattern is simply con- 
nected, for the others have higher dissectivity numbers. This is how it should 
be, and it can be checked that the definition would yield the right classification 
also with reference to objects of different dimensionality. A donut has dissec- 
tivity 2, so it is not simply connected; a solid ball is. Indeed, we can now use 
(110) also to provide a mereotopological characterization of the genus of an 
object. Something is of genus n, i.e., has n holes (n > 0), if and only if its 
maximum dissectivity is n + 1: 


(112) $G_{n}x =_{df} DS_{n+1}x \land \neg DS_{n+2}x$    Genus$_n$


Again, Fig. 15.18 illustrates the definition in the 2D case, but it can be checked 
that (112) yields the correct classification also for objects of higher dimension- 
ality: a solid ball has genus 0, a donut has genus 1, a pretzel has genus 2, and 
so on.

Figure 15.19. A donut, a punctured block, and other deviant solids of the same genus.


So this is the basic account, which fully answers the questions above: pro- 
vided we work with a mereotopology that is closed under complementation and 
binary sums, we can define simple connectedness and, more generally, classify 
any object in terms of the number of holes it has. There are, however, two 
additional questions one may ask at this point. The first is whether this basic 
account can be refined so as to do justice to further distinctions that could be 
drawn in view of the dimensionality issues discussed above. We may, for ex- 
ample, want to tell a genuine donut from the deviant cases in Fig. 15.19, all 
of which have the same genus. And here, as one might expect, the answer 
depends more heavily on the strength of the theory. In KGEMT we can go 
quite far; in weaker theories we may not, as we may not be able to distinguish 
the various kinds of non-firm connection that are needed to operate the relevant 
discriminations. For example, Gotts (1994b, 1996a) has shown that the issue 
can eventually be settled within the boundaryless framework of RCC, but this 
result rests on various assumptions on the topology and dimensionality of the 
entities in the domain that cannot themselves be expressed in the language of the 
theory. More importantly, it rests on the interpretation of ‘C’ given in (99): two 
bodies are connected if and only if their closures have a point in common. We 
have seen that such an interpretation is of dubious legitimacy in a boundaryless 
ontology (as Gotts himself laments in 1996b). So, once again, we reach a point 
where the expressive power of a theory depends crucially on the ontological 
commitments that one is willing to make. 

The second question is whether the basic account can be refined so as to 
do justice to further distinctions that could be drawn in view of the various 
ways—and there are many—in which an object can be perforated. And here it 
appears that even a strong, boundary-based theory such as KGEMT may show
its limits. In fact, there is a sense in which the limits in question are not just 
the limits of the mereotopological approach of which the theory is an expression;
they are the limits of topology as a general theory of space. Let us focus on 
the 3D case. 

For one thing, we have been speaking of ‘holes’ in the sense of perforations, 
but we may also want to classify a self-connected object in terms of the number 
of its internal ‘cavities’. To some extent this is easy: on the assumption that the 
universe U is self-connected, it is sufficient to identify the number of internal
cavities with n − 1, where n is the separation number of the object’s complement.
In 2D space, this coincides with the genus of the object: there is no difference

Figure 15.20. Cavities, donut-cavities, and donuts with donut-cavities.


between a 2D perforation and a 2D cavity. In 3D the numbers may diverge: a 
solid donut has genus 1 but 0 cavities, since its complement is self-connected. 
Dropping the assumption on U, we can express this as follows: 


(113) $IC_{n}x =_{df} SCx \land \forall y\,(SCy \land IPPxy \rightarrow SN_{n+1}(y - x))$    Cavity$_n$


This definition works for every object in any dimension (except, of course, for 
U). But this is just the beginning. A cavity may come in different forms. It 
may be a solid cavity, so to say, but it may itself be donut-shaped. It may also 
have the shape of an irregular donut of the sort illustrated in Fig. 15.19. Or 
it may be ‘knotted’ in various fashions—as a trefoil knot, for instance, or a 
granny knot. Clearly such distinctions are not covered by (113). Moreover, 
consider an object with two donut-shaped cavities. The cavities may lie next 
to each other, so to say, or they may be interlocked like the rings of a necklace. 
Or consider an object with a perforation—a donut—which also has an internal, 
donut-shaped cavity. The perforation may go through the ‘hole’ in the cavity or 
it may lie next to it. All of these and many others are distinctions that are easily 
described in words just as they can easily be depicted (Fig. 15.20), and reflect 
significant differences in the spatial structure of the objects in question. It is far 
from clear, however, whether one can capture them in mereotopological terms. 
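
The basic cavity count (the separation number of the complement, minus one) can be sketched on a 2D raster, where cavities and perforations coincide. The sketch rests on assumptions that are not in the source: the object is a non-empty set of closed unit squares, the universe is the object's bounding box padded by one cell on every side (so that the outside forms a single piece), and complement cells count as connected only when they share an edge, since the complement of a closed region is open. The function name is hypothetical.

```python
from collections import deque


def count_cavities(cells):
    """Number of bounded complement components of a non-empty raster region."""
    xs = [x for x, _ in cells]
    ys = [y for _, y in cells]
    # Padded universe: one extra ring of cells around the bounding box.
    universe = {
        (x, y)
        for x in range(min(xs) - 1, max(xs) + 2)
        for y in range(min(ys) - 1, max(ys) + 2)
    }
    remaining = universe - set(cells)
    pieces = 0
    while remaining:
        pieces += 1
        queue = deque([remaining.pop()])
        while queue:
            x, y = queue.popleft()
            # Edge-adjacency only: the complement is treated as open.
            for dx, dy in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                nxt = (x + dx, y + dy)
                if nxt in remaining:
                    remaining.remove(nxt)
                    queue.append(nxt)
    # One piece is the outside; every further piece is a cavity.
    return pieces - 1
```

A count of this kind returns the same number for any two regions whose complements have equally many pieces. It is blind to the shape, knottedness or interlocking of the cavities, which is the limitation the text goes on to discuss.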

Secondly, a perforation may come in different forms, too. It can be straight or 
it can be knotted, and the knot may or may not wrap around another perforation, 
just as it may or may not go through the ‘hole’ of an internal, donut-shaped cav- 
ity. It can also branch in the middle, so as to have more than two openings. 
Indeed, it can branch in many different fashions, as it can ‘merge’ in various 
ways with internal cavities of various kinds. Again, all of these possibilities re- 
flect significant distinctions that are easily described in words and can easily be 
depicted, but it is far from clear whether one can give a proper characterization 
in mereotopological terms—even with the full strength of KGEMT. In some 
cases it is not even clear to what extent such a characterization should just paral- 
lel the standard topological account of such patterns. Standardly, for example, a 
block with two parallel, straight perforations is equivalent to a block with a sin- 
gle, Y-shaped perforation—both have genus 2 (and can be transformed into each 
other by mere elastic deformation). This much can be said in mereotopological 
terms, using (112) above. But here is where standard topological considerations

Figure 15.21. Same genus, different holes.

might be regarded as inadequate for a good description of the spatial structure of 
ordinary objects, and of the intuitions underlying our spatial reasoning broadly 
understood. The topological equivalence between such patterns—and between 
such patterns and many others; see Fig. 15.21—appears to deliver a partial ac- 
count of the relevant spatial structures, for the genus of an object only captures 
the intrinsic topology of the object, not the way it relates to the environment. 
To get a better picture it seems necessary to keep an eye on the holes, not just 
on the object. And this is obvious from the fact that in describing such patterns 
we tend to do so by describing the mereotopology of the holes and the way 
they relate to each other; we do not describe the objects themselves. We tend 
to treat holes as objects in their own right, as ‘negative objects’ about which 
we can say exactly the same sorts of thing we say about ordinary, ‘positive’ 
objects. And we count both sorts of objects in the same way: we count two 
straight perforations and one Y-shaped perforation. 

If this is correct, then there are two things one can say. One can say (and 
accept) that the limits of mereotopology vis-a-vis such fine-grained distinctions 
are just the limits of topology, mutatis mutandis. Or one can say that the limits 
in question reflect precise ontological assumptions concerning the domain of 
application of the theory, specifically a dismissive attitude towards the ontolog- 
ical status of holes. This is not to say that holes are left out of the picture. Surely 
any theory with unrestricted fusions has room for such things, for mereologi- 
cally speaking a hole is nothing but part of the object’s complement. Rather, 
the point is that mereotopology by itself says nothing specific about which parts 
of the complement qualify as holes. The boundaries of a hole simply cannot be 
determined by purely mereological or topological considerations. Of course, 
we have seen that mereotopology says nothing about the boundaries of material 
objects either. But draw such boundaries as you like, choose the objects you like:
unless you also draw the boundaries of their holes (if any) you cannot get a full
picture of the mereotopological structure of the objects themselves. And to
draw the boundaries of something is to confer ontological dignity on it.

In Casati and Varzi (1994, 1999) it is argued that this alternative way of
construing the limits of mereotopology has far-reaching consequences. Suppose we
take holes seriously: a hole in an object is something with well-defined
boundaries. Then the fine-grained distinctions mentioned above can be recovered
by looking at the mereotopological interplay between matter and void, at the
properties of the boundary where an object comes into contact with its holes.
More precisely, let the interface between two entities x and y be the product of
their boundaries:

(114) $x|y =_{df} bx \times by$    Interface

And let the internal skin of an object x be the interface between x and the fusion
of its holes. Using ‘H’ for the binary relation ‘is a hole in’, this can be defined
as follows:

(115) $sx =_{df} \sigma z\,\exists y(Hyx \wedge z = y|x)$    Skin

Figure 15.22. The objects of Fig. 15.21 have topologically different internal skins.


Then the distinctions in question are distinctions that reflect the mereotopology 
of an object’s skin. With reference to Fig. 15.21, for example, it can be checked 
that the skin of the doubly perforated block on the left is the disconnected sum 
of two cylinders, i.e., topologically, two spherical surfaces with two punctures 
each. By contrast, the block with a Y-shaped hole has a connected skin that is 
equivalent to a spherical surface with three punctures, while the other blocks 
have skins equivalent to a torus with two punctures and to a bitorus with one 
puncture, respectively (Fig. 15.22). (Note that a puncture is not a hole but a mere 
boundary. The surfaces of the objects in Fig. 15.21 do not have boundaries, yet 
their internal skins do—and that makes all the difference.) 

Now, in a boundaryless theory all of this is beyond reach. But in a sufficiently
strong boundary-based theory such as KGEMT the notion of an internal skin
is perfectly meaningful and well defined for every object of positive genus, and 
its mereotopological classification does not present any special challenge. This 
confirms once again the greater expressive power that comes with an ontological 
commitment to boundaries. It also shows, however, that such a commitment is 
not enough: the existential quantifier in definition (115) shows that an explicit 
commitment to holes is also needed. To the extent that the binary predicate ‘H’ is 
to be treated as a primitive, it is clear that this requires a step beyond KGEMT 
and its pure mereotopological extensions. (For an axiomatic treatment of ‘H’ 
and of its interplay with ‘P’ and ‘C’, see Casati and Varzi 1994, Appendix, and 
Varzi 1996b.) 
3. Location theories


Let us finally turn to the relation of spatial location. Intuitively, this is the relation
that holds between an entity and the spatial region that it occupies, and we have 
already seen that this relation can hardly be reduced to a chapter of mereology 
and/or topology. Even if it were—as someone inclined to favor a Leibnizian, 
relationist conception of space against its Newtonian, substantivalist foes would 
urge—methodological prudence would suggest that we regard the reduction as 
a theorem, not as a starting point, hence that the relation of location be treated 
as an independent primitive next to parthood and connection. Exactly how this 
relation should be characterized, and how it should interact with the principles
governing those other primitives, is precisely the sort of question that a good 
theory of location should aim to answer. 


3.1 Varieties of Location 


Before looking at the main options, the usual terminological caveats are in 
order. As with ‘part of’ and ‘connected to’, locative predicates have various 
meanings in ordinary language and it is important to be explicit. 

For one thing, we often speak so as to specify the location of an object by 
reference to another object, as opposed to a spatial region. Consider: 


(116) The biceps muscle is located in the arm. 
(117) The parking area is located next to the stadium. 
(118) The elevator is located inside the main building. 


Pretty clearly, such cases are of no special interest, as they reflect different 
ways of asserting mereotopological relations of the familiar sort: in (116) the 
locative predicate is just a variant of ‘part of’, in (117) it expresses the relation 
of external connection, and in (118) it stands for a relation of containment that 
can be cashed out in terms of interior proper parthood. Of course, establishing 
mereotopological relations may be an indirect way of specifying a genuine 
location: insofar as the biceps muscle is part of the arm, for instance, the 
muscle is bound to be located within the region occupied by the arm, though 
this is by itself an intuition that needs to be spelled out carefully (Sec. 3.3). 
Moreover, not every case of relative location can be explained in this fashion 
(Sec. 3.4). For the moment, let us just emphasize that the main concern of a 
theory of spatial location as we understand it here is with those cases in which 
an object’s location is specified directly, as in 


(119) The peak of Mount Everest is located at 27°59’ N 86°56’ E. 
(120) The new library will be located at this site. 


It is in this sense that the theory presupposes an ontology that includes spatial 
regions as bona fide entities in their own right. Indeed, we shall assume that 
the location primitive is a relation whose second argument can only be a region
of space, a ‘place’. (Never mind the question of what sort of linguistic
expressions can serve the purpose of referring to places, as opposed to things 
that have a place. Statements such as 


(121) The bookcase is located in the living room. 
(122) The United Nations are located in Manhattan. 


are somewhat ambiguous in this respect, but we may suppose that the context 
will always suffice to determine the intended meaning.) 

Secondly, there are various ways in which an object may be said to be located 
at a region. In a very loose sense, I am located at any region that is not completely
free of me (this room, or even the adjacent dining room if I am reaching a foot 
out of the doorway); in a stricter sense, I am only located at those regions that 
host me entirely (this room, if I am not reaching out of the doorway); and in 
a stricter sense still, I am only located at one region, namely the region that 
corresponds exactly to the volume of my body. In the following we shall use 
‘located at’ as designating the last, strictest relation; the weaker relations can be 
introduced by definition. More precisely, suppose we use ‘L’ for the predicate 
of exact location. Then three additional predicates can immediately be defined 
as follows (from Parsons 2006; see footnote 14):

(123) $GLxy =_{df} \exists z(Ozy \wedge Lxz)$    Generic Location
(124) $ELxy =_{df} \exists z(Pzy \wedge Lxz)$    Entire Location
(125) $ULxy =_{df} \exists z(Pyz \wedge Lxz)$    Ubiquitous Location





Thus, I am generically, in fact entirely located in Manhattan, but not ubiqui- 
tously (or exactly) located there; I am generically, in fact ubiquitously located 
at the region occupied by my left arm, but not entirely (or exactly) located there; 
and if I reach an arm in my neighbor’s window, then I am generically, but neither 
entirely nor ubiquitously (let alone exactly) located at the region corresponding 
to her living room. As we shall see, under suitable conditions these predicates 
are interdefinable, so the choice of ‘L’ as a primitive is ultimately immaterial. 
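
The three defined predicates can be tried out on a toy model. The following Python sketch is only an illustration under stated assumptions: space is a handful of numbered cells, regions are nonempty sets of cells, parthood is set inclusion and overlap is nonempty intersection. The names `BODY`, `LEFT_ARM`, `MANHATTAN` and `LIVING_ROOM` are hypothetical stand-ins for the regions mentioned in the example, not part of the theory.

```python
from itertools import combinations

# Assumption: space is six numbered cells; a region is any nonempty set of cells.
CELLS = range(6)
REGIONS = [frozenset(c) for n in range(1, 7) for c in combinations(CELLS, n)]

def P(x, y):
    """Parthood between regions, modelled as set inclusion."""
    return x <= y

def O(x, y):
    """Overlap between regions, modelled as nonempty intersection."""
    return bool(x & y)

# Hypothetical regions for the example in the text.
BODY = frozenset({1, 2})             # exact location of the speaker
LEFT_ARM = frozenset({2})            # region occupied by the left arm
MANHATTAN = frozenset({0, 1, 2, 3})
LIVING_ROOM = frozenset({2, 5})      # the arm reaches into it

EXACT = {"me": BODY}                 # the primitive L, as a partial function

def L(x, y):
    return EXACT.get(x) == y

def GL(x, y):  # (123) generic location
    return any(O(z, y) and L(x, z) for z in REGIONS)

def EL(x, y):  # (124) entire location
    return any(P(z, y) and L(x, z) for z in REGIONS)

def UL(x, y):  # (125) ubiquitous location
    return any(P(y, z) and L(x, z) for z in REGIONS)

for name, region in [("Manhattan", MANHATTAN), ("left arm", LEFT_ARM),
                     ("living room", LIVING_ROOM)]:
    print(name, GL("me", region), EL("me", region), UL("me", region))
```

Because exact location is stored as a partial function, each existential quantifier has at most one witness, which is why the three predicates reduce to simple comparisons with `BODY` in this model.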

Finally, it goes without saying that the location of an object may change over
time. This would suggest treating ‘located at’, not as a binary predicate, but 
as a three-place predicate involving a spatial as well as a temporal argument 
(or as a temporally indexed binary predicate). However, we have seen that the 
same goes for parthood and connection: unless one accepts a radical form of 
mereotopological essentialism, an object may in principle change its parts or 
its topological relations without ceasing to exist. In the preceding sections we
have tried to keep things simple by treating ‘P’ and ‘C’ as binary predicates,
and we shall do the same with our location primitive ‘L’. In a way, this means
that we are assuming the relevant time to be fixed throughout. But one may
also consider an alternative reading, to the effect that the variables of the theory
range over four-dimensional entities extended in space-time. (See again the
brief discussion following (32), Sec. 1.3.2.) Not much of what follows depends
on the strategy one favors, but for simplicity we shall continue to speak of
the location of an object as a 3D region of space rather than (possibly) a 4D
region of space-time.

14 Parsons’s term for generic location is ‘weak location’, and his term for ubiquitous
location is ‘pervasive location’. Our different terminology is dictated merely by
notational convenience, in view of the predicates ‘whole location’ (WL) and
‘proper location’ (PL) introduced below.

Figure 15.23. Basic locative relations.


3.2 Basic Principles 


With these conventions in place, let us officially expand our formal language 
by adding the new binary predicate ‘L’, intuitively understood as the relation of 
exact location holding between an object and a region of space. To make this 
interpretation explicit, we may begin by assuming the following axiom: 


(L.1) $Lxy \wedge Lxz \rightarrow y = z$    Functionality


This guarantees that nothing can have more than one exact location, which is 
all that is needed to justify the definitions in (123)-(125) (Fig. 15.23). Indeed,
it is easy to see that in the presence of an extensional mereology, (L.1) has the 
following corollaries: 


(126) $Lxy \leftrightarrow (ELxy \wedge ULxy)$
(127) $ULxy \leftrightarrow (GLxy \wedge \forall z(Ozy \rightarrow GLxz))$
(128) $ELxy \leftrightarrow (GLxy \wedge \forall z(GLxz \rightarrow Ozy))$


Thus, although we have settled on the strictest possible primitive, the predicate 
‘L’, one could equally well settle on the weakest predicate, ‘GL’, and define 
the rest via (126)-(128). More precisely, if the mereological theory in the 
background is at least as strong as EM, it turns out that the L-based system 
defined by (L.1) plus (123)-(125) is equivalent to the GL-based system defined
by (126)-(128) plus the following: 





(L.2) $GLxy \rightarrow \exists z\,Lxz$    Exactness


(See Parsons 2006; for a different choice of primitives, compare also Perzanowski
1993.) Note that the conjunct ‘$GLxy$’ is redundant in (127) as long as O is
reflexive. However, this extra conjunct is needed in (128) unless one assumes
that everything is located somewhere:





(L.3) $\exists y\,Lxy$    Spatiality


This is clearly an assumption that reflects a substantive thesis (a central tenet 
of most nominalistic ontologies), so it is fair to keep it separate. 
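
The corollaries (126)-(128), and the remark about the extra conjunct in (128), can be checked by brute force on a small model. The sketch below is an illustration only, under assumptions that are not in the text: a three-cell space whose regions are the nonempty sets of cells, regions located at themselves, and three hypothetical objects treated as mereological atoms, one of which (`"ghost"`) has no location so that spatiality (L.3) fails.

```python
from itertools import combinations

# Assumption: a three-cell space; regions are the nonempty sets of cells.
CELLS = (0, 1, 2)
REGIONS = [frozenset(c) for n in (1, 2, 3) for c in combinations(CELLS, n)]
OBJECTS = ["cat", "mat", "ghost"]    # hypothetical; "ghost" has no location
DOMAIN = REGIONS + OBJECTS

# Exact location as a partial function, so (L.1) holds by construction.
# Regions are taken to be located at themselves.
LOC = {r: r for r in REGIONS}
LOC.update({"cat": frozenset({0}), "mat": frozenset({0, 1})})

def is_region(v):
    return isinstance(v, frozenset)

def P(x, y):
    """Parthood: inclusion between regions; objects are treated as atoms."""
    return x <= y if is_region(x) and is_region(y) else x == y

def O(x, y):
    """Overlap: shared cells between regions; an object overlaps only itself."""
    return bool(x & y) if is_region(x) and is_region(y) else x == y

def L(x, y):
    return x in LOC and LOC[x] == y

def GL(x, y):
    return any(O(z, y) and L(x, z) for z in DOMAIN)

def EL(x, y):
    return any(P(z, y) and L(x, z) for z in DOMAIN)

def UL(x, y):
    return any(P(y, z) and L(x, z) for z in DOMAIN)

def holds_everywhere(claim):
    return all(claim(x, y) for x in DOMAIN for y in DOMAIN)

c126 = holds_everywhere(lambda x, y: L(x, y) == (EL(x, y) and UL(x, y)))
c127 = holds_everywhere(lambda x, y: UL(x, y) == (
    GL(x, y) and all(GL(x, z) for z in DOMAIN if O(z, y))))
c128 = holds_everywhere(lambda x, y: EL(x, y) == (
    GL(x, y) and all(O(z, y) for z in DOMAIN if GL(x, z))))
# Dropping the GL conjunct from (128) fails for the unlocated "ghost".
c128_no_gl = holds_everywhere(lambda x, y: EL(x, y) == all(
    O(z, y) for z in DOMAIN if GL(x, z)))

print(c126, c127, c128, c128_no_gl)
```

A finite check of this kind is of course no proof of the corollaries; it merely shows how the unlocated object makes the right-hand side of (128) vacuously true once the conjunct is dropped.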

To be sure, there is a sense in which (L.1) may also be read as a substantive 
thesis: functionality is reasonable only to the extent that we are thinking of so- 
called ‘particular’ entities, entities such as material bodies or events, as opposed 
to ‘universal’ entities such as properties. That a material body cannot be in two 
places at once was already a central thesis of Aristotle’s theory of location 
(Morison 2002). But Aristotle also held the view that universals, too, exist 
in space and time: they exist wherever and whenever they are exemplified. 
Wisdom, for example, exists whenever and wherever there are wise people,
and whenever a wise person exists, wisdom exists in its entirety wherever that
person is located. Wisdom can therefore be multi-located, and the same goes
for all universals. Since this view is still very popular (Armstrong 1989), the 
functionality principle (L.1) would be objectionable. However, we can bypass 
this issue by taking ‘L’ to represent the location relation that is characteristic 
of particulars. In that sense, (L.1) is, if not a conceptual truth, a perfectly 
reasonable starting point. 

What else is needed in order to fix the intended meaning of ‘L’ in this sense? 
Since the idea is that every object must be located at a region, some restriction 
must be imposed on the second argument of the relation. This would be a trivial 
task if the language contained an explicit predicate for regions. However, we can 
make do without such a predicate and try to characterize regionhood directly in 
terms of suitable axioms on L. There are two options here, depending on whether 
we think that spatial regions are themselves entities located somewhere. If we 
think so, then the obvious thing to say is that such entities can only be located 
at themselves (Casati and Varzi 1999, p. 121): 


(L.4) $Lxy \rightarrow Lyy$    Conditional Reflexivity


This would immediately imply that no distinct regions can be exactly co-located,
i.e., effectively, located at each other:


(129) $Lxy \wedge Lzw \wedge Lyw \rightarrow y = w$

Moreover, given (L.1), conditional reflexivity would ensure that L is both
antisymmetric and transitive:


(130) $Lxy \wedge Lyx \rightarrow x = y$
(131) $Lxy \wedge Lyz \rightarrow Lxz$


It follows that relative to the sub-domain of regions L would behave as a partial 
ordering. By contrast, if we think that regions do not have a location—they are 
locations—then the obvious option is given by: 


(L.5) $Lxy \rightarrow \neg Lyz$    Conditional Emptiness


In this case, the restriction of L to the class of regions would again qualify as 
a partial ordering—a strict ordering—but only in a trivial sense: effectively, it 
would just collapse to the empty relation. 
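
The way functionality and conditional reflexivity jointly force antisymmetry and transitivity can be confirmed by exhaustive search over a small domain. The following Python sketch is an illustration under an assumed three-element domain; it enumerates every binary relation, keeps those satisfying (L.1) and (L.4), and tests (129)-(131) on each. It also checks that the only relation satisfying conditional reflexivity and conditional emptiness together is the empty one.

```python
from itertools import product

D = range(3)                              # assumed three-element domain
PAIRS = [(x, y) for x in D for y in D]

def all_relations():
    """Yield every binary relation on D as a set of pairs."""
    for bits in product((False, True), repeat=len(PAIRS)):
        yield {p for p, keep in zip(PAIRS, bits) if keep}

def functional(L):             # (L.1)
    return all(y == z for (x, y) in L for (u, z) in L if x == u)

def cond_reflexive(L):         # (L.4)
    return all((y, y) in L for (_, y) in L)

def cond_empty(L):             # (L.5)
    return all((y, z) not in L for (_, y) in L for z in D)

def no_colocated_regions(L):   # (129)
    return all(y == w for (_, y) in L for (_, w) in L if (y, w) in L)

def antisymmetric(L):          # (130)
    return all(x == y for (x, y) in L if (y, x) in L)

def transitive(L):             # (131)
    return all((x, z) in L for (x, y) in L for (u, z) in L if y == u)

models = [L for L in all_relations() if functional(L) and cond_reflexive(L)]
corollaries_hold = all(
    no_colocated_regions(L) and antisymmetric(L) and transitive(L)
    for L in models)
# Relations satisfying (L.4) and (L.5) together: only the empty one survives.
both_options = [L for L in models if cond_empty(L)]

print(len(models), corollaries_hold, both_options)
```

The search covers only a three-element domain, so it illustrates the derivation rather than replacing it: from Lxy, (L.4) gives Lyy, and (L.1) then identifies any further location of y with y itself.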

There is, arguably, no deep metaphysical issue behind these two options: 
both (L.4) and (L.5) are equally good stipulations, and the difference would 
disappear as soon as we focus on cases of proper location: 


(132) $PLxy =_{df} Lxy \wedge \neg Lyx$    Proper Location


However, there are some differences that are worth mentioning. For one thing, 
given (L.1), the first option makes it possible to define regionhood in a perfectly 
straightforward way: 





(133) $Ry =_{df} \exists x\,Lxy$    Region


(We are speaking of regions in a broad sense, including boundaries as limit 
cases.) By contrast, (L.5) would support this definition—and variants thereof— 
only on the assumption that there are no unoccupied regions, i.e., regions that 
fail to correspond to the location of some object or event. To put it differently, 
if all location is proper location, it is not possible to define regionhood unless 
the following principle is accepted: 





(L.6) $Ry \rightarrow \exists x\,PLxy$    Fullness


And philosophically this principle is just as controversial as the spatiality prin- 
ciple (L.3). (Among other reasons, one might want to allow for boundary- 
like regions while rejecting the existence of boundary-like objects, as seen in 
Sec. 2.4.3.) Secondly, it is also apparent that the two options differ with regard 
to (L.3) itself: (L.4) is compatible with this principle, (L.5) isn’t. Thus, the 
second option makes it impossible to assert the thesis that everything is located 
somewhere—a thesis which, albeit controversial, is certainly not inconsistent. 
Again, this is a limitation that would dissolve if ‘R’ were available in the lan- 
guage, in which case the thesis in question could be reformulated as follows: 
(L.3′) $\neg Rx \rightarrow \exists y\,Lxy$


But precisely because ‘R’ cannot be defined absent (L.6), the limitation is not 
immaterial. 

For these reasons, in the following we shall favor the first option and as- 
sume the conditional reflexivity axiom (L.4). Together with the functionality 
postulate (L.1), this yields a minimal theory of (exact) spatial location, which 
we shall label S: this theory is incompatible with (L.5), but it includes the 
exactness principle (L.2) as a theorem and can be strengthened by adding the 
spatiality principle (L.3), the fullness principle (L.6), or both. 

At this point, we could consider various ways of strengthening S (or its
extensions) by imposing suitable axioms on the predicate ‘R’ defined in (133). 
For example, it seems reasonable to assume that regionhood is both dissective 
and cumulative, i.e., that any part of a region is itself a region, and that the sum 
of any regions (if it exists) is a region, too: 


(L.7) $Ry \wedge Pxy \rightarrow Rx$    Dissectiveness
(L.8) $z = \sigma x\,\phi x \wedge \forall x(\phi x \rightarrow Rx) \rightarrow Rz$    Cumulativity


It may also be reasonable to consider additional postulates concerning whether 
the class of all regions forms a dense domain, or whether it forms an atom- 
less, possibly a boundaryless domain as opposed to an atomistic domain every 
element of which consists (intuitively) of points: 





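(L.9) $Rx \wedge Ry \wedge PPxy \rightarrow \exists z(Rz \wedge PPxz \wedge PPzy)$    R-Density
(L.10) $Rx \rightarrow \exists y(Ry \wedge PPyx)$    R-Atomlessness
(L.11) $Rx \rightarrow \exists y(Ry \wedge IPPyx)$    R-Boundarylessness
(L.12) $Rx \rightarrow \exists y(Ry \wedge Ay \wedge Pyx)$    R-Atomicity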





More generally, it may be reasonable at this point to consider whether the 
domain of regions should be closed under various mereotopological principles, 
regardless of whether such principles hold of the entities that may occupy those 
regions. For example, even an anti-extensionalist about material objects will 
presumably deny that different regions may consist of the same proper parts, 
and even those who have misgivings about strong composition principles for 
arbitrary objects might be happy to endorse unrestricted composition of spatial
regions. All such extensions of S are obviously worth examining, and they 
are crucial if we want to fix the intended range of the relational predicate ‘L’, 
but there is no need here to review all the options: suffice it to say that the 
availability of ‘R’ makes it possible to examine them in a systematic fashion. 
(An interesting question, for instance, is whether one can provide a purely
mereotopological characterization of Euclidean space; see Tsai 2005, §7.4, for
a negative answer.) Rather than focusing on the structure of space per se, let us
see how S can be further extended by considering more closely the relationship
between the two terms of the location relation: the structure of regions and the
structure of their tenants.


3.3 Mirroring Principles 


To this end, let us begin by noting that the four relations L, GL, UL, and EL 
do not exhaust all the options. Additional locative relations can be specified by 
replacing the plain mereological predicates in (123)-(125) with finer-grained
mereotopological predicates, for example:








(134) $TELxy =_{df} \exists z(TPPzy \wedge Lxz)$    Tangential EL
(135) $IELxy =_{df} \exists z(IPPzy \wedge Lxz)$    Interior EL
(136) $TULxy =_{df} \exists z(TPPyz \wedge Lxz)$    Tangential UL
(137) $IULxy =_{df} \exists z(IPPyz \wedge Lxz)$    Interior UL


More generally, given any mereotopological relation $\psi$, there is a corresponding
locative relation $L_\psi$, defined by:

(138) $L_\psi xy =_{df} \exists z(\psi zy \wedge Lxz)$    $\psi$-Location

(Thus, $GL = L_O$, $EL = L_P$, $UL = L_{\breve{P}}$ with $\breve{P}$ the converse of P, etc.)
Such generalizations are straightforward, but they bear out that the language of
location can be as rich as the underlying mereotopological vocabulary. Indeed,
this is only half of the story. According to (138), to be $\psi$-located at a region y
is to be exactly located at some region, z, that is $\psi$-related to y. Thus, the
resulting variety of locative relations is defined with reference to the
mereotopological structure of the range of L, i.e., the structure of space. But
one could also consider the obvious alternative, and characterize locative
relations by reference to the mereotopological structure of the domain of L,
i.e., the structure of space’s tenants. In this alternative sense, to be
$\psi$-located at a region y is to be $\psi$-related to some object, z, that is
exactly located at y:

(139) $L^\psi xy =_{df} \exists z(\psi xz \wedge Lzy)$    $\psi$-Location (2)





Now, the interesting question is whether these two ways of characterizing
locative relations should coincide, i.e., whether ‘$L^\psi$’ and ‘$L_\psi$’ should
always stand and fall together. This is trivially true when the relata are of the
same kind, i.e., regions, for in that case locative relations collapse to
mereotopological relations in view of the following S-theorems:

(140) $Rx \rightarrow (L_\psi xy \leftrightarrow \psi xy)$
(141) $Rx \rightarrow (L^\psi xy \leftrightarrow \psi xy)$

But what about the general case? This is not just an interesting question to ask
if we want to get the map straight; it is also a question that calls for interesting
philosophical decisions.

To address the matter properly, let us consider the two directions of the
equivalence separately, corresponding to the following theses:

(L.13) $L^\psi xy \rightarrow L_\psi xy$    Bottom Mirroring
(L.14) $L_\psi xy \rightarrow L^\psi xy$    Top Mirroring


3.3.1 Bottom mirroring and co-location. Informally, the first of these
theses says that the structure of space should be at least as rich as the structure
of its tenants. For example, suppose I am exactly located at region r. Then my
right foot, which is part of me, is $L^P$-located at r. But if my foot is part of
me, then it is reasonable to suppose that its exact location is part of my exact
location, which is precisely what (L.13) would imply: my foot is $L_P$-located
(i.e., entirely located) at r. For another example, since my foot is connected to
the rest of my body, which is located at a certain region r′, it is reasonable
to suppose that the location of my foot is connected to r′, too: my foot being
$L^C$-located at r′ implies its being $L_C$-located at r′. In general, adding (L.13) to
S would secure that if two objects are $\psi$-related (where $\psi$ is P, C, or any other
mereotopological relation), then so are their locations:


(142) $Lxy \wedge Lzw \wedge \psi xz \rightarrow \psi yw$.


Plausible as all this might sound, it is however easy to see that (L.13) does 
not generally hold. A simple counterexample is depicted in Fig. 15.24, left. 
In this model there is just one (atomic) region, r, and a complex object, a, 
consisting of two (atomic) parts, b and c. Object a is located at r, hence its 
parts b and c are $L^P$-located at r. Yet they are not $L_P$-located at r because there
is no part of r at which they are exactly located. Note that the mereotopological 
relations in this model can easily be extended so as to satisfy all the axioms 
of KGEMT, so the counterexample does not depend on the strength of the 
underlying mereotopological theory. It depends exclusively on the behavior of
the relation L. 
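
The counterexample can be replayed mechanically. The Python sketch below encodes an assumed reading of the model just described (one atomic region r, an object a with atomic parts b and c, and only a and r exactly located) and implements the two families of locative relations from (138) and (139) as higher-order functions. The helper names `sub` and `sup` are hypothetical labels for the subscripted and superscripted relations.

```python
# Assumed toy model in the spirit of Fig. 15.24, left: one atomic region r
# and an object a with two atomic proper parts b and c.
DOMAIN = ["r", "a", "b", "c"]
PARTHOOD = {(x, x) for x in DOMAIN} | {("b", "a"), ("c", "a")}
LOCATION = {("a", "r"), ("r", "r")}      # b and c have no exact location

def P(x, y):
    return (x, y) in PARTHOOD

def L(x, y):
    return (x, y) in LOCATION

def sub(psi):
    """Locative relation of (138): exactly located at some z with psi(z, y)."""
    return lambda x, y: any(psi(z, y) and L(x, z) for z in DOMAIN)

def sup(psi):
    """Locative relation of (139): psi-related to some z exactly located at y."""
    return lambda x, y: any(psi(x, z) and L(z, y) for z in DOMAIN)

L_sub_P, L_sup_P = sub(P), sup(P)

# Pairs at which Bottom Mirroring (L.13) fails for psi = P.
violations = [(x, y) for x in DOMAIN for y in DOMAIN
              if L_sup_P(x, y) and not L_sub_P(x, y)]
print(violations)
```

Passing a different relation to `sub` and `sup` (for instance a connection relation, if one is added to the model) gives the corresponding instance of the mirroring question.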

Now, it might be observed that this model (or its KGEMT closure) would be
ruled out if we assumed that every part of a spatially located entity had a spatial 
location—a restricted form of the spatiality principle (L.3). More generally, it 
might be thought that in order to justify (L.13) one must assume the following: 





(L.15) $\psi xz \wedge Lzy \rightarrow \exists w\,Lxw$    Conditional Spatiality


Bottom mirroring should not hold holus bolus, but only when x and its $\psi$-relata
are genuine spatial entities (in which case (L.13) turns out to be equivalent to
(142)). However, it is easy to see that even S + (L.15) would fail to warrant
this claim. A counterexample is depicted in Fig. 15.24, center. Here b and c
are $L^{PP}$-located at r, since each is a proper part of a, which is located at r. Yet
there is no proper part of r at which b or c is located, which is to say that neither
is $L_{PP}$-located at r.

Figure 15.24. Three violations of Bottom Mirroring. (Location relationships are
represented by gray arrows; reflexivities omitted.)

We begin here to see the hidden force of (L.13). By requiring that the struc- 
ture of space mirror the structure of its tenants, this principle rules out the 
possibility that mereotopologically distinct entities be co-located, i.e., located 
at the same region. Of course, it may be difficult to imagine a concrete scenario 
corresponding to the model in question, and certainly very difficult to provide 
a less abstract representation of it. But it is not difficult to provide a good ex- 
ample if, for instance, we give up mereological extensionality. Consider again 
Tibbles, the cat, and the ‘mere’ mereological sum of his tail with the rest of his 
body, Tib + Tail (Sec. 1.3.2). Obviously, even if one treated these entities as 
distinct, one would still like to say that they share one and the same location. 
Yet, if ‘$\psi$’ expresses the relevant mereotopological relationship of distinctness
cum sameness of proper parts, it is obvious that the region to which Tibbles (or
Tib + Tail) bears the relation $L^\psi$ is not the region to which it bears the relation
$L_\psi$ (Fig. 15.24, right). Moreover, even in the presence of mereological
extensionality one may conceive of situations where spatial co-location seems
possible. We have already seen, for instance, that according to Chisholm’s (1984)
Brentanian theory, topological connection is explained precisely in terms of 
spatial coincidence of boundaries: boundaries are located in space but do not 
occupy space, hence they can coincide while being distinct (Sec. 2.4.2). For a
scenario compatible with the full strength of KGEMT, consider Davidson’s
(1969) example concerning event identity: arguably the rotation and the get- 
ting warm of a metal ball that is spinning fast are two events, yet they occur 
exactly in the same region and they share that location with the ball itself. Fi- 
nally, if our ontology is rich enough to include immaterial or otherwise ethereal 
creatures for which genuine interpenetration is possible, then again co-location 
seems conceivable. Already Leibniz mentioned shadows as a case in point (New
Essays, II-xxvii-1); other candidates include clouds (Shorter 1977), holes
(Casati and Varzi 1994), ghosts (van Inwagen 1990, p. 81), and even angels
(Lewis 1991, p. 75). 

These are just some examples. But they are indicative of the many philo- 
sophical motives that may lie behind a rejection of the principle according to 
which proper spatial co-location is impossible—a principle that can be put thus: 


(L.16) $PLxy \wedge PLzy \rightarrow x = z$    Exclusiveness


Giving up this principle involves giving up (L.13), at least in its general form. 
And the question of what special instances one should posit, i.e., what values
of ‘$\psi$’ and ‘x’ satisfy bottom mirroring, calls for a detailed case-by-case
investigation, and for explicit ontological decisions.


3.3.2 Top Mirroring. Consider now the converse of (L.13), namely
the principle of top mirroring, (L.14). Informally, this says that the structure of
space should be mirrored in the structure of those entities that inhabit it. For
example, if the location of my body properly includes a region r, then it is
reasonable to suppose that my body properly includes something located at r:
$L_{PE}$-location, hence $L^{PE}$-location. Pretty clearly, however, there are numerous
relations $\psi$ for which this sort of implication appears problematic. The location
of my body is a proper part of any region that includes this room, but there
is no obvious reason to think that every such region is the location of some
existing object, hence no reason to think that $L_{PP}$-location implies $L^{PP}$-location.
(Compare Fig. 15.25, left.) Similarly, my body is $L_{EC}$-located at many regions,
viz. regions externally connected to my body’s current location; yet there is no
obvious reason to think that my body is $L^{EC}$-located at those regions, since many
of them may be (partly) empty (Fig. 15.25, center). Just as bottom mirroring
appears to presuppose the spatiality principle, or at least its restricted variant
(L.15), top mirroring appears to presuppose fullness, or at least the following
restricted version:





(L.17) $Ry \wedge \psi zy \wedge Lxz \rightarrow \exists w\,PLwy$    Conditional Fullness


And this is a substantive presupposition that few might grant, regardless of their 
views concerning purely mereotopological matters. 

Figure 15.25. Three violations of Top Mirroring.

Indeed, even when $\psi$ is the seemingly innocent relation of proper extension,
as in our first example, (L.17) appears problematic. Misgivings about this
principle (hence about the corresponding instance of top mirroring) come in
various forms. First of all, there are arguments that purport to show that the
principle is empirically false (Parsons 2000). Second, the principle rules out a
priori the possibility of spatially extended mereological atoms, as in Fig. 15.25,
right. To the extent that one can conceive of such things, it is argued, it should
not be a conceptual truth that every region ubiquitously occupied by an object
is exactly occupied by a part of that object. (See Markosian 1998a; other
contemporary philosophers who endorse the possibility of extended simples
include Parsons 2004 and Simons 2004, but the view goes back to Democritus’s
claim that atoms come in an infinite variety of shapes and sizes.) Third, there
are arguments to the effect that (L.17) sits ill with the thought that ordinary
material bodies can gain or lose some parts (van Inwagen 1981). To illustrate,
consider again Tibbles, the cat whose tail gets annihilated at t, and suppose
we agree that it survives the accident. Prior to t, (L.17) would suggest that in
addition to the whole cat there exist also two externally connected proper parts:
Tail and Tib (the remainder). Now consider the following statements:


(143) Tib (before t) = Tib (after t)
(144) Tib (after t) = Tibbles without Tail (after t)
(145) Tibbles without Tail (after t) = Tibbles with Tail (before t)
(146) Tibbles with Tail (before t) ≠ Tib (before t)


These four statements are jointly inconsistent, unless one is willing to give up 
the transitivity of identity, so something must give. (146), however, is trivially 
true: there is no way one could identify a whole cat with its tailless portion. 
And (145) is true by assumption: to give it up is to deny that Tibbles survives 
the accident, unless one is willing to construe cats as four-dimensional entities 
whose temporal parts are numerically distinct (Heller 1984). As for (144), its
denial would obviously incur a commitment to properly co-located entities,
not to mention a violation of mereological extensionality (Wiggins 1968). Thus,
so the argument goes, unless one is ready to accept such unpalatable consequences,
the only option is to give up (143): that identity is false for the simple reason that
prior to the accident Tib does not even exist; it only exists after the accident,
and it exists as tailless Tibbles (see footnote 15). (One might also say that before
the accident Tib does not actually exist. The view that undetached parts are mere
‘potential entities’ has been the focus of an intense debate in early modern
philosophy; see Holden 2004. Brentano 1933 endorsed a similar view, too, and some
authors have applied it explicitly to the puzzle in question, e.g., Smith 1994, §3.5.)

15 The puzzle raised by (143)-(146) was introduced to contemporary discussion by
Wiggins (1968), but it goes back at least to the Stoics; see e.g. Sorabji (1988,
§1.6). For a detailed overview, see Simons (1987, §3.3) and the introduction to
Rea (1997).
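
The logical shape of the puzzle, three identities and one denied identity that cannot all hold if identity is transitive, can be made explicit with a small consistency check. The Python sketch below is only an illustration: the term names are hypothetical labels for the four descriptions in (143)-(146), and the union-find routine simply closes the asserted identities under symmetry and transitivity.

```python
def consistent(terms, equalities, inequalities):
    """Check identity statements for joint consistency via union-find."""
    parent = {t: t for t in terms}

    def find(t):
        while parent[t] != t:
            t = parent[t]
        return t

    for a, b in equalities:          # merge the two sides of each identity
        parent[find(a)] = find(b)
    # Consistent iff no denied identity joins terms in the same class.
    return all(find(a) != find(b) for a, b in inequalities)

# Hypothetical labels for the four descriptions used in (143)-(146).
TERMS = ["tib_before", "tib_after",
         "tibbles_without_tail_after", "tibbles_with_tail_before"]
EQ = {
    143: ("tib_before", "tib_after"),
    144: ("tib_after", "tibbles_without_tail_after"),
    145: ("tibbles_without_tail_after", "tibbles_with_tail_before"),
}
NEQ = [("tibbles_with_tail_before", "tib_before")]      # (146)

all_four = consistent(TERMS, EQ.values(), NEQ)
# Which single identity, if given up, restores consistency with (146)?
drop_one = {n: consistent(TERMS, [e for m, e in EQ.items() if m != n], NEQ)
            for n in EQ}
print(all_four, drop_one)
```

The check is purely formal: it shows that giving up any one of the three identities suffices, and leaves untouched the philosophical question of which one should go.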

Whether any such arguments are found compelling is, of course, an open 
issue. Nonetheless, the obvious moral is that (L.14) can hardly be regarded as a 
conceptual truth about location. Even if we confine ourselves to its single, prima 
facie plausible instance in which $\psi$ is the relation of (proper) extension, i.e.,
the converse of (proper) parthood, top mirroring is a substantive metaphysical 
thesis whose addition to S must be independently motivated. 


3.3.3 Further locative relations. In discussing such matters, it is 
useful to keep in mind that the lack of a full correspondence between the mer- 
eotopology of space and the mereotopology of its tenants may find expression 
in the failure of other principles or equivalences that might otherwise sug- 
gest themselves. Consider, for instance, the following relation (from Parsons 
2006, 84): 


(147) $WLxy =_{df} \forall z(Pzx \rightarrow GLzy)$ Whole Location


Intuitively, this says that an object is wholly located in this room (for instance) if
every part of the object is generically located in this room, i.e., if none of it is
missing from the room. This might sound like a different way of saying that the object
is entirely located in this room, in the original sense of (124) (corresponding to
$L_P$-location) or in the alternative sense of (139) ($L^P$-location). In fact, however,
all these notions are distinct. Not only do $L_P$-location and $L^P$-location come
apart, as seen above. They also differ from whole location: the diagram in
Fig. 15.24, left, corresponds to a model in which an object, a, is both $L_P$- and
$L^P$-located at a region, r, in spite of not being wholly located there; the diagram
in Fig. 15.25, right, depicts a model in which a (an extended atom) is wholly
located at region $r_1$ in spite of being neither $L_P$- nor $L^P$-located there.¹⁶ (Note
that both of these models are extensional, and would continue to exhibit these
features even if closed under every KGEMT-axiom.)

¹⁶ In Casati and Varzi (1999, §7.2), entire location is called ‘whole location’ and labelled ‘WL’.

The notion of whole location is just one example. In general, for any
mereotopological relation $\psi$, one can define two additional locative relations
besides $L_\psi$ and $L^\psi$, obtained by switching to a universal quantifier and replacing
‘L’ by ‘GL’:


(148) $L_{\forall\psi}xy =_{df} \forall z(\psi zy \rightarrow GLxz)$
(149) $L^{\forall\psi}xy =_{df} \forall z(\psi zx \rightarrow GLzy)$


Here $WL = L^{\forall P}$, and it should be obvious from our single example that the
equivalence between the new predicates in (148)-(149) and the old predicates
in (138)-(139) is generally an open question. Indeed, at this point we get the
full picture by further generalizing these four basic patterns in the obvious
(recursive) way: if $\lambda$ is any locative relation, then so are the following:


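(150) $L_{\lambda\psi}xy =_{df} \exists z(\psi zy \wedge \lambda xz)$
(151) $L^{\lambda\psi}xy =_{df} \exists z(\psi zx \wedge \lambda zy)$
(152) $L_{\forall\lambda\psi}xy =_{df} \forall z(\psi zy \rightarrow \lambda xz)$
(153) $L^{\forall\lambda\psi}xy =_{df} \forall z(\psi zx \rightarrow \lambda zy)$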





There are lots of redundancies and empty relations in this picture, whose
complexity depends significantly on the mereotopological axioms governing $\psi$.
Nonetheless, it is only through a careful study of such intricacies, and of the
corresponding mirroring principles (at the moment largely unexplored), that a
reasonably complete theory of location can emerge.
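
The recursive patterns (150)-(153) can be made concrete on a finite domain. The
following Python sketch is an illustration only and is not part of the original text.
Relations are modelled as two-place Boolean functions; the helper names
(`lower_exists`, `upper_exists`, `lower_forall`, `upper_forall`) and the toy model
(an object a with parts a1, a2, exactly located at a region r with parts r1, r2)
are assumptions introduced for the example.

```python
from typing import Callable, Sequence

Rel = Callable[[str, str], bool]  # a two-place relation on a finite domain


def lower_exists(dom: Sequence[str], psi: Rel, lam: Rel) -> Rel:
    """Pattern (150): x ~ y iff some z has psi(z, y) and lam(x, z)."""
    return lambda x, y: any(psi(z, y) and lam(x, z) for z in dom)


def upper_exists(dom: Sequence[str], psi: Rel, lam: Rel) -> Rel:
    """Pattern (151): x ~ y iff some z has psi(z, x) and lam(z, y)."""
    return lambda x, y: any(psi(z, x) and lam(z, y) for z in dom)


def lower_forall(dom: Sequence[str], psi: Rel, lam: Rel) -> Rel:
    """Pattern (152): x ~ y iff every z with psi(z, y) has lam(x, z)."""
    return lambda x, y: all(lam(x, z) for z in dom if psi(z, y))


def upper_forall(dom: Sequence[str], psi: Rel, lam: Rel) -> Rel:
    """Pattern (153): x ~ y iff every z with psi(z, x) has lam(z, y)."""
    return lambda x, y: all(lam(z, y) for z in dom if psi(z, x))


# Hypothetical toy model: object a with parts a1, a2; region r with parts r1, r2.
DOM = ["a", "a1", "a2", "r", "r1", "r2"]
PROPER_PARTS = {("a1", "a"), ("a2", "a"), ("r1", "r"), ("r2", "r")}
EXACT = {("a", "r"), ("a1", "r1"), ("a2", "r2"),
         ("r", "r"), ("r1", "r1"), ("r2", "r2")}


def P(x: str, y: str) -> bool:
    """Parthood: reflexive closure of the listed proper-part facts."""
    return x == y or (x, y) in PROPER_PARTS


def L(x: str, y: str) -> bool:
    """Exact location, read off the EXACT table."""
    return (x, y) in EXACT


# Instances with psi = P and lam = L.
l_sub_p = lower_exists(DOM, P, L)
l_sup_p = upper_exists(DOM, P, L)
# The output of one pattern is itself a locative relation and can be fed back in.
nested = lower_forall(DOM, P, l_sup_p)
```

In this model `l_sub_p("a1", "r")` holds while `l_sup_p("a1", "r")` does not, so the
two basic patterns come apart even in a very small structure. Feeding `l_sup_p` back
in as the locative argument of `lower_forall` shows the recursive step at work.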


3.4 Relative locations 


The axiomatization of the region predicate ‘R’ and the positing of suitable
mirroring principles constitute the two main directions in which theory S can be
extended. Let us briefly mention a third direction, whose ramifications span
philosophical and methodological issues alike. We have said that in ordinary 
language location is often understood as a relation between two objects, as 
opposed to an object and a spatial region, and we have said that such under- 
standings need not be taken to express any fundamental relationships: often, 
such ‘relative’ locations are mereotopological relations in disguise, as in exam- 
ples (118)-(120). There are, however, cases that resist this sort of explanation. 
Consider: 


(154) The brain is located inside the cranial cavity. 
(155) The swimming pool is located behind the house. 
(156) The bus stop is located right across the old oak tree. 


Surely the truth conditions of these statements can hardly be explained in terms 
of mereotopological relations, so one can hardly leave it at that. Can S be
strengthened so as to account for such cases as well? 

In a way, the answer is straightforward. Consider (154). Although there 
is no direct mereotopological relationship between the brain and the cranial 
cavity (unless one thinks the latter literally surrounds the former), one can 
still explain the relevant truth conditions by reference to the mereotopology 
of the corresponding spatial locations: statement (154) is true if, and only if, 
the location of the brain is an interior proper part of the location of the cranial 
cavity. Equivalently, (154) is true if and only if the brain is located entirely in the 
interior of the spatial region occupied by the cranial cavity. This suggests that 
cases such as this can easily be accommodated within the present framework. 

Figure 15.26. Object-centered (left) and observer-centered (right) frames of reference.


When we say that a ‘target’ object, x, bears a certain locative relation to a 
‘reference’ object, y, we mean to say that x bears that relation to y’s place. To 
make this clear, let us introduce an explicit location functor, whose uniqueness 
follows directly from the functionality axiom (L.1): 


(157) $px =_{df} \iota y\,Lxy$ Place


(See Donnelly 2004 for a location theory with p treated as a primitive.) Then,
for any locative relation $\lambda$ and any spatial objects x and y, we can define a
corresponding predicate of relative location ‘$R\lambda$’ as follows:


(158) $R\lambda xy =_{df} \lambda x(py)$ Relative Location


In the brain-cavity case, $\lambda$ is the relation IEL defined in (135), i.e., $L_{IPP}$, but
the same pattern would apply to a large variety of other cases, provided the
reference object has a location somewhere. In fact, the same pattern can be
applied to account for the initial examples in (116)-(118), too: the biceps
muscle is $RL_P$-located at the arm; the parking area is $RL_{EC}$-located at the
stadium; the elevator is $RL_{IPP}$-located at the main building. In the presence of
suitable bottom mirroring conditions, these three claims are equivalent to the
pure mereotopological claims obtained by replacing the relation $RL_\psi$ with $\psi$
itself.
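
Definitions (157) and (158) can be illustrated with a small finite model of the
brain-cavity case. The Python sketch below is hypothetical: the entity names, the
single IPP fact between the two regions, and the helper names `place`, `l_sub`
and `relative` are assumptions made for the example. It also assumes that
$L_\psi$ has the existential form $\exists z(\psi zy \wedge Lxz)$, as suggested by
the remark that (148) is obtained from it by switching quantifier and replacing
‘L’ by ‘GL’.

```python
from typing import Callable, Dict, List, Set, Tuple

Rel = Callable[[str, str], bool]

# Exact location as a functional table (cf. axiom L.1); regions are located
# at themselves (cf. conditional reflexivity, L.4). All names are hypothetical.
EXACT_LOCATION: Dict[str, str] = {
    "brain": "r_brain",
    "cavity": "r_cavity",
    "r_brain": "r_brain",
    "r_cavity": "r_cavity",
}
DOM: List[str] = list(EXACT_LOCATION)

# Stipulated fact: the brain's region is an interior proper part of the cavity's.
IPP_FACTS: Set[Tuple[str, str]] = {("r_brain", "r_cavity")}


def L(x: str, y: str) -> bool:
    """Exact location, read off the table."""
    return EXACT_LOCATION.get(x) == y


def IPP(x: str, y: str) -> bool:
    """Interior proper parthood, restricted to the stipulated facts."""
    return (x, y) in IPP_FACTS


def place(x: str) -> str:
    """The location functor of (157): the unique y such that Lxy."""
    return EXACT_LOCATION[x]


def l_sub(psi: Rel) -> Rel:
    """L_psi: x is exactly located at some z that bears psi to y."""
    return lambda x, y: any(psi(z, y) and L(x, z) for z in DOM)


def relative(lam: Rel) -> Rel:
    """Relative location (158): R-lambda x y iff lambda x (p y)."""
    return lambda x, y: lam(x, place(y))


inside = relative(l_sub(IPP))
```

Here `inside("brain", "cavity")` holds and `inside("cavity", "brain")` does not.
As in the text, the construction presupposes that the reference object has a
location: `place` raises a `KeyError` for an entity missing from the table.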

Cases such as (155) and (156) are different. Here the difficulty does not 
just lie in the fact that the target object and the reference object do not stand 
in any mereotopological relation to each other. The difficulty is that the spatial 
relationships reported by such statements—corresponding to such prepositions 
as ‘behind’ and ‘across’, but also ‘above’, ‘underneath’, ‘left of’, etc.—have 
little or nothing to do with mereotopology. There is in fact a large literature 
devoted to the semantics of spatial prepositions (beginning with the classic work 
of Herskovits 1986) and it is fair to say that their treatment requires a degree of
sophistication that goes far beyond the conceptual apparatus developed above. 
Among other things, there are well-known complications owing to the fact 
that their treatment calls for a systematic distinction between object-centered 
frames of reference, as in (155), and observer-centered frames, as in (156) (see 
Fig. 15.26), whereas mereotopological relations are completely independent
of subjective or perspectival considerations. Still, this is not a limitation that 
speaks against the employment of a primitive such as ‘L’. It is, rather, an 
indication that the ensuing theory, S, may have to be matched with a more 
sophisticated background than a mereotopological theory can afford. Thus, 
suppose we allow the relational predicate ‘$\psi$’ in the definitions of Sec. 3.3
to stand for relations that are not purely mereotopological: relations such as 
‘behind’, ‘across’, etc. (in each of their multiple uses). Then the idea illustrated 
with reference to the brain-cavity example can in principle be applied also to 
(155), (156), and the like. To say that the swimming pool is located behind the 
house is to say that it is $RL_{behind}$-located at the house. To say that the bus stop
is located across the tree is to say that it is $RL_{across}$-located at the tree. And
so on. Exactly how these relations should be formalized, i.e., what principles 
should be posited to fix the logical behavior of the relevant $\psi$, is where things
get difficult and may require a lot of detailed work. (We have, after all, seen
how difficult it is to do this when $\psi$ is a mereotopological relation.) But that is
not to say that relative locations require a sui generis treatment that a suitable 
extension of S could not accommodate. 

On the other hand, here is where the main metaphysical assumption under- 
lying S may be questioned. The accommodation comes with definition (158), 
which allows one to handle relative locations in terms of a primitive relation, L, 
whose range consists exclusively of spatial regions. In other words, it allows 
one to express a spatial relation between a target and a reference object as a 
relation between the target and the reference’s place. But one might object 
that this has things the wrong way round. Relative locations are ontologically 
neutral with respect to the status of space, whereas the proposed treatment de- 
pends crucially on the assumption that places, and spatial regions generally, 
are entities of a kind. A relationist about space might therefore reject the ac- 
count and take the opportunity to reverse the order of the explanation: to be 
$RL_\psi$-related to a place is to be $\psi$-related to the place’s tenant, and talk about
places is shorthand for talk about spatial objects. A substantivalist about
space, too, might think that when it comes to spatial reasoning, objects are
conceptually prior to their locations, since we cannot identify the latter independently
of the former (Strawson 1959, ch. 1). Even an early substantivalist such
as Newton emphasized that absolute places, defined as in (157), are scarcely 
useful for locating things in the world: we do not locate an object on a moving 
ship with reference to an immobile environment but, rather, with reference to 
the ship itself (Principia, Definitions, Sch. 4). In short, there may be philosoph- 
ical as well as methodological reasons for resisting the treatment of locative 
relations indicated above, and if these reasons are taken seriously, then cases 
such as (154)-(156) run afoul of the basic framework outlined here and call for
independent treatment. An articulated proposal in this spirit may be found in
Donnelly (2005), but much recent literature devoted to the formal representation
of direction and other qualitative spatial relations (from the works of Mukerjee 
and Joe 1990, Frank 1992, and Hernandez 1994 to more interdisciplinary works 
such as those collected in van der Zee and Slack 2003) may be viewed in this 
perspective. 


3.5 Location, connection, and parthood 


Let us conclude with a few remarks about the whole conceptual package that 
we have been putting together. We have at this point three main primitive no- 
tions: location, connection, and parthood. Some structural relationships among 
these notions have been examined, but the general question of their ontolog- 
ical intertwining is still open. Generally speaking, parthood and connection 
are independent from each other, unless one accepts the converse monotonicity 
principle (C.4) (Sec. 2.3.2). But what about location? Let us keep with the 
assumption that location is a relation between a thing and its place. Is that 
relation completely autonomous or does it entail a mereotopological linkage of 
some sort? 

A purely mereological linkage seems out of the question. There is no reason
to think that I share any parts with the space I occupy, just as there is no reason to
think that movement (change of location) is a form of mereological change.
This is not to say that location implies disjointness, since the first argument 
of L may itself be a region, or a hybrid fusion including regions among other 
things. In general, however, it seems perfectly reasonable to assume that the 
implication holds for ordinary cases of location—a thesis that can be put as 
follows: 





(L.18) $PLxy \rightarrow \exists z(Pzx \wedge Lzy \wedge Dzy)$ Spatial Disjointness


What about a topological linkage? In this case the picture seems different. 
If an object is located at some place, then it might be plausible to suppose that 
the object and its place are connected in some way: 


(L.19) $Lxy \rightarrow Cxy$ Spatial Connection


In what way would I be connected to my place? Since overlap is excluded, the 
relevant linkage would have to be one of external connection. However, this 
means that the plausibility of (L.19) depends crucially on the interpretation of 
the topological primitive ‘C’. 

If we go along with the standard interpretation, corresponding to a
boundary-based theory such as KGEMT, the prospects are slim. On that interpretation,
two things can be externally connected only if one is open and the other closed, 
at least in the relevant contact area: two closed entities, or two open entities, can 
only connect through mereological overlap (Sec. 2.4.1). Now, suppose I am a
closed body. Then (L.19), together with (L.18), implies that the region at which
I am located—my place—must be open. That region, however, has a closure, 
and one should think that the closure of a spatial region is itself a region: its 
boundary is pure space, too. But my boundary is not pure space: whatever 
it is, it is part of me, and none of me is made of space. Thus, the closure of 
my place and I are disjoint—which is to say that we are not connected after 
all. A different way of putting this involves the thought that my place and I 
must have the same topology: if I am closed (for instance), then my place must 
be closed too, hence we cannot be externally connected. Strictly speaking, 
this involves an appeal to the principle of bottom mirroring, specifically the 
following instance of corollary (142): 


(159) $Lxy \wedge Lzw \wedge IPPxz \rightarrow IPPyw$


My interior (x) is an interior proper part of me (z), hence its place (y) must be
an interior proper part of my place (w). We have seen that bottom mirroring
may not generally hold, but this specific instance seems fair. Yet (L.19) would
require the opposite: it would require my place to be an interior proper part of
the place of my interior, which is absurd.

By contrast, suppose we go along with a different interpretation. For in- 
stance, consider those interpretations that explain connection explicitly in terms 
of spatial location. We have seen three such interpretations, corresponding to 
the following necessary and sufficient conditions for two entities x and y to be 
connected: (i) the place of x is connected to the place of y (Sec. 2.3.2, thesis 
(74)); (ii) the place of x overlaps the place of y (Sec. 2.4.2, thesis (89)); or (iii) 
the closure of the place of x overlaps the closure of the place of y (Sec. 2.4.3, 
thesis (90)). With the help of ‘L’, and assuming the spatiality axiom (L.3), these 
three interpretations can now be formally stated as follows: 


(160) $Cxy \leftrightarrow C(px)(py)$
(161) $Cxy \leftrightarrow O(px)(py)$
(162) $Cxy \leftrightarrow O(cpx)(cpy)$


And, plainly, each of these statements is perfectly compatible with the spatial 
connection principle (L.19). In fact, since C and O are both reflexive, each 
statement entails (L.19) in view of the following S-theorem: 


(163) $Lxy \rightarrow px = y = py$


This is not surprising. After all, these theories establish an intimate relationship 
between topology and spatiality, and the claim that location implies connection 
is but one way of making that explicit. One might even go as far as to say that 
on these theories location is connection of a kind: it is the relation of connection
that always holds in the special case where one of the relata equals the place
of the other. On the other hand, it is fair to note that all of this depends on
the assumption that L is conditionally reflexive (L.4). Should one decide to go 
for the alternative option (L.5), treating L as a relation whose domain does not 
contain any spatial regions, (163) would not hold and (L.19) would not follow 
from (160)-(162). Indeed, on that way of interpreting ‘L’, the picture would 
be perfectly reversed: nothing would be connected to its place because places 
would lack a place of their own, hence (L.19) would be just as unacceptable in 
these theories as it is in KGEMT. 
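
The step from (160) to (L.19) noted above can be spelled out line by line. The
following is only a sketch under the assumptions stated in the text, namely
theorem (163), which in turn rests on conditional reflexivity (L.4), and the
reflexivity of C. The same chain works for (161) and (162), using the reflexivity
of O at step 3, applied to y and to the closure cy respectively.

```latex
\begin{align*}
1.\;& Lxy        && \text{assumption} \\
2.\;& px = y = py && \text{from 1 by (163)} \\
3.\;& Cyy        && \text{reflexivity of } C \\
4.\;& C(px)(py)  && \text{from 2 and 3, substituting identicals} \\
5.\;& Cxy        && \text{from 4 by (160), right to left}
\end{align*}
```

If (L.5) is adopted instead of (L.4), step 2 is no longer available, because y then
has no place of its own and py is undefined.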

These are just some examples, but they suffice to show that the ontological in- 
tertwining between locative and mereotopological concepts cannot be assessed 
without a clear stand on some very basic semantic issues concerning those con- 
cepts. Of course, this is also true of the interplay between mereological and 
topological concepts, but in the present case the stakes are higher. Some of the 
options leave room for no intertwining whatsoever; other options trivialize the 
intertwining by explaining one sort of concept directly in terms of the other. 
There is, to be sure, some room for compromise. After all, KGEMT is a pretty
strong theory and its weaker children are compatible with (L.19). Similarly, 
there is strictly speaking no constraint to supplement its main alternatives with 
the explicit equivalences in (160)-(162), so the status of (L.19) is in principle 
open for those theories, too. One final remark, however, is in order. For
suppose we do accept (L.19) together with (L.18), that is, suppose we do establish an
overt mereotopological linkage of external connection between a spatial object
and its spatial location. Then it follows that no spatial entity will ever qualify
as an interior proper part of anything, except for empty regions of space:


(164) $PLxy \rightarrow \neg IPPxz \wedge \neg IPPyz$


This is an immediate consequence of the definition of ‘IPP’ (Sec. 2.3.1): my 
‘interior’ proper parts, for example, would immediately turn into tangential 
proper parts by virtue of being connected to something with which I have no 
parts in common—their places; and the ‘interior’ proper parts of my place, at 
least those proper parts that are not empty, would immediately turn into tan- 
gential proper parts by virtue of being connected to something with which they 
have nothing in common—their material guests. This is bad news, for it means 
that our mereotopology would collapse altogether. Or rather, it means that it 
would have to be largely re-written by replacing throughout our topological 
primitive ‘C’ with the following impure connection predicate: 


(165) $RCxy =_{df} Cxy \wedge (Rx \leftrightarrow Ry)$ Restricted Connection


It is, of course, this notion of connection that we had in mind in setting up 
KGEMT and its variants. RC never cuts across levels: in order for two 
entities to be RC-related, both of them or neither of them must be regions of 
space. But then we have come full circle, for the RC-variant of (L.19)
is clearly false.
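
The failure of the RC-variant of (L.19) can be checked on a two-element model. The
Python sketch below is a hypothetical illustration: the entities `me` and `r_me`
and the stipulated connection fact are assumptions chosen so that (L.19) itself
holds, which lets the code isolate the effect of the restriction in (165).

```python
from typing import Dict, FrozenSet, List, Set

# Hypothetical model: one material object and the region where it is located.
REGIONS: Set[str] = {"r_me"}
LOCATED_AT: Dict[str, str] = {"me": "r_me", "r_me": "r_me"}
# Stipulated so that (L.19) holds: the object is connected to its place.
CONNECTED_PAIRS: Set[FrozenSet[str]] = {frozenset({"me", "r_me"})}
DOM: List[str] = list(LOCATED_AT)


def R(x: str) -> bool:
    """Region predicate."""
    return x in REGIONS


def L(x: str, y: str) -> bool:
    """Exact location, read off the table."""
    return LOCATED_AT.get(x) == y


def C(x: str, y: str) -> bool:
    """Connection: reflexive and symmetric over the stipulated pairs."""
    return x == y or frozenset({x, y}) in CONNECTED_PAIRS


def RC(x: str, y: str) -> bool:
    """Restricted connection (165): connected, and both or neither are regions."""
    return C(x, y) and (R(x) == R(y))


# (L.19): whatever is located somewhere is connected to its location.
l19_holds = all(C(x, y) for x in DOM for y in DOM if L(x, y))
# RC-variant of (L.19): the same claim with RC in place of C.
rc_l19_holds = all(RC(x, y) for x in DOM for y in DOM if L(x, y))
```

In this model `l19_holds` is true while `rc_l19_holds` is false. The only
counterexample is the pair (`me`, `r_me`), which cuts across levels in exactly
the way RC forbids.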

We face, here, the fundamental limit of topology—and mereotopology—as 
a general theory of space. One way or the other, the analysis situs cannot do 
proper justice to the fact that objects are situated, which is why the theory 
of location is independently needed. One way or the other, spatial reasoning 
must come to terms with the fundamental metaphysical mystery on which it 
depends—embedding in space. 
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grey-level, 872 
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logics, 920 
necessity modality, 925 
Essentialism 
mereotopological, 1013 
Essentially algebraic, 450 
Essentially propositional, 471 
Etale, 479 
Euclid, 2, 91 
fifth postulate, 344, 393, 397 
Euclidean 
dilation, 641 
field, 351 
geometry, 4, 101, 150, 629, 668, 680 
hierarchy, 255 
isometry, 640 
plane, 102, 120 
Euler 
formula, 47 
Event horizon, 664, 687, 691, 695, 698, 700-701, 
703 
Event, 611, 622, 624, 627, 679 
Exchange property, 360 
Exhaustive split, 168 
Existential 
commitment, 102 
rule, 447 
Explicit definition, 650 
Exponential, 465 
Expressive power, 150, 531 
Expressiveness 
first-order, 829 
on finite relations over the reals, 829 
on infinite relations, 829 
Expressivity and ontology, 1001 
Extension 
proper, 952 
interior, 982 
tangential, 982 
simple functional, 484 
Extensionality 
axiom, 83, 119-121, 124, 129, 135, 151 
mereological, 1020, 1022, 957-959, 967-968, 
972, 985 
and atomism, 962 
and supplementation, 957 
and vagueness, 972, 975 
topological, 980, 985, 989 
Extensive connection, 22, 91 
Extent, 440 
Exterior 
mereotopological, 990 
Extremally disconnected, 233 
Face, 750, 755 
Fano 
axiom, 385 
plane, 364 
Fibre, 490 
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Fibred 
locale, 490 
product, 463, 475 
Field, 351, 619-620, 629, 638, 667, 672, 680 
real numbers, 805 
Euclidean, 619 
finite, 620 
linearly ordered, 618-619 
ordered, 619, 676 
quadratic, 640-641, 643-647, 650-651, 664, 
681-682 
quadratic, 619, 626, 629, 639 
real-closed, 647, 682 
Filter 
completely prime, 438 
Filtered colimit, 464, 470 
Filtering functor, 480 
Finite model property, 235 
Finite structures over the reals, 830 
Finite-cofinite algebra, 124 
Finitely decomposable 
mereotopology, 20 
structure, 39 
Finiteness 
Kuratowski, 454 
First response, 196 
First-order logic, 1, 5—6, 8, 21, 522, 608, 639, 
646-647, 649-651, 658, 660, 662, 676-677, 
683, 690 
over the reals, 805 
with linear constraints, 805, 809, 811, 824 
with operators, 842 
with polynomial constraints, 805, 807-808, 825 
with topological operators, 851-852 
with transitive closure, 842 
with while-loop, 841, 849, 
First-order temporal logic, 557 
First-order theory, 22, 171 
Five-segment axiom, 397 
Flat functor, 480 
continuous, 479 
Flat oriented matroid, 783 
Flat zone, 871 
Flat, 696 
Formal concept analysis, 897 
Formal semantics of relations, 169 
Formal topology, 438, 491 
Formula 
coherent, 436 
first-order, 805 
geometric, 436, 446 
in context, 446 
order-generic, 830 
quantifier-free, 805-806, 810, 813 
equivalence, 824 
prenex formal form, 826 
Four-dimensionalism, 1014, 1022, 958 
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Frame of reference, 173 
Frame, 108, 234, 305, 432, 436, 485, 490 
distributivity, 432, 437 
Free 
algebra, 466 
model, 451, 454, 461-463, 466 
Frobenius rule, 447, 461 
Full infinite binary tree, 242 
Full infinite quaternary tree, 235 
Function, 445 
Functor 
Cartesian, 450 
Fusion, 235, 954, 965, 967 
and vagueness, 969 
implausible, 969 
infinitary, 966 
over and above the parts, 970 
unique, 967 
unrestricted, 967, 969 
vs. mereological fusion, 999 
vs. set abstraction, 968 
vs. upper bound, 965 
Galaxy, 659 
Gale order, 791 
Galois 
connection, 884 
on sets, 885 
map, 884, 
Game of Life, 548 
General Principle of Relativity, 664 
Generalized 
element, 459 
model, 434 
point, 439, 469 
space, 429, 487 
Generic 
collapse, 829-830 
element, 459 
model, 434, 468 
point, 439 
Genus (topological), 1007 
Geodesic, 662, 668-669, 674, 680, 683, 689-690, 
703, 705 
photonlike, 671, 674, 677, 679, 682, 684, 690, 
697 
timelike, 670-671, 674, 677, 679, 682, 684, 690, 
692, 702 
Geodesical reconstruction, 865 
Geographic information system, 512, 810-811 
Geometric 
attribute, 801 
category, 461 
formula, 436, 446 
morphism, 462, 469 
theory, 436, 446, 484 
type theory, 452-453, 461, 484 
type, 463 
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Geometrization, 608, 638 
Geometry of Solids, 3—4, 6, 92, 102 
Geometry, 657, 664, 682 
absolute, 345 
affine, 228, 344 
elementary, 345 
Euclidean, 4, 101, 150, 629, 668, 680 
general affine, 391 
hyperbolic, 344 
Minkowski, 638, 648, 650 
o-minimal, 812 
observer-independent, 648 
of Solids, 102 
projective, 345, 765 
real algebraic, 812 
Get-Refinements algorithm, 187 
Giraud’s theorem, 461, 485 
GIS, 512, 810-811 
Global 
element, 459 
point, 440, 467 
Glue transformation, 833-836 
Grain, 864 
Granularity, 165 
mereological, 963 
Granulometry, 896 
Graph 
convexity, 726 
neighbourhood, 43 
Dacey, 734 
Gravity, 608, 660-664, 668, 674, 686, 690, 692, 700 
Greedy algorithm, 791 
Grey-level images, 867 
ordering, 867 
supremum and infimum, 867 
Grothendieck 
topology, 479 
topos, 460 
Group, 350, 643 
Gunk 
atomless, 961 
H-open, 263 
Half-space, 20 
Hall Marshall Jr. 
planar ternary ring, 370 
Hausdorff space, 233 
Heine-Borel theorem, 443, 445 
Helly graph, 721 
Hereditarily irresolvable, 253 
Heuristics, 180 
backtracking, 168, 194 
Heyting algebra, 117, 432, 740 
pseudo-complement, 117 
Hilbert 
10th problem, 526, 832 
axiomatization, 2 
axioms for incidence, 359 
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coordinatization, 651 

system, 59 
Hit-or-miss transform, 861 
Hole, 1003, 1006, 1021, 985, 992 

and ontology, 1010 

as a negative object, 1010 

internal, 1008 

knotted, 1009 

perforating, 1008 
Homogeneous 

mereotopology, 56 
Homogeneously embedded, 56 
Homomorphism 

frame, 432, 490 

model, 448-449 
Homothety, 379 
Horizontal topology, 263 
Horizontally open, 263 
HORNSAT, 179 
HV-open, 263 
Hyper-rectangular convex, 254 
Hyperplane 

arrangement, 774 

axis-oriented, 58 

rational, 21 
Hyperspace, 490 
Idempotence, 863 
Identity, 878, 953, 958 

contingent, 958 

diachronic, 958 

refinement, 190 

relation, 110, 164 

rule, 436 

through change, 958 

transitivity of, 1022 
Image, 458, 460, 463 
Imaginary numbers, 619 
Implication, 459 
Implicit definition, 649-650 
Implicit knowledge, 273 
Incidence, 349, 358 

basis, 359 

frame, 358, 413 

affine, 414 
projective, 414 

geometry, 359 
Indefinite knowledge, 165 
Independent, 639, 645 

lines, 357 

points, 360 

linearly, 667 
Indiscernibility, 719, 954 
Induced subgraph, 719 
Induction algebra, 452 
Inertial, 612 

body, 620 

motion, 609 
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reference frame, 609 
Infinitely chequered subsets of IR°° 
logic of, 255 
Initial model, 450, 461 
Instance 
database, 807, 810 
relation, 810 
Interaction of relations, 198 
Interchange law, 463 
Interdefinability, 102 
Interface (topological), 1011 
Interior, 232—233, 301, 724 
algebra, 108 
image, 233 
mereotopological, 990 
operator, 108, 113, 116 
point, 232 
vs. mereotopological interior, 1000 
Interpolation 
axiom, 83, 124, 130, 133-134 
Intersection, 164, 349, 359 
Interval algebra, 166 
Intractability, 178 
Inverse, 619 
image, 457, 462 
Involuted monoid, 113 
Involution, 758, 769 
Irreflexive fragment, 258 
Irresolvable, 253 
Isomorphism, 5, 643, 649-651, 653, 674-676, 681, 
684-685, 693, 879 
between linear spaces, 361 
dual, 879 
Isotopy, 832 
Isotropic, 609, 621, 660 
JEPD relations, 165 
Join 
directed, 439, 464 
Jointly exhaustive relations, 165 
Jordan arc, 35 
Jordan curve, 35, 136 
theorem, 35 
converse of, 36 
KD45, 329 
Kennison’s theorem, 451 
Khalimsky 
line, 256 
plane, 256 
Kinematics, 992 
Newtonian, 608, 610 
special relativistic, 608, 610, 612 
Knowledge Representation, 102, 119, 134 
Kripke 
frame, 108, 400, 570 
logic of a class of, 401 
model, 401 
semantics, 143, 305 
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Kruskal’s tree theorem, 556 
Kuratowski 
axioms, 507 
finiteness, 454 
theorem, 46 
Lambek calculus with permutation, 287 
Lattice 
inversion, 880 
Law of Nature, 663-664 
Left exact 
sketch, 450 
theory, 450 
Leibniz’s law, 958-959 
Leibniz’s principle, 646 
Length, 746 
Length-contraction, 632, 634-637, 640 
Liar paradox, 661, 701 
Lifeline, 611, 620 
Light Axiom, 609, 612, 616, 621, 623, 642, 645, 
660, 663, 680 
Light signals, 608 
Light-cone, 642, 652-654, 668, 688, 691, 695, 
698-699, 703 
Lightlike separated, 653 
Limit point, 232 
Lindenbaum algebra, 143—144, 433, 439, 451, 461, 
466 
Line of slopes, 369 
Line-element, 673, 684, 686, 695, 697-698, 702 
Linear mapping, 639 
Linear Time Temporal Logic, 527, 566, 577 
Linear 
algebra, 285 
space, 359-360 
planar, 362 
transformation, 361 
variety, 360 
Lines, 349, 358 
List type, 454 
Local 
conical structure, 819 
frame, 665, 667-671 
homeomorphism, 475-476, 478-479 
inertial frame, 683 
section, 476 
Locale, 438, 471, 486 
empty, 440 
one-point, 440 
Localic 
reflection, 487 
topos, 471 
Locally Special Relativity Principle, 663—665 
Location, 1012, 946 
w-, 1018 
as connection of a kind, 1028 
Location 
Axioms for 
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Bottom Mirroring (L.13), 1019 
Conditional Emptiness (L.5), 1016 
Conditional Fullness (L.17), 1021 
Conditional Reflexivity (L.4), 1015 
Conditional Spatiality (L.15), 1019 
Cumulativity (L.8), 1017 
Dissectiveness (L.7), 1017 
Exactness (L.2), 1015 
Exclusiveness (L.16), 1021 
Fullness (L.6), 1016 
Functionality (L.1), 1014 
R-Atomicity (L.12), 1017 
R-Atomlessness (L.10), 1017 
R-Boundarylessness (L.11), 1017 
R-Density (L.9), 1017 
Spatial Connection (L.19), 1027 
Spatial Disjointness (L.18), 1027 
Spatiality (L.3), 1015 
Top Mirroring (L.14), 1019 
basic principles, 1014 
entire, 1013 
interior, 1018 
tangential, 1018 
exact, 1013 
frame of reference, 1025 
generalized, 1023 
generic, 1013 
implies connection, 1028 
implies disjointness, 1027 
mirroring principles, 1018 
multiple, 1015 
object-centered, 1025 
observer-centered, 1025 
pervasive, 1013 
proper, 1016 
relative, 1024-1025 
theories of, 1012 
ubiquitous, 1013 
interior, 1018 
tangential, 1018 
varieties of, 1012, 1023 
vs. connection, 1027 
vs. parthood, 1027 
whole, 1023 
Loop, 350 
Lorentz boost, 639 
Lorentz transformation, 639-641 
Lossy channel system, 555 
Lower powerlocale, 491 
Lowenheim-Skolem theorem, 89 
Majority, 829 
Malament-Hogarth computer, 703 
Malament-Hogarth event, 700-701 
Manifold, 69, 665, 704 
Lorentz, 676, 681—682 
Many-sorted logic, 134 
Many-sorted, 649 
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Map, 435, 439, 462 

Matching, 482 

Mathematical morphology, 8, 10, 285, 857 
adjacency, 905 
binary, 860 
computational complexity, 875 
distances, 904 
flat, 871 
fuzzy adjacency, 915 
fuzzy directional position, 916 
fuzzy distances, 913 
fuzzy sets, 906 
grey-level, 874 
logics, 920 
modal logics, 923 

Matrix, 673-674, 684, 700 

Matroid, 762 

Matter vs. void, 1011 

Maximal tractable subset, 184 

Maximally consistent, 239 

Meagre space, 361 

Mereology, 18, 101-102, 119, 946-947 
and nominalism, 949 
and set theory, 949, 952 
and spatial reasoning, 946 
as a Boolean algebra, 968 
as a calculus of individuals, 949 
as formal ontology, 949 
Atomistic Mereology (AX), 962 
atomistic variant of, 962 
Atomless Mereology (4X), 962 
atomless variant of, 962 
Basic (M), 954 
basic principles, 950 
composition principles, 954, 964 
decomposition principles, 954 
Extensional Mereology (EM), 957 
extensional, 957 
General Extensional Mereology (GEM), 967 
history of, 947 
Minimal Mereology (M M), 956 
non-well-founded, 952 
ontological innocence of, 949 
rough, 972 

Mereotopology, 18, 170, 980 
and dimensionality of space, 1005 
and Euclidean space, 1017 
and spatial embedding, 1030 
and void, 1011 
Basic (T), 980 
basic principles, 978 
boundary-free, 956, 984, 996 
Boundaryless Mereotopology (BX), 998 
boundaryless variant of, 998 
bridging principles, 980 
expressivity of, 1001 
finitely decomposable, 20 
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General Extensional Mereotopology (GE MT), 
989 

Kuratowski extension of, 990 

Kuratowski General Extensional Mereotopology 
(KGEMT), 990 

layered, 1006 

Minimal Mereotopology (MT), 982 

Reductive Mereotopology (RMT), 984 

vs. kinematics, 992 


Merging of objects, 991-992 

Metric, 283, 506 

Metric-geometry, 9, 648, 651-652, 657, 681 
Metric space, 506 

Metric temporal model, 544 

Metric-tensor, 672, 674-676, 681-683, 686, 688, 
694-695, 700 


Michelson-Morley experiment, 609, 663 
Minimal labels problem, 169 
Minkowski 


addition, 286, 860 

circle, 655-656, 685 
geometry, 681 

metric, 650 

operations, 8 

orthogonal, 651, 655, 657 
scalar-product, 656, 672 
space, 665-666 
subtraction, 860 


Mitchell-Benabou language, 466 
Modal logic, 1, 8, 142, 150 


axiomatic definition, 401 

axiomatization, 402 

bi-modal spatial logic, 145 

completeness for Alexandroff spaces, 511, 590 

completeness for Cantor Space, 604 

completeness for finite spaces, 511, 590 

completeness for the class of homeomorphisms, 
586 

completeness for the reals, 586 

completeness theorem, 402 

finite model property, 590 

for incidence geometries, 415 

for projective geometries, 416 

of collinearity, 406 

of elsewhere, 405 

of everywhere, 405 

of orthogonality, 408 

of parallelism and intersection, 410 

of parallelism, 407 

of qualitative distance, 406 

S4, 144, 510 

multi-dimensional, 150 

normal, 142, 144 


Modal 


algebra, 108-109, 143 
definability, 401 
encoding, 144, 150 
operator, 4, 108 
Model, 99, 234, 434, 448 
free, 451, 454, 461-463, 466 
generalized, 434 
generic, 434, 468 
initial, 450, 461 
standard, 434, 437, 440 
Monodic fragment, 557 
Moore family, 880 
dual, 881 
Morphism 
theory, 446 
Morphological filter, 891 
by composition of opening and closing, 864 
Motion, 608, 611-612, 617, 623, 635, 664, 670 
accelerated, 620 
faster-than-light, 635, 659-660 
non-uniform, 620 
slower-than-light, 612 
uniform, 620 
Multi-aspect calculi, 197 
Multi-location, 1015 
Multi-piece region, 151 
Multiplication in affine plane, 371 
Multiplicative linear logic, 287 
N-chequered subsets of ℝⁿ 
logic of, 254 
N-chequered, 254 
N-true, 254 
N-valid, 254 
Natural language, 102-103 
Natural numbers, 452, 454 
object, 466 
Natural transformation, 463 
Necessity operator, 108, 509 
Negation, 459 
Negative direction, 204 
Neighbourhood, 719 
function, 888 
models, 223 
Newtonian kinematics, 638 
Newtonian mechanics, 659 
Newtonian worldview, 609, 663 
Nihilism 
mereological, 971 
Non-deterministic algorithm, 178 
Non-tangential proper part axiom, 135 
NOT-ALL-THREE-SAT, 179 
NP 
complexity class, 178 
NP-complete, 179 
NP-hard, 179 
Null entity, 952, 966 
Null region, 119, 121, 135-136, 141 
Number theory, 832-833 
O-notation, 178 
Object classifier, 467, 474, 487 
Observation oriented, 641 
Observational, 646, 658, 690, 700 
Observer, 610, 618, 638, 651, 662, 667, 676 
accelerated, 646, 661-664, 669, 684, 686, 692 
uniformly, 684-685 
inertial, 609, 646, 661-663, 668, 670, 676, 682, 
684 
suspended, 663, 690-691, 695, 697, 702 
Observer-independent, 638, 648, 651-652, 690 
Occam’s razor, 682 
Omega-categoricity, 88 
Omega-rule, 59-60 
Omitting Types Theorem, 62 
ONE-IN-THREE-SAT, 179 
Ontological dependence, 947, 995 
Ontology, 101, 947 
Open entity, 991 
vs. Open*, 1000 
Open, 233, 439, 487 
cover, 232 
image, 233 
map, 475 
neighbourhood, 113, 231 
set, 113, 231, 301, 507 
Open/closed distinction, 990-991 
and figure/ground opposition, 993 
vs. open*/closed* distinction, 1000 
Opening, 286 
activity ordering, 895 
algebraic, 881 
area, 866 
binary, 861 
by reconstruction, 865 
flat, 870 
grey-level, 874 
structural, 882 
topological, 882 
Operator, 878 
modal, 4 
topological, 4, 510 
activity ordering, 894 
anti-extensive, 879 
composition, 878 
decreasing, 878 
extensive, 879 
flat, 868 
idempotent, 879 
increasing, 878 
infimum, 878 
invariance domain of, 879 
ordering, 878 
overpotent, 891 
range of, 879 
supremum, 878 
translation-invariant, 879 
underpotent, 891 
Orbit, 546 
ORD-Horn, 190 
Order, 386 

collinearity geometry, 388 
Ordinate, 369 
Orientation, 173 
Origin, 368 
Orthoclosed, 729 
Orthocomplement, 729 
Orthogonal, 778 
Orthogonality, 349, 355 

2-dimensional model, 356 

frame, 355 

planar, 355-356 

real frame, 356 

space, 720 

n-dimensional model, 357 
Ortholattice, 729 
Orthomodular, 734 
Orthoposet, 733 
Over-constrained instances, 181 
Overfilter, 891 
Overlap 

mereological, 952 

necessary for fusion, 969 
P 

complexity class, 178 
Padoa’s method, 352 
Pairs of relations, 199 
Pairwise disjoint relations, 165 
Pappian plane, 376 
Pappus property, 406 

first, 375 

second, 375 
Paradigmatic effects, 624, 631, 641-642, 645, 

659-660 

Parallelism, 352, 355 

class, 353 

frame, 353 

model, 353 

postulate, 344 

pre-model, 353 

quasi, 361 

real model, 354 

strict, 349, 353, 361 

weak, 349, 353, 362 
Parallelogram relation, 374 
Parity, 829 
Part relation, 120 
Part 

o-, 951 

arbitrarily demarcated, 948 

cognitively salient, 948 

direct, 951 

distinguished, 951 

extended, 948 

functional, 951 

gerrymandered, 948 
homogeneous, 948 
immaterial, 948 
material, 948 
proper, 950, 982 
interior, 1005, 1029, 982 
tangential, 1005, 982 
self-connected, 948 
spatial, 948 
spatio-temporal, 958 
temporal, 1022, 948 
undetached, 1022, 948 
unextended, 948 
vs. component, 948 
Part-of 
definability of, 27 
relation, 948 
Part-whole relation, 947, 975 
Parthood relation, 170 
Parthood, 948 
and existence, 969 
and identity, 970 
and location theory, 951 
and vagueness, 971 
and ‘part’, 948 
Antisymmetry (P.3), 952 
antisymmetry of, 950, 952 
as a partial ordering, 950 
Atomicity (P.8), 962 
Atomlessness (P.7), 962 
basic principles, 950 
borderline cases of, 972 
Boundedness (P.10,,), 965 
Complementation (P.6), 960 
composition principles, 954 
conceptually prior to identity, 954 
connected, 983 
decomposition principles, 954 
definability of, 953, 968, 983 
Density (P.9), 964 
Determinacy (P.16), 975 
determinacy of, 975 
extensional, 1020, 1022, 957-959, 962, 967-968, 
972, 985 
functional, 951 
Fusion (P.11,,), 965 
Fusiong (P.11 ya), 965 
Fusiony (P.11 40), 965 
fuzzy, 972 
improper, 950 
indeterminacy of, 971 
Indiscernibility (P.3’), 954 
Infinitary Boundedness (P.12¢), 966 
Infinitary Fusion (P.13¢), 966 
isomorphic to set inclusion, 968 
proper, 950, 952, 955 
Reflexivity (P.1), 952 
reflexivity of, 950, 952 
Restricted Atomicity (P.84), 963 
Restricted Atomlessness (P.74), 963 
Strong Atomicity (P.15), 971 
Strong Company (P.4b), 955 
Strong Supplementation (P.5), 956 
Supplementation (P.4), 955 
time-indexed, 958 
Transitivity (P.2), 952 
transitivity of, 950, 952 
Unique Unrestricted Fusion (P.14), 967 
Unrestricted Fusion (P.13), 967 
varieties of, 948 
vs. conceptual inclusion, 948 
vs. enclosure, 982, 999 
vs. material constitution, 948, 951, 985 
vs. mixture composition, 948 
vs. part-whole relation, 947, 975, 979 
Weak Company (P.4a), 955 
Partial order, 876 
Partition, 39 
connected, 39 
graph, 48 
manual, 731 
radial, 45 
ch, 42 
Pasch, 345 
axiom, 229, 388-389, 391, 397 
Pasting, 476, 482 
Path-connectedness, 988 
Path-consistency, 165 
algorithm, 166 
method, 165 
operation, 165 
refinements, 186 
Path-consistent, 165 
Peirce’s puzzle, 993 
Pencil, 355 
Penrose diagram, 697-698, 702 
Percentiles, 192 
Perception, 101 
Perpendicularity, 349, 355 
Persistent, 310 
Phase portrait, 547 
Phase-transition, 181 
Phenomenology, 101 
Photon, 613, 618, 621, 638, 651, 662, 668, 671, 677 
Pieri, Mario, 345, 395 
relation, 394 
PL-complex, 76 
Place, 1013, 1025 
absolute vs. relative, 1026 
Plane graph 
geometrical dual, 46 
piecewise linear, 46 
rational piecewise linear, 46 
semi-algebraic, 46 
Plane, 102, 360 
Plotkin 
powerdomain, 490 
Poincaré Recurrence Theorem, 567, 573 
Point structure, 821, 837 
Point, 101-103, 106, 467 
generalized, 439, 469 
generic, 439 
global, 440, 467 
regular, 821 
singular, 821 
Point-based representation, 101-102 
Pointless topology, 116 
Points, 170, 348, 358 
Polarity, 885 
constraints, 182 
Polygon, 20 
Polygonal regions, 102 
Polyhedral ball 
definability of, 74-75 
Polyhedron, 20 
Polynomial, 805 
linear, 813 
multivariate, 813 
Polytope, 20, 781 
basic, 20 
Pore, 864 
Poset 
directed complete, 439 
Positional information, 177 
Positive 
direction, 204 
logic, 436 
Possibility operator, 108, 142, 509 
Possible world, 143, 400 
Post’s correspondence problem, 552 
Power lattice, 877 
Powerdomain, 490 
Powerlocale, 490 
Powerobject, 465 
Powerset 
finite, 454 
Practical efficiency, 190 
Preconvex relations, 174 
Predicate, 445 
Predicative, 491 
Preorder, 305 
Preposition 
spatial, 1025 
Presheaf, 476, 480 
Pretopos 
co-, 460-461 
Prime ideal theorem, 84 
Prime model, 86 
Primitive, 101, 119, 135 
Primitive recursive time, 556 
Principle of equivalence, 663-664 
Principle of relativity, 609, 661, 663 
Product, 135 
frame, 236 
graphs, 718 
mereological, 967-968 
of lines, 358 
of modal logics, 558 
Projective 
3D-space, 363 
extension, 366 
geometry, 345, 765 
plane, 1, 363 
space, 362 
validity, 363 
Proper, 241 
Propositional, 445 
encoding, 142 
satisfiability problem, 178 
Proximity 
connection algebra, 132 
standard, 133 
relation, 118 
space, 117-118, 127, 129-130, 132, 151 
separated, 118, 133 
standard, 118 
Pseudo-complement, 15, 33, 738 
Pseudo-pullback, 469 
Pythagorean field, 384 
Q-cell, 76 
face vertex of, 79 
partition, 79 
QEPCAD, 826 
Qua object, 956 
Quadrangle, 385 
Qualitative Spatial Reasoning, 7, 119, 121, 134, 512 
Qualitative 
distance, 176 
size relations, 199 
Quantale, 753, 758 
Quantification, 150 
Quantifier elimination, 806 
by cylindrical algebraic decomposition, 826 
first-order logic with linear constraints, 824 
Fourier-Motzkin algorithm, 824 
Tarski’s algorithm, 825 
first-order logic with polynomial constraints, 825 
Quantifier-free, 103 
Quantity, 618, 638 
Quantum 
gravity, 613 
logic, 735 
mechanics, 613 
Quasi-Boolean structure, 135 
Quasi-equation, 752 
Quasi-equational, 450 
Quasi-parallel, 361 
Quasi-tree, 248 
Query language, 806-808, 811 
closed, 809 
closure property, 823 
computationally complete, 849, 851 
expressiveness, 829 
first-order, 800, 804 
Query, 807 
composition, 823 
consistent, 808 
constraint, 808 
emptiness test, 823 
expressibility, 809 
membership test, 823 
plug-in evaluation, 822 
topological, 809, 832, 837, 851-852 
unrestricted, 808 
Quotient 
simple geometric, 484 
Randomly generated instances, 180, 191 
Rank of space, 363 
Rasiowa-Sikorski system, 147 
Raster model, 810 
Real field, 384 
Real line, 442 
localic, 443 
Real numbers, 610, 618-619, 629 
Real root counting, 826 
Real-closed field, 351, 384 
Receptacle, 997 
Rectangle algebra, 175 
Reduct, 449, 460, 469, 471, 649 
Reduction, 179 
by refinement, 186 
polynomial, 179 
Reference frame, 618 
inertial, 609 
Reference system, 609 
Refinement, 166, 186 
basic matrix, 187 
matrix, 187 
strategy, 189 
Reflection 
localic, 487 
Reflects Alexandroff extensions, 252 
Reflexive closure, 258 
Region Connection Calculus, 69, 513, 1001, 1004, 
1008, 102, 120-122, 124, 134, 139, 141, 
146, 149, 170, 715, 737, 983, 999 
algebra, 124, 130 
RCC-8 relations, 121-122, 139, 141, 146, 170, 
512 
Region, 3, 512, 1012, 101-102, 106, 729, 752, 945, 
952, 956-957 
as a set of points, 968 
extended, 956 
extensional, 957 
spatial, 1012, 945, 952, 956-957, 968 
unoccupied, 1016 
vs. concrete spatial entity, 1012, 1019, 1021, 
945, 957, 978, 986 
geographic, 136 
Region-based, 102 
representation, 101-103, 140 
Regionhood, 1016 
Regular 
category, 461 
closed, 115, 119, 125, 130, 136, 140 
algebra, 115, 129 
set, 512 
closure, 730 
set, 512 
open, 14, 115, 119, 125, 140, 727 
algebra, 115 
set, 115 
Relation algebra, 103-104, 110, 112, 121, 149, 164, 
654-655 
operations on relations, 164 
Relation, 103, 799, 807 
attribute, 799 
constraint, 807 
database, 810 
equivalent, 807 
Euclidean, 331 
finite, 804 
finitely representable, 804 
linear, 810 
polynomial constraint, 807 
semantics of a, 807 
serial, 331 
Relational 
composition, 110 
converse, 110 
semantics, 305 
Relationism, 1012, 1026, 945 
Relative interior, 750 
Relative, 623, 638 
distance, 176 
Relativity theory, 9, 607, 629, 641, 649-650, 
657-659 
general, 9, 607 
special, 9, 607 
general, 618, 620, 646-647, 657, 660-664, 
676-677, 680, 683, 704, 706 
special, 619, 630, 646-647, 657-658, 661-665, 
670, 680, 705-706 
Remainder 
mereological, 1007, 1022, 955-956 
Representation theorem, 109, 127, 129-130, 149 
Residuation law, 287 
Resolvable, 253 
Respects components, 18, 20, 28 
Restriction, 476 
Retract, 719 
Reverse mathematics, 608, 645, 659 
Reverse operator, 206 
Reverse relativity theory, 608, 645, 659 
Rich language, 394 
Riemann-tensor, 678 
Ring, 350 
Root, 241, 257 
Rooted, 235, 241, 257 
Rotation, 380 
Rough sets, 898 
adjunction, 900 
duality, 901 
fuzzy, 912 
S-true, 254 
S-valid, 254 
S40, 568 
S4C, 567 
canonical dynamic Kripke model, 593 
completeness for Kripke models, 593 
completeness in Cantor Space, 569, 592, 599 
completeness, 592, 594 
finite model property, 592, 594 
incompleteness in the Reals, 569, 592 
the bimodal dynamic topological logic of 
continuous functions, 592 
Sahlqvist theorem, 402 
SAT, 178 
Satisfaction, 805 
Satisfiability, 178, 505 
relation, 401 
Satisfy, 459 
Scattered, 253 
Scenario, 167 
Schema 
database, 799, 807, 810-811 
Schönflies theorem, 35, 71 
polyhedral, 71 
Scott 
continuity, 464 
Search space 
size, 168 
Second countable 
topological space, 89 
Section 
local, 476 
Selective unravelling, 242 
Selective, 242 
Self-connected, 151 
Self-connectedness, 986, 988 
Semantical, 649 
consequence, 623, 647 
Semantics 
non-terminating, 843 
of a linear tuple, 810 
of a program, 850 
of a relation, 807 
of a transitive closure formula, 843, 848 
terminating, 843 
Semi-algebraic function, 815-816 
Semi-algebraic set, 19, 803, 806-807, 812-816, 
818-819, 822, 837, 846, 852 
approximation of a, 831, 847 
decomposition, 815 
linear approximation, 831 
plane graph, 51 
Semi-linear set, 803, 806, 810, 812-813, 818, 841, 
845, 849 
Semi-regularisation, 115 
Semilattice, 454 
Sense data, 101 
Sentence, 805 
Separable, 233 
Separated 
lightlike, 653 
spacelike, 653 
timelike, 653 
Separating set, 461 
Separation condition, 113, 130 
T₁, 114, 129, 134 
T₀, 114 
T₃, 115 
T₄, 115 
T;-, 27 
Separation number, 1006 
Sequent, 436, 447 
Serial subsets of ℝ 
logic of, 254 
Serial, 227, 254, 353 
Set of constraints, 164 
Set theory 
fuzzy, 972 
non-well-founded, 951 
Shadow, 1020 
Sheaf, 462, 468, 474, 476-477, 482 
Sheavification, 483 
Shellable, 45 
Shorter-than relation, 394 
Sieve, 479 
Signature, 433, 445 
Similarly situated, 38 
Simple functional extension, 484 
Simple geometric quotient, 484 
Simple, 241 
extended (mereological), 1021 
mereological, 961 
Simplex, 822 
Simultaneity, 613, 625-626, 642 
Simultaneous, 613, 622, 625-626, 629, 632 
Single-aspect calculi, 197 
Singularity, 687, 691-692, 696-698, 701, 703 
Site, 474, 479, 482 
Skeleton, 248 
Skew 
field, 351 
lines, 349, 381 
Skin (topological), 1011 
Slice, 478 
Slope of line, 369 
Slope, 611, 621, 625, 643, 668, 684, 692, 697, 699 
Smooth, 662, 666-668, 674-678, 681, 700 
Sober space, 442, 487 
Social geometry, 10 
Sort, 445, 618, 649, 651 
Space 
as a fiction, 945 
atomicity of, 1017 
atomlessness of, 1017 
Bolzanian, 996 
boundarylessness of, 1017 
cumulativity of, 1017 
density of, 1017 
dimensionality of, 1006 
dissectiveness of, 1017 
Euclidean, 508, 1017, 991 
fullness of, 1016, 1021 
Leibnizian conception of, 1012, 945 
metric, 506 
modalities, 403 
Newtonian conception of, 1012, 945, 996 
perceptually remote, 945 
relationist conception of, 1012, 1026, 945 
substantivalist conception of, 1012, 1026, 945, 
996 
Whiteheadian conception of, 956, 963, 984, 997 
Space-dilation, 641 
Space-isometry, 640, 646, 675 
Space-time location, 624 
Space-time, 9, 608, 617, 638, 642, 665 
location, 619-620, 624 
anti de Sitter, 703 
de Sitter, 703, 705 
Eddington-Finkelstein, 695-696 
general relativistic, 660-667, 670-672, 674, 676, 
686 
Gödel’s rotating universe, 701, 703, 705 
Kerr, 700 
Kruskal-Szekeres, 697 
Minkowski, 683-685, 688, 705 
Schwarzschild, 686-687, 694, 698 
Tipler-Stockum, 701 
vacuum, 705 
Spacelike separated, 653 
Span of lines, 357 
Spatial entities 
points, 162, 173, 502 
regions, 162, 502 
Spatial reasoning 
object-oriented, 946, 957, 986 
space-oriented, 946, 957, 986 
fusion, 917 
mathematical morphology, 903 
model-based recognition, 917 
qualitative, 512, 931 
spatial relations, 903 
Spatial regions, 170, 512 
dimension, 171 
holes, 171 
internal connectedness, 171 
non-emptiness, 171 
regularity, 171 
Spatial, 442 
distance, 621 
preposition, 1025 
Spatio-temporal, 101, 150, 502 
logic, 497 
Special composition question, 965, 969 
Special Principle of Relativity, 663-664 
Specialization, 724 
morphism, 463, 487 
order, 238, 439, 472 
Speed of light, 609, 621, 642, 663, 688, 696 
Speed, 621, 677 
Sphere, 32 
Spherical geometry, 757, 770 
Spherical oriented matroid, 771 
Split set, 168 
Splittable 
mereotopology, 58 
Splitting of objects, 991 
Stage of definition, 439, 459 
Stage, 467 
Stalk, 474, 477 
Standard 
contact, 130 
relation, 125 
model, 434, 437, 440 
product topology, 263 
Ly-theory, 58, 69, 90 
Star calculus, 174 
Stereographic projection, 33 
Stone 
space, 435 
theorem, 106, 127-128 
Straight line, 612, 620, 629, 641, 643, 655, 662, 
668, 680, 684, 692, 697, 699 
String-theory, 618 
Structuring element, 861 
accessibility relation, 924 
Structuring function, 873 
Sturm theorem, 826 
Subalgebra, 105 
Sublocale, 491 
Subobject classifier, 465 
Subset frame, 308 
Subspace, 233 
Substantivalism, 1012, 1026, 945, 996 
Substitution, 458 
rule, 447 
Sum, 718 
axiom, 124 
connected (mereological), 976 
function, 135 
mereological, 954, 968 
Supervaluationism, 974 
Supplement, 740 
Supplementation 
and complementation (mereological), 960 
mereological, 955, 960, 967, 998 
strong (mereological), 956, 966 
Surface, 69, 1001, 747, 956, 977, 992 
Surrounding set, 758 
Swelling, 248 
Symmetric difference, 110 
Synchronism, 613, 616, 632, 637, 640, 642 
Syntactic category, 451, 483 
Syntactical, 649-650 
Tangent line, 840 
Tarski-Seidenberg theorem, 814 
Tarski-Vaught lemma, 57 
Temporal logic, 527 
branching, 528 
first-order, 557 
linear, 527 
Temporal modalities, 527, 566 
henceforth, 527, 566 
next, 527, 566 
Temporal order, 614, 630 
TERA project, 828 
Term, 805, 811 
in context, 446 
Ternary 
operation, 369 
relation, 516 
ring, 370 
attached to plane, 370 
linear, 373 
Test instances, 191 
Test space, 730 
Theory, 351 
ω-categorical, 352 
Cartesian, 449 
coherent type, 454 
coherent, 436, 446 
complete, 351 
decidable, 352 
essentially algebraic, 450 
finitary algebraic, 449 
geometric type, 453, 461 
geometric, 436, 446, 484 
left exact, 450 
morphism, 446 
quasi-equational, 450 
Thought-experiment, 610, 615, 621, 627, 634, 637 
Threshold set, 868 
Tidal force, 692, 695 
Time modalities, 403 
Time travel, 661, 701 
Time 
branching, 528 
inner, 670 
linear, 527 
proper, 656, 670 
wristwatch, 668, 670, 678, 689-691 
Time-axis, 611, 620, 640-641, 673, 687, 690-691 
Time-dilation, 632, 634-635, 637, 640, 690 
Time-orientation, 647, 656, 700 
Timelike, 668 
separated, 653 
curve, 668 
Timewarp, 607 
Tips, 412 
Tolerance space, 719 
Topo-bisimulation, 225 
Topo-definable, 251 
Topo-model, 218 
Topologic, 314, 509 
Topological space, 1, 106, 113, 127, 149, 231, 507, 
565 
completely regular, 114, 134 
Hausdorff, 27, 114 
locally compact, 32 
locally connected, 18 
normal, 27, 114 
regular, 27, 114 
semi-regular, 18, 27, 114 
weakly normal, 114, 134 
weakly regular, 27, 114, 129-130 
connected, 134, 515 
Topological 
bisimulation, 225 
definability, 251 
dynamics, 550, 566 
interior, 507, 565 
model, 218, 302, 510, 569 
product logic, 268 
property, 852 
query, 832, 837, 851 
semantics, 218, 565, 569 
sum, 233 
temporal model, 533 
type, 840 
Topology, 301, 507, 664, 946 
Alexandroff, 306, 508 
and common sense, 991 
and mereology, 975 
and spatial reasoning, 946, 975 
intrinsic, 1010 
limits of, 1010 
Topos theory 
fundamental theorem, 478 
Topos, 9, 445 
elementary, 465 
Grothendieck, 460 
localic, 471, 487 
Touching (topological), 981 
Tractability, 150, 178 
Transformation, 639, 643 
cut, 833-836 
glue, 833-836 
coordinate, 639-640 
Galilean, 640 
Newtonian, 640 
Lorentz, 637, 639-640, 675 
worldview, 639-640, 662 
Transitive closure logic, 841-842, 845 
with start point and parameters, 848 
with stop conditions, 847 
Transitive closure, 842 
Translation, 260, 379 
Trapezium relation, 375 
Tree metric space, 522 
Triangle relation, 349 
Triangulation, 76, 761, 821 
theorem, 821, 847 
Trivial, 232, 818 
semi-algebraically, 818 
Triviality, 818 
theorem, 818-819, 840, 852 
Trivialization, 818 
Trivially flawed instances, 191 
True, 234, 237 
Truth in Kripke models, 401 
Tuple, 799 
linear, 810 
Turing 
barrier, 703 
computable, 849 
machine, 499 
reduction, 169 
Twin paradox, 659-660, 670 
Two-sorted 
Kripke models, 417 
modal language, 416 
modal logic for affine geometries, 420 
modal logic for projective geometries, 418 
Type theory 
coherent, 454 
geometric, 452-453, 461, 463, 484 
Ultra-contact, 126, 131 
Ultrafilter, 84, 105, 126 
Umbra, 868 
Undecidability, 171 
Under-constrained instances, 181 
Underfilter, 891 
Underlap 
mereological, 952 
Uniform finiteness property, 817, 831 
Uniformly finite, 817 
Unit of measurement, 621-622, 653, 705 
Unit 
line, 368 
point, 368 
Unit-vector, 627, 639, 646, 667, 673-674, 694-695 
Universal entity, 952 
Universal 

quantification 

finitely bounded, 454 

region, 135 

relation, 164, 166 
Universalism 

mereological, 971 
Universe 

mereological, 968 
Upper powerlocale, 491 
Upset, 238 
V-open, 263 
Vacuum, 701-703, 705 
Vagueness, 742 

of composition, 969 

of parthood, 971 

semantic vs. ontological, 973 

de re vs. de dicto, 973 
Valid, 234, 237 

topologically, 303 
Validity in Kripke frames, 401 
Valuation, 234, 401 
Variety, 107 

graphs, 722 
Vector 

model, 810 

space, 8, 10, 290, 620 
Vertical topology, 263 

Vertically open, 263 

Vietoris powerlocale, 490 

Visited nodes, 192 

Visual field, 101 

Vocabulary, 618, 648-650, 658-659, 676, 678, 805 

Weak closure system, 726 

Weakly transitive, 257 

Well-powered, 457, 460 

Wheel, 720 

White hole, 697-698 

Whitney’s theorem, 49 

Whole, 947, 975 

scattered, 961, 969, 975, 987 

Why-type question, 659 

Windowing function, 888 

Worldcurve, 678, 690 

Worldline, 611, 620, 646, 662, 668, 670, 677-678, 
680, 682, 690 

Worldview relation, 618, 662 

Worldview transformation, 639-641, 643, 645, 662, 
680, 684 

Worldview, 638-639, 643, 646, 648, 661-663, 677, 
682, 684, 691 

Wormhole, 607, 660-661, 676, 697, 704 

Yoneda embedding, 480 

2SAT, 179 

3SAT, 179 

4-intersection model, 136-137 

9-intersection model, 137, 515 

9-intersection, 172, 515 


